Statistics Tutorial, Part 8.
Causality Testing

There's no actual statistical test that can prove one variable causes another. But a sort of limited causality test is possible with certain assumptions. Economists talk about "Granger causality," proposed by C.W.J. Granger in 1969. You look for an effect from past values of another variable. We assume causality only runs from past to present. So if past values of X help explain Y, then X "Granger-causes" Y.

Sims (1972) invented a test for Granger causality. You regress your Y variable on past values of itself. Then you regress it on past values of itself and past values of X. If significantly more variance is accounted for, Bingo! It doesn't absolutely prove X causes Y, but at least it "Granger-causes" it.

"Granger-causality" assumes the earlier phenomenon causes the later one. It might seem obvious that the one which happens first is the cause, but it isn't always true. The late economist David Tobin gave this example: People who listen to weather reports take umbrellas with them when rain is expected. Nearly always, the people take the umbrellas with them first, and the rain happens later. But the decision to carry the umbrellas doesn't cause the rain. The anticipated rain causes the decision, so here the effect precedes the cause.

But let's ignore that. In most cases, cause comes before effect.

"The Sims partial-F test for Granger causality" is a long name, so I'll just call it the Sims test. First you regress your effect variable on its own autoregression terms--perhaps price inflation on inflation one and two years ago. Then you add lagged values of the presumed cause--say, money supply growth one and two years ago. Data from the two regressions can be combined to yield an F statistic (details in Appendix 6). If the F value exceeds the appropriate critical threshold, we say the added variable "Granger-causes" the independent variable--here, money-supply growth Granger-causes inflation (if, in fact, it does).

The question then arises, how long a lag period is appropriate? You can decide this with "information criteria," such as the Akaike Information Criterion (AIC, and an improved version called "second-order AIC," abbreviated AICc), and the Schwarz Bayesian Information Criterion (SIC or SBC). Appendix 6 shows how to calculate both. For every autoregression, you can calculate AIC and SIC values, and the appropriate lag for the variable of interest is found by selecting the lag period that has the lowest AIC or SIC value. Minimize the information criterion and you've found the appropriate lag period.

The AIC and SIC don't always give the same answers. Choose one, tell people which you're using, and run with it.

I'll give an example without showing the calculations. I have annual money supply data back to 1898 and inflation data back to 1930. For each variable, I calculate autoregressions of varying lag periods from 1 to 10 years and calculate the AIC, AICc and SIC. To make all the autoregressions comparable, I start them all in 1940. Thus the period covered is 1940-2004, for N = 65 points.

Inflation (GDP Price Deflator)				Money Supply Growth (M2)
Lag	AIC	AICc	SIC	Lag	AIC	AICc	SIC
1	94.650	94.713	6.554	1	138.186	138.249	6.334
2	89.636	89.830	5.620	2	135.984	136.177*	6.333*
3	86.927	87.320	5.612	3	137.392	137.786	6.388
4	79.619	80.286	5.533*	4	135.897*	136.564	6.399
5	81.535	82.552	5.596	5	136.742	137.759	6.445
6	77.227	78.675	5.563	6	138.738	140.186	6.510
7	79.220	81.185	5.627	7	140.616	142.581	6.572
8	74.485	77.056	5.588	8	142.532	145.103	6.635
9	76.433	79.705	5.651	9	140.991	144.263	6.645
10	71.057*	75.131*	5.602	10	139.875	143.949	6.661

The lowest value in each column has a star next to it. For inflation, the AIC and AICc recommend a 10-year lag while the SIC recommends 4 years. For money supply growth, the AIC recommends 4 years while the AICc and SIC recommend 2 years.

To keep the results comparable, and because the maximum lag length here is ten years, I did all the Granger tests over the period 1940-2004 (N = 65 years). Here are the results:

Hypothesis	F-Statistic	p-value
dM → dP	F_4,56 = 2.656	0.0422*
dM → dP	F_10,44 = 1.346	0.2371
dP → dM	F_2,60 = 1.364	0.2634
dP → dM	F_4,56 = 1.427	0.2370

In the tests above, only dM causing dP with a four-year lag is significant at the 5% level; the rest are insignificant. Thus we can conclude that money Granger-causes inflation, and not vice versa.

Page created:	04/10/2017
Last modified:	04/12/2017
Author:	BPL

Statistics Tutorial, Part 8.Causality Testing

Statistics Tutorial, Part 8.
Causality Testing