Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the...

15
Supplemental material for A note on spurious regression and random walks with zero, local, or constant drifts Jay Dennis * – Institute for Defense Analyses Kaiji Motegi – Kobe University This draft: July 4, 2020 * Institute for Defense Analyses (IDA). Research results and conclusions expressed are those of the authors and do not necessarily reflect the views of IDA. E-mail: [email protected] Corresponding author. Graduate School of Economics, Kobe University. Address: 2-1 Rokkodai-cho, Nada, Kobe, Hyogo 657-8501 Japan. E-mail: [email protected] 1

Transcript of Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the...

Page 1: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Supplemental material for“A note on spurious regression and random walks

with zero, local, or constant drifts”

Jay Dennis∗– Institute for Defense Analyses

Kaiji Motegi†– Kobe University

This draft: July 4, 2020

∗Institute for Defense Analyses (IDA). Research results and conclusions expressed are those of the authors anddo not necessarily reflect the views of IDA. E-mail: [email protected]

†Corresponding author. Graduate School of Economics, Kobe University. Address: 2-1 Rokkodai-cho, Nada,Kobe, Hyogo 657-8501 Japan. E-mail: [email protected]

1

Page 2: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Contents

1 Time trend in the spurious regression model 3

2 Simulation results 4

List of Figures

1 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X1) . 6

2 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X2) . 7

3 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X3) . 8

4 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X1) . 9

5 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X2) . 10

6 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X3) . 11

7 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X1) . 12

8 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X2) . 13

9 Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X3) . 14

List of Tables

1 Summary of correct scaling factors when time trend t is included (conjecture) . . 15

2

Page 3: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

1 Time trend in the spurious regression model

In the main paper Dennis and Motegi (2020), the regression model is specified as

yt = α + βxt + ut. (1)

Based on (1), the asymptotic properties of the estimated slope parameter β̂ and the squared

t-statistic t̂2β are derived in the main paper. It is of interest to investigate how the asymptotic

properties change when the linear time trend is added to the model:

yt = α + βxt + γ × t+ ut. (2)

A motivation of the inclusion of the time trend t is as follows. Suppose that a data generating

process (DGP) for y is a random walk with drift:

yt = dyn + yt−1 + ϵyt. (3)

Equation (3) can be rewritten as

yt = dyn × t+ y0 +t∑

τ=1

ϵyτ ,

where y0 is a non-stochastic initial value. One might think that the time trend t in model (2) would

(partially) resolve the spurious regression, since it would capture the deterministic trend dyn × t

though not the stochastic trend∑t

τ=1 ϵyτ . It is therefore of interest to study the consequence of

using the extended model (2).

It is beyond the scope of the present note to formally derive the asymptotic properties of β̂ and

t̂2β under model (2). Clearly, required algebra would be more tedious than the present note since

model (2) is more general than model (1). To draw some conjecture on the unknown asymptotic

properties, Monte Carlo simulations are performed in this supplemental material. We generate

J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP:

yt = dyn + yt−1 + ϵyt, xt = dxn + xt−1 + ϵxt.

The drift terms are specified in accordance with the main paper:

Case Y1 (zero drift in y): dyn = 0.

3

Page 4: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Case Y2 (nonzero local drift in y): dyn = n−1/2.

Case Y3 (nonzero constant drift in y): dyn = 1.

Case X1 (zero drift in x): dxn = 0.

Case X2 (nonzero local drift in x): dxn = n−1/2.

Case X3 (nonzero constant drift in x): dxn = 1.

We write Case Y1X2, for example, to express the combination of Cases Y1 and X2. The error

terms are drawn as follows.1 [ϵyt

ϵxt

]i.i.d.∼ N (0, I2).

For each Monte Carlo sample, we fit the extended model (2) and compute β̂ and t̂2β. Then, the

empirical distributions of scaled β̂ and t̂2β are plotted. The scaling factors (i.e., conjecture on

the rate of convergence or divergence) are chosen so that the empirical distributions seem stable

across the sample sizes n ∈ {100, 500, 1000}.

2 Simulation results

See Figures 1-9 for the empirical distributions. Changing the regression model from (1) to (2)

does affect the asymptotic properties of β̂ and t̂2β. For all figures, the empirical distributions of

β̂ itself seem stable across the sample sizes, which suggests that β̂ = Op(1) for all cases. The

spurious regression is still present under the extended model (2) in the sense that β̂ does not

converge in probability to 0. A plausible reason for the spurious regression is that the stochastic

trend∑t

τ=1 ϵyτ in y is left uncaptured under model (2). For all cases, the empirical distribution

of β̂ is unimodal and symmetric around 0.

Scaling factors which stabilize t̂2β vary across cases. Our conjecture is that t̂2β = Op(n2) if x has

a nonzero constant drift (Cases Y1X3, Y2X3, and Y3X3) and t̂2β = Op(n) otherwise (Cases Y1X1,

Y1X2, Y2X1, Y2X2, Y3X1, and Y3X2). In any case, the squared t-statistic diverges and hence

the probability of making Type I Error (i.e., rejecting the correct null hypothesis H0 : β = 0)

approaches 1 asymptotically. This suggests that using model (2) is not a solution to the spurious

regression; indeed, including a time trend appears to even exacerbate the symptoms of spurious

regression in some cases (i.e., Cases Y1X3 and Y2X3) by accelerating the rates of divergence of

1 The i.i.d. normal errors are drawn in order to focus ideas. Assumption 1 in Dennis and Motegi (2020) impliesthat the error sequences need only be α-mixing.

4

Page 5: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

the squared t-statistic. Uncovering the reason for this phenomenon is left for future work. For

all cases, the empirical distribution of scaled t̂2β is a positively skewed distribution whose peak is

located at 0.

The conjecture on the rate of convergence or divergence of β̂ and t̂2β is summarized in Table

1. It is left as a future task to confirm the conjecture and to derive the asymptotic distributions

analytically.

References

Dennis, J., and K. Motegi (2020): “A note on spurious regression and random walks with

zero, local, or constant drifts,” Institute for Defense Analyses and Kobe University.

5

Page 6: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 1: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X1)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 0, dxn = 0, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are mutually

independent. The regression model is yt = α + βxt + γt + ut. β̂ is the least squares estimator for β. t̂β is the

conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions of β̂

and n−1t̂2β .

6

Page 7: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 2: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X2)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 0, dxn = n−1/2, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are

mutually independent. The regression model is yt = α+ βxt + γt+ ut. β̂ is the least squares estimator for β. t̂β

is the conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions

of β̂ and n−1t̂2β .

7

Page 8: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 3: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y1X3)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−2t̂2β (n = 100) (e) n−2t̂2β (n = 500) (f) n−2t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 0, dxn = 1, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are mutually

independent. The regression model is yt = α + βxt + γt + ut. β̂ is the least squares estimator for β. t̂β is the

conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions of β̂

and n−2t̂2β .

8

Page 9: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 4: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X1)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = n−1/2, dxn = 0, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are

mutually independent. The regression model is yt = α+ βxt + γt+ ut. β̂ is the least squares estimator for β. t̂β

is the conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions

of β̂ and n−1t̂2β .

9

Page 10: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 5: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X2)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = n−1/2, dxn = n−1/2, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are

mutually independent. The regression model is yt = α+ βxt + γt+ ut. β̂ is the least squares estimator for β. t̂β

is the conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions

of β̂ and n−1t̂2β .

10

Page 11: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 6: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y2X3)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−2t̂2β (n = 100) (e) n−2t̂2β (n = 500) (f) n−2t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = n−1/2, dxn = 1, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are

mutually independent. The regression model is yt = α+ βxt + γt+ ut. β̂ is the least squares estimator for β. t̂β

is the conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions

of β̂ and n−2t̂2β .

11

Page 12: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 7: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X1)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 1, dxn = 0, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are mutually

independent. The regression model is yt = α + βxt + γt + ut. β̂ is the least squares estimator for β. t̂β is the

conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions of β̂

and n−1t̂2β .

12

Page 13: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 8: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X2)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−1t̂2β (n = 100) (e) n−1t̂2β (n = 500) (f) n−1t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 1, dxn = n−1/2, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are

mutually independent. The regression model is yt = α+ βxt + γt+ ut. β̂ is the least squares estimator for β. t̂β

is the conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions

of β̂ and n−1t̂2β .

13

Page 14: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Figure 9: Empirical distributions of β̂ and t̂2β when time trend t is included (Case Y3X3)

(a) β̂ (n = 100) (b) β̂ (n = 500) (c) β̂ (n = 1000)

(d) n−2t̂2β (n = 100) (e) n−2t̂2β (n = 500) (f) n−2t̂2β (n = 1000)

We generate J = 5000 Monte Carlo samples of size n ∈ {100, 500, 1000} from the DGP yt = dyn + yt−1 + ϵyt and

xt = dxn + xt−1 + ϵxt, where dyn = 1, dxn = 1, ϵyti.i.d.∼ N (0, 1), and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and {ϵxt} are mutually

independent. The regression model is yt = α + βxt + γt + ut. β̂ is the least squares estimator for β. t̂β is the

conventional t-statistic with respect to H0 : β = 0. This figure presents the empirical distribution functions of β̂

and n−2t̂2β .

14

Page 15: Supplemental material for A note on spurious …motegi/WEB_random_walk_spurious_reg...Changing the regression model from (1) to (2) does ff the asymptotic properties of ^ and ^t2

Table 1: Summary of correct scaling factors when time trend t is included (conjecture)

Case X1: Case X2: Case X3:

dxn = 0 dxn = n−1/2dx, dx ̸= 0 dxn = dx ̸= 0

Case Y1: β̂ = Op(1) β̂ = Op(1) β̂ = Op(1)

dyn = 0 t̂2β = Op(n) t̂2β = Op(n) t̂2β = Op(n2)

Case Y2: β̂ = Op(1) β̂ = Op(1) β̂ = Op(1)

dyn = n−1/2dy t̂2β = Op(n) t̂2β = Op(n) t̂2β = Op(n2)

dy ̸= 0

Case Y3: β̂ = Op(1) β̂ = Op(1) β̂ = Op(1)

dyn = dy ̸= 0 t̂2β = Op(n) t̂2β = Op(n) t̂2β = Op(n2)

The DGP is yt = dyn + yt−1 + ϵyt and xt = dxn +xt−1 + ϵxt, where ϵyti.i.d.∼ N (0, 1) and ϵxt

i.i.d.∼ N (0, 1). {ϵyt} and

{ϵxt} are mutually independent. The regression model is yt = α+βxt+γ× t+ut. β̂ is the least squares estimator

for β. t̂β is the conventional t-statistic with respect to H0 : β = 0. This table summarizes a simulation-based

conjecture on the order of stochastic convergence or divergence of β̂ and t̂2β .

15