Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using...

31
Forecasting inflation rates using daily data: A nonparametric MIDAS approach org Breitung * University of Cologne Christoph Roling University of Bonn 18th December 2014 Abstract In this paper a nonparametric approach for estimating mixed-frequency fore- cast equations is proposed. In contrast to the popular MIDAS approach that employs an (exponential) Almon or Beta lag distribution, we adopt a penal- ized least-squares estimator that imposes some degree of smoothness to the lag distribution. This estimator is related to nonparametric estimation proce- dures based on cubic splines and resembles the popular Hodrick-Prescott fil- tering technique for estimating a smooth trend function. Monte Carlo exper- iments suggest that the nonparametric estimator may provide more reliable and flexible approximations to the actual lag distribution than the conven- tional parametric MIDAS approach based on exponential lag polynomials. Parametric and nonparametric methods are applied to assess the predictive power of various daily indicators for forecasting monthly inflation rates. It turns out that the commodity price index is a useful predictor for inflations rates 20–30 days ahead with a hump-shaped lag distribution. Key words Forcasting, mixed frequency, nonparametric regression JEL classification C12, C32 * Corresponding author: Institute of Econometrics, University of Cologne, Albert-Magnus-Platz, 50923 Bonn, Germany. E-mail address: [email protected]. 1

Transcript of Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using...

Page 1: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Forecasting inflation rates using daily data:A nonparametric MIDAS approach

Jorg Breitung∗

University of CologneChristoph RolingUniversity of Bonn

18th December 2014

Abstract

In this paper a nonparametric approach for estimating mixed-frequency fore-cast equations is proposed. In contrast to the popular MIDAS approach thatemploys an (exponential) Almon or Beta lag distribution, we adopt a penal-ized least-squares estimator that imposes some degree of smoothness to thelag distribution. This estimator is related to nonparametric estimation proce-dures based on cubic splines and resembles the popular Hodrick-Prescott fil-tering technique for estimating a smooth trend function. Monte Carlo exper-iments suggest that the nonparametric estimator may provide more reliableand flexible approximations to the actual lag distribution than the conven-tional parametric MIDAS approach based on exponential lag polynomials.Parametric and nonparametric methods are applied to assess the predictivepower of various daily indicators for forecasting monthly inflation rates. Itturns out that the commodity price index is a useful predictor for inflationsrates 20–30 days ahead with a hump-shaped lag distribution.

Key wordsForcasting, mixed frequency, nonparametric regression

JEL classificationC12, C32

∗Corresponding author: Institute of Econometrics, University of Cologne, Albert-Magnus-Platz,50923 Bonn, Germany. E-mail address: [email protected].

1

Page 2: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

1 Introduction

Due to increased accessibility of high-frequency data it is becoming more andmore attractive to employ variables observed at different sampling frequenciesfor economic forecasting (see Ghysels et al. (2006), Monteforte and Moretti (2008),Andreou et al. (2011), Modugno (2013), and Banbura et al. (2013) for a compre-hensive review of the literature). In such exercises various high-frequency obser-vations (30 days, say) of the predictor are associated with a single low-frequencyobservation (one month). This raises the problem of how to exploit the potentiallylarge number of intra-period observations to forecast the low frequency process.The simplest approach just averages the corresponding high-frequency observa-tions to obtain an aggregated predictor for the dependent low-frequency variable.Accordingly, the intra-period observations are equally weighted when formingthe predictor. A more appealing approach is to adapt a more flexible weightingscheme. For example, when using daily financial data to forecast monthly vari-ables like inflation rates we may want to assign larger weights to more recentobservations. Using least-squares methods for estimating the weighting schememay however yield unreliable estimates due to a large number of estimated pa-rameters and possible multicollinearity. The MIDAS approach suggested by Ghy-sels et al. (2006) is a convenient and elegant parametric approach to overcomethese problems.

An important drawback of such a parametric approach is that the shape ofthe lag distribution is governed by a (more or less) arbitrary class of functionssuch as exponential polynomials or the beta function. In empirical applications,parsimonious specifications of parametric functions may fail to yield an accurateapproximation of the lag distribution. For example, using exponential polynomi-als or the beta function implies that the regressors affect the dependent variablein the same direction, ruling out the case that the predictive regressor affects thetarget variable positively for some range of lags, whereas the effect becomes nega-tive at some other lags. This happens, for example if the long-run effect is smallerthan the short-run effect. Such scenarios are ruled out in the standard parametricMIDAS framework.

It is therefore desirable to employ nonparametric techniques that circumventthe choice of a particular class of parametric functions when fitting the lag dis-tribution to high-frequency data. In this paper a nonparametric approach is sug-gested that merely imposes some degree of smoothness to the lag distribution.

2

Page 3: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

This approach is similar to fitting cubic splines for approximating an unknownfunctional form. Related approaches are popular in many areas of applied statis-tics. For example, the widely used Hodrick-Prescott filter for an unknown trend(e.g. Hodrick and Prescott (1997)) or the flexible least-squares approach (e.g. Kal-aba and Tesfatsion (1989) and Lutkepohl (1993)) is based on similar principles.

In Section 2 we sketch the parametric MIDAS approach proposed by Ghyselset al. (2006) and motivate our nonparametric framework. More details of thesuggested smoothed least-squares (SLS) estimator are considered in Section 3 andsome first results on the relative performance of the nonparametric estimator arepresented in Section 4. In the empirical application of Section 5 we investigate thepredictive power of daily time series (such as commodity prices, stock prices andinterest rates) for monthly inflation rates in Germany. It turns out that the dailycommodity price index (or daily oil prices in Rotterdam) of the previous monthexplains more than 40 percent of the variance of monthly changes in the Germaninflation rates. Section 6 concludes.

2 Mixed frequency estimation

Consider the regression model combining the low-frequency variable yt and thehigh-frequency variable xt,j, where t = 1, . . . , T is the low-frequency time index(monthly, say) and j is the intra-period high-frequency (daily) observation withj = 1, . . . , nt (nt ∈ {28, . . . , 31} in our example). For convenience we assumethat the time index s runs in the opposite direction, that is, the pair (t, nt) repre-sents the first and (t, 0) the final observation of the month t. The low and highfrequency variables are linked by a linear regression of the form

yt+h = α0 +p

∑j=0

β jxt,j + ut+h ,

where ut+h is uncorrelated with xt, . . . , xt−p. To simplify the exposition, we con-fine ourselves to a single predictor variable xt,j so that β j is a scalar. Furthermore,the lag-length p is assumed to be smaller than the minimum number of intra-period observations (i.e. p < min(nt) = 28 in our example with daily obser-vations). The extension to higher lag-orders is straightforward but involves anadditional notational burden. In the MIDAS approach of Ghysels et al. (2007)and Andreou et al. (2011)) the coefficients are written as β j = α1ωj(θ), where the

3

Page 4: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

weights are generated by an exponential polynomial

ωj(θ) =exp

(θ1 j + · · ·+ θk jk)

∑pi=0 exp

(θ1i + · · ·+ θkik

) .

The vector of k hyper-parameters θ = (θ1, . . . , θk)′ is unknown. Alternatively, the

Beta distribution may be used (e.g. Ghysels et al. (2007)). Obviously, wj(θ) ∈[0, 1], and ∑

pj=0 wj (θ) = 1. With this specification, the model becomes

yt+h = α0 + α1

p

∑j=0

wj (θ) xt,j + ut+h , (1)

and the parameters α0, α1, and θ1, . . . , θk can be estimated by non-linear leastsquares (NLS). In many empirical applications a second order exponential Almonlag with k = 2 is sufficient to yield an accurate approximation of the weight func-tion (cf. Ghysels et al. (2007)). Essentially, MIDAS regression models extend theaggregation with equal-weights to more general temporal aggregation schemesthat appear to be better suited to many empirical applications, in particular forthe purpose of forecasting volatility (see Ghysels et al. (2006)) or macroeconomicforecasting with high-frequency predictors (see Andreou et al. (2013) or Foroni etal. (2014)).

The MIDAS approach may also be motivated as a restricted OLS regressionwhere the “reduced form parameters” in the OLS regression

yt+h = α0 + β′xt,• + ut+h , (2)

with β =(

β0, . . . , βp)′ and xt,• =

(xt,0, . . . , xt,p

)′ are restricted by the nonlinearfunction β = g(α1, θ) with the j’th element β j = α1ωj(θ). Accordingly, the pa-rameters are estimated by minimizing the criterion function

S(α, θ) =T

∑t=1

[yt+h − α0 − x′t,•g(α1, θ)

]2 .

The crucial feature of the function g(α, θ) is that it represents the high-dimensionalvector β by a smooth parametric function depending on a low-dimensional vectorof parameters θ. In this paper we propose an alternative nonparametric approachthat does not impose a particular functional form but merely assumes that the co-efficient β j is a smooth function of j in the sense that the absolute values of the

4

Page 5: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

second differences

∇2β j = β j − 2β j−1 + β j−2 for j = 2, . . . , p ,

are small. Specifically, the coefficients β0, . . . , βp are obtained by minimizing thepenalized least-squares objective function

S(α0, β) =T

∑t=1

(yt+h − α0 − β′xt,•)2 + λ

p

∑j=2

(∇2β j)2 , (3)

where λ is a pre-specified smoothing parameter. This objective function providesa trade-off between goodness of fit (which is maximized by using the unrestrictedleast-squares estimator) and an additional term that penalizes large fluctuationsof the coefficients β0, . . . , βp. To some extend this approach resembles the wellknown Hodrick-Prescott (1997) filter for extracting a smooth trend. An impor-tant difference is, however, that the dimension of the parameter space is typicallysmaller than the number of observations, whereas the parameter space underly-ing the Hodrick-Prescott filter (represented by the number of terms in the penaltyfunction) equals the number of observations. Our approach is also related toother nonparametric approaches such as cubic splines (e.g. Hardle (1990)) that in-volve an objective function based on the sum of squared residuals and a quadraticpenalty function.

In the following section some properties of the estimator βλ that results fromminimizing the penalized least-squares function are analyzed and data-dependentmethods for choosing the smoothing parameter are considered.

3 A nonparametric MIDAS approach

3.1 The SLS Estimator

Consider the smoothed least-squares (SLS) estimator resulting from

βλ = argminβ

{(yh − Xβ)′(yh − Xβ) + λβ′D′Dβ

}=

(X′X + λD′D

)−1 X′yh , (4)

5

Page 6: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

where

D =

1 −2 1 0 · 00 1 −2 1 · 0· · · · · ·0 · · 1 −2 1

(p− 1)× (p + 1),

yh = [y1+h, . . . , yT+h]′, X = [x1,•, . . . , xT,•]

′, and xt,• =[xt,0, . . . , xt,p

]′. For sim-plicity we ignore the constant α0 in the regression.

This estimator admits different interpretations. First, it may be rewritten as ashrinkage (ridge) estimator. Define the (p − 1)× 1 vector of second differencesas γ2 = Dβ. Furthermore, let D0 denote a 2× (p + 1) matrix such that D =

[D′0, D′]′ is invertible. For concreteness, let D0 = [I2, 02×(p−1)]) such that γ =

Dβ = [γ′1, γ′2]′ with γ1 = [β0, β1]

′. Minimizing (3) is equivalent to minimizing theobjective function

S(γ) = (yh − X1γ1 − X2γ2)′(yh − X1γ1 − X2γ2) + λγ′2γ2 , (5)

with respect to γ = [γ′1, γ′2]′ = Dβ, where X = XD−1 = [X1, X2]. This reformu-

lation of the minimization problem shows that the SLS is related to the shrinkage(ridge) estimator since the estimator of γ2 resulting from a minimization of (5)takes the form

γ2 =(

X∗′2 X∗2 + λIp−1

)−1X∗′2 yh , (6)

where X∗2 = M1X2 and M1 = IT − X1(X′1X1)−1X′1. Note that the unrestricted

OLS estimator results from setting λ = 0, whereas for λ → ∞ the coefficientsβ0, . . . , βp lie on a straight line.

A second interpretation of the estimator emerges from assuming stochastic

coefficients with ∇2βiiid∼ N (0, η2). In this case λ = σ2

u/η2 and the objective func-tion is equivalent to the log-likelihood function. In a similar manner a Bayesianinterpretation is available (with a normal prior distribution for ∇2βi). Finally wenote that the SLS estimator can be conveniently computed as an OLS estimatorwith [y′, 0′p−2] as the vector of dependent variables and [X′,

√λD′]′ as regressor

matrix.Another interesting interpretation is obtained from rewriting the first order

6

Page 7: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

condition as

βλ =

[Ip+1 + λT

(1T

X′X)−1

D′D

]−1

β , (7)

where β = (X′X)−1X′yh denotes the OLS estimator and λT = λ/T. This re-lationship between the unrestricted and the smoothed estimator shows that βλ

is obtained as a weighted average of all coefficients β0, β1, . . . , βp. Assuming

T−1X′Xp→ SX it follows that as λT = λ/T → 0, the SLS estimator converges

to the OLS estimator in the sense that√

T(βλ − β) = op(1).

3.2 Choice of the smoothing parameter

When deriving the SLS estimator we have assumed that the smoothing parame-ter λ is given. For empirical applications, however, some guidance is needed forchoosing this parameter. One possible approach is to adapt some cross-validationmethod. To this end the sample is partitioned into k blocks with int(T/k) obser-vations. For a given value of the smoothing parameter λ, we discard the j-thblock of observations, j = 1, . . . , k, and obtain βλ by using the observations in theremaining k− 1 blocks. Let β

(−j)λ denote the SLS estimator by leaving out the j’th

block. The CV criterion is obtained by computing the squared prediction error forall k blocks. By evaluation the CV criterion for a grid of values from the intervalλ ∈ [λ`, λu] the MSE optimal value of the smoothing parameter can be found. Inpractice, however, the CV procedure is computational demanding as the interval[λ`, λu] is typically very large.

A computationally simpler approach is to select the parameter λ based onsome information criterion. Note that the fitted values of the SLS method resultas

yh(λ) = X(X′X + λD′D

)−1 X′yh = Pλyh,

with Pλ = X (X′X + λD′D)−1 X′. For λ = 0 we have tr(Pλ) = p, whereas forλ → ∞ it follows that tr(Pλ) = 2. Accordingly, the trace of the matrix measuresthe dimension of the space spanned by the estimator βλ and the choice of λ isequivalent to select the “dimension” of the space spanned by estimated coeffi-cients. Although this notion is reasonable for an integer number of κλ = tr(Pλ),

7

Page 8: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

this interpretation is somewhat unclear in the case of fractional values of κλ.The notion of the pseudo-dimension κλ is also useful to compare the non-

parametric approach to alternative parametric procedures. Choosing the de-gree of exponential Almon lag distributions imply discrete values of the pseudo-dimension, whereas the nonparametric approach allows us to choose any realvalue in the interval [2, ∞), and, therefore, we expect choosing a more appropri-ate level of smoothness in practice.

The information criteria balances the trade-off between goodness-of-fit andthe dimension reduction of the estimation procedure: as λ increases, the sum ofsquared residuals increases relative to OLS, while the (pseudo) dimensionality κλ

shrinks down. We adapt the framework of Hurvich, Simonoff and Tsai (1998) andemploy the modified Akaike criterion given by

AIC(λ) = log([yh − yh(λ)]′[yh − yh(λ)]

)+

2(κλ + 1)T − κλ − 2

.

It is interesting to note that for large T the modified Akaike crriterion is equivalentto the original AIC criterion suggested by Akaike (1974), where κλ replaces thenumber of parameters. The smoothing parameter results from minimizing thiscriterion with respect to λ. As shown in the appendix, the minimizer λ∗AIC resultsfrom solving the first-order condition(

yh′XQ−1λ∗AIC

(D′D

)Q−1

λ∗AICX′(

yh − yh (λ∗AIC)))

(SSR (λ∗AIC))−1

= (T − 1) tr[(

X′X)

Q−1λ∗AIC

(D′D

)Q−1

λ∗AIC

] (T − κλ∗AIC

− 2)−2

, (8)

where Qλ = (X′X + λD′D), SSR (λ) =[yh − yh (λ)

]′ [yh − yh (λ)]. Accordingly,

the optimal choice of λ can be found by determining the root of equation (8).

4 Estimation Performance

4.1 Design of Monte Carlo simulation

In this section, we compare the small sample properties of parametric and non-parametric variants of the MIDAS regression. In order to compare our results tostudies that have been conducted previously in the literature, we generate data

8

Page 9: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

according to the model of Andreou et al. (2010), given by

yt+h = β0 +p

∑j=0

β jxt,j + ut+h , (9)

β j = α1ωj(θ), (10)

ut+hi.i.d.∼ N (0, 0.125) , (11)

for t = 1, 2, . . . , T, where β0 = 0.5, and ωj (·) is a weighting function for whichseveral specifications are presented in more detail below. The high-frequencypredictor is generated by the AR(1) process

xt,j = c + $ xt,j−1 + ε j,t, ε j,ti.i.d.∼ N (0, 1) ,

for j = 0, . . . , p, where xt,j−p−k = xt−1,j−k for all k > 0. Accordingly, xt,j denotesthe j′th lag of the AR(1) series xt,0. As in Andreou et al. (2010), c = 0.5 and$ = 0.9.1

Since X is independent of uh with E(uhuh ′) = σ2u IT, the (trace) MSE condi-

tional on X is obtained as

E[(βλ − β)′(βλ − β)|X] = β′ΨT(λ)′ΨT(λ)β + σ2

u[Ip+1 −ΨT(λ)]X′X[Ip+1 −ΨT(λ)′],

(12)

where ΨT(λ) = λ(X′X + λD′D)−1D′D. Minimizing the (conditional) MSE(λ)with respect to λ yields the MSE minimal value of the smoothing parameter λMSE.In our simulation experiments the smoothing parameter is reported relative to thesample size, that is, λMSE = λMSE/T. Similarly, λAIC = λAIC/T is obtained fromminimizing the modified AIC criterion.

In model (9) – (11), we choose the sample size as T ∈ {100, 200, 400} andthe number of high-frequency lags as p + 1 ∈ {20, 40, 60}. Moreover, in (10),we choose the scaling parameter α1 ∈ {0.2, 0.3, 0.4}, corresponding to models ofsmall, medium and large signal-to-noise-ratios, respectively. Although the im-plied (centered) R2 differ a bit due to different sample sizes, weighting functionsand the number of high-frequency lags, these signal-to-noise ratios roughly cor-

1In additional Monte Carlo simulations (cf. Breitung et al. (2013)) we found that the perfor-mance of the nonparametric MIDAS approach improves relative to the parametric competitor asthe autocorrelation of the regressor gets larger. This may be due to the implied multicollinearityof the regressors that tends to affect the parametric estimator more severely.

9

Page 10: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

respond to 0.40 ≤ R2 < 0.50, 0.50 ≤ R2 ≤ 0.70 and R2 > 0.70, respectively. Thenumber of Monte Carlo replications is 1000.

For each of the different weighting functions given below, we then considerthe ratio of the median MSE of the nonparametric approach relative to medianMSE of the parametric estimation procedure. Accordingly, a ratio above oneimplies that the parametric estimator produces more accurate estimates of theweighting function on average, whereas a ratio below one indicates that the non-parametric method is superior. All results regarding relative estimation accuracyare presented in table 1.

The different weighting functions are presented in figure 1. The weightingfunctions generated for this exercise are meant to cover different types of behav-ior of the high-frequency weights, including fast decay (with a lot of weightsapproximately equal to zero after some cut-off lag), as well as modest or evenslow decay (see the first and third row of figure 1).

4.2 Results

In our first Monte Carlo experiment we consider a lag distribution that fits intothe parametric MIDAS framework. The weight function is exponentially declin-ing with

ωj(θ) =exp(θ1 j)

p∑

i=0exp(θ1i)

, j = 0, . . . , p. (13)

where we follow Andreou et al. (2010) and set θ1 = 7 · 10−4 and θ2 = −6 · 10−3.The parametric NLS estimator is computed using the exponential Almon lag withk = 2 parameters. The top row of figure 1 depicts the lag distributions and thetop row of table 1 shows the MSE ratios.

Next, we also examine a hump-shaped pattern given by

ωj(θ) =exp

(θ1 j− θ2 j2

)p∑

i=0exp (θ1i− θ2i2)

. (14)

We determine the parameters such that the weighting function reaches a max-imum at j = 5, j = 10, and j = 15 when 20, 40 and 60 lags are employed,respectively. To this end, we choose θ1 = 8 · 10−2 and θ2 = θ1/10, θ2 = θ1/20,

10

Page 11: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

and θ2 = θ1/30, respectively. The parametric NLS estimator is computed usingthe exponential Almon lag with k = 2 parameters. The second row of figure 1presents the lag distributions and the MSE ratios are given in the second row oftable 1.

As another example of a monotonically declining weight function we considerthe linear function:

ωj(a0, a1) =a0 + a1 j

a0(p + 1) + a1(p + 1)(p + 2)/2(15)

The weights are normalized to sum up to unity. The third row in figure 1 presentsthe shapes of the lag distributions and the third row of table 1 reports the of MSEratios in this case.

Finally, to highlight the flexibility of the nonparametric approach, we considera cyclical weight function in which the weights change sign.

ωj(c1, c2) =c1

p + 1

[sin(

c2 +j2π

p

)], (16)

where c2 = 0.01, and c1 = 5, c1 = 2.5, and c1 = 5/3 for the cases of 20, 40 and 60lags, respectively, ensuring that these weights sum up to one.2 The bottom rowof figure 1 show the implied lag distribution of this specification, while the MSEratios are displayed in the bottom row of table 1.

The results of table 1 can be summarized as follows. First, the nonparametricapproach can improve in-sample estimation accuracy substantially relative to theparametric approach. This observation applies in particular to the linear and thesign-changing weights, but to some extent also to the exponentially decliningweights. For the humped-shaped weights, the parametric estimator yields moreaccurate estimates in most cases. Moreover, the nonparametric approach tends toperform better for low or medium signal-to-noise ratios.

Regarding the exponentially declining weights, the performance of the para-metric approach improves as the sample size T increases, which is to be expected,as the parametric model for the weights is correctly specified. The same patternapplies to the humped-shaped case, in which the parametric estimator clearlyoutperforms the nonparametric estimator as the sample size grows. Turning to

2For the sign-changing weights, we also experimented with the 3 parameter version of theexponential Almon lag, but it did not yield an improvment in terms of MSE relative to the speci-fication with two parameters. See also Breitung et al. (2013).

11

Page 12: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

the sign-changing weights, the nonparametric approach yields sizeable gains inestimation accuracy for this weighting function. This result is not too surprisingas the parametric approach is misspecified and cannot produce negative weights.Similarly, the case of linear weights may be favorable for the nonparametric ap-proach, as the SLS estimator attains a linear function as the degree of smoothingincreases. The point is, however, that in a given estimation problem, the trueweights may not be described by a presupposed (even though flexible) paramet-ric family of models, and the nonparametric approach may be very beneficial inthese cases.

Second, selecting the smoothing parameter by the AIC criterion leads to MSEratios that are relatively close to the ratios implied by the MSE minimal (infeasi-ble) value of the smoothing parameter in some cases, but may fall short of thisbenchmark in other cases considered here. Hence, although the AIC criterionsuggests reasonable (albeit not optimal) values for the smoothing parameter, ex-perimentation with different values for the smoothing parameters may be neces-sary to select the smoothing parameter in practice.

Until now we have focused on estimating the weights β j in (10). We now turnto evaluate the accuracy of out-of-sample forecasts made by the nonparametricand the parametric approach in this simple model. To this end, we partitionthe whole sample into an estimation sample, comprising observations 1, 2, . . . , Te,and a forecasting sample consisting of observations Te + 1, . . . , T. In this exercisewe set Te = T/2.

The estimation sample is used to obtain a baseline estimate of the weights β j

from the sample

yt+h =p

∑j=0

β j|T0xt,j + ut+h , for t = 1, . . . , Te.

Given the baseline estimates, the one-step ahead forecast of the dependentvariable is

yTe+1|Te =p

∑j=0

β j|Te xTe,j .

We then include the next period in the estimation sample while dropping thefirst period, re-estimate the model and obtain the next one-step ahead forecast.Proceeding in this fashion, we obtain a series of one-step ahead forecast errors

12

Page 13: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

employing a moving window of Te observations in each step.3

It is important to note that forecasts relying on the parametric and nonpara-metric MIDAS regressions are based on the same information set It = {xt,0, . . . , xt,p}.Thus, it is not surprising that both forecasting methods perform similarly. Lety(1)t+h|t = f

θ(It) and y(2)t+h|t = f

λ(It) denote the parametric and nonparametric

forecasts based on the high-frequency predictor. The decomposition of the fore-cast error yields

yt+h − y(j)t+h|t = E(yt+h|It)− y(j)

t+h|t︸ ︷︷ ︸estimation error

+ ut+h︸ ︷︷ ︸forecast error

.

In our out-of-sample forecast exercise the MSE is dominated by the forecast error,whereas the differences between y(1)t+h|t and y(2)t+h|t is typically small relative to theforecast error. Therefore it is not suprising that the MSE ratios of the two forecastmethods are generally close to unity. Notable exceptions are the results for thehump-shaped weight function with p + 1 = 60 and sign-changing weights withp + 1 = 20, where the forecasts of the non-parametric MIDAS procedure clearlyoutperforms the parametric approach.

5 Forecasting Monthly Inflation Rates Using Daily In-

dicators

5.1 Daily predictors

Frequent updates of inflation forecasts are becoming more and more importantfor central banks and financial analysts. Such updates can be easily obtainedby employing daily time series. Nowadays a variety of financial time series isavailable on a daily basis and their role in macroeconomic now- and forecastingis considered; see Andreou et al. (2013) and Banbura et al. (2013) for examplesof recent applications. Modugno (2013) analyzes the predictive content of dailyworld market prices of raw materials as well as some financial indicators such asinterest rates, stock price indices and exchange rates. From the set of monthly,weekly and daily indicators, she extracts leading factors by using the state-spacerepresentation of the mixed-frequency dynamic factor model.

3Due to the recursive nature of the exercise and the multiple non-linear optimization algo-rithms involved, the number of Monte Carlo replications is reduced to 250.

13

Page 14: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

In our empirical application we focus on the most reliable daily indicator forEuro area inflation. In a preliminary analysis Breitung et al. (2013) analyze yieldspreads, inflation-indexed bonds, commodity prices and stock price indices asdaily indicators for monthly inflation. The choice of these variable was basedon the literature on forecasting inflation. Most of the variables are related toinflation expectations. Kozicki (1997) argues that yield spreads reflect expecta-tions on the direction of future inflation changes. The US Treasury began issuinginflation-linked bonds with its treasury inflation-protected securities (TIPS) since1997. From subtracting yields of inflation-linked bonds from nominal interestrates, market expectations of future inflation can be derived. Stock and Watson(2003) employ overnight interest rates, stock price indices and exchange rates intheir analysis of inflation forecasts.

In a comprehensive comparison of alternative predictors Stock and Watson(1999) and Modugno (2013) found that the commodity prices index is the mostreliable leading indicator among a variety of candidate predictors.4 Breitung etal. (2013) analyze the predictive content of 15 daily indicators for the Germaninflation rate and found that predictive (MIDAS) regressions using a broad com-modity price index and crude oil prices yield a regression R2 of more than 0.3,whereas all other predictive regressions exhibit an R2 of less than 0.10. As ar-gued by Bruneau et al. (2007) energy prices do play a dominant role in infla-tion forecasting, whereas exchange rates are poor predictors of inflation changes.In a similar vein, empirical results suggest that interests rates, inflation-indexedbonds, bond yields and yield spreads do not reveal any considerable predictivepower.

In our empirical investigation we therefore focus on the commodity prices in-dex as a daily predictor of inflation. The results for the oil price index are broadlysimilar. In our empirical application we therefore analyze the predictive power ofa daily commodity price index employing parametric and nonparametric MIDASframeworks.

5.2 Data and preliminary analysis

The target variable is the seasonally adjusted Euro area HICP overall price indexfor the Euro area as provided by the European Central Bank. The monthly in-

4See also Edelstein (2007), Browne and Cronin (2009), Gospodinov and Ng (2013) and Monte-forte and Moretti (2013) for more recent empirical studies.

14

Page 15: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

flation rate is computed as yt = 100 [log (Pt)− log (Pt−1)], where Pt denotes theprice index, see figure 2. The time span ranges from 2000m1 until 2013m12. Theunivariate time series is well represented by an AR(1) process, yielding a prettylow R2 of 0.07, cf. Table 3 for the full-sample estimation results.

Our daily predictor is the HWWI total commodity price index (Code S275VR).This price index is derived from world market prices of various commoditiesand provided by the Hamburgisches WeltWirtschaftsInstitut (HWWI) on a daily(Euro) basis. For our preliminary analysis the aggregated monthly indicator iscomputed as the change rate from the first to the final available day of the respec-tive month. Table 3 presents the estimation results of the full-sample predictiveregressions including the lagged commodity price index as an additional regres-sor. It turn out that this predictor improves the fit up to an R2 of 0.425, suggestingthat changes of the commodity price index is indeed a powerful predictor formonthly inflation rates. Furthermore, the lagged inflation rate becomes insignif-icant and, therefore, we drop the lagged dependent variable in our predictiveregressions. Finally we observe that any further lag of changes in the commodityprice index is insignificant and does not improve the fit.

An important practical problem is that the number of daily observations ina month is not constant: months have different lengths (from 28 up to 31 days)and some observations are missing due to weekends, holidays, publication lagsand so on. In fact the number of daily observations ranges from 17 to 23. Asthere is no silver bullet to fix this problem, previous research has addressed it indifferent ways. Andreou et al. (2013) in their work on the predictive ability offinancial time series fixed the number of trading days in a month to 22. In a sim-ilar vein, Hamilton (2008) assumes 21 business days in a month when analyzingmeasures of the policy impact at daily frequencies. Dias and Embrechts (2004)linearly interpolate missing observations of foreign exchange spot rates for USdollar against Deutsche Mark. Hence, there are different possible strategies todeal with missing daily observations and which approach prevails is an empiri-cal matter. In the daily time series we use, there are not enough observations ineach month to fix the length equal to 22 or 23 days and cutting all months to 17days leads to an unacceptable loss of information. The simplest way to deal withthis problem is to ignore the missing observations and include a fixed number oflags in the regression so that the actual time span may be longer than the numberof lags. We also experimented with linearly interpolating missing observationsin the daily series and use imputed observations in the regression. Overall, the

15

Page 16: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

results are not much affected by the alternative ways to deal with missing ob-servations and, therefore, we only present results where the gaps between dailyobservations are ignored.5

Figure 3 depicts the adjusted R2 as a function of the number of daily lags p. Itturns out that the goodness of fit converges to a value of roughly 0.55 after 30 lags.This implies that including 30 lags should be sufficient to capture the predictivecontent of the daily change of commodity prices.

5.3 In-sample results of MIDAS regressions

Table 4 presents the estimation results for the parametric MIDAS regression usingthe HWWI index as the predictor:

yt+h = α0 + α1

(p

∑j=0

ωj(θ)xt,j

)+ ut+h

where

ωj(θ) =exp

(θ1 j + θ2 j2

)p∑

i=0exp (θ1i + θ2i2)

.

The estimated parameters for different forecast horizons h and lag lengths p arereported along with their t-statistics. Interestingly, the R2 of the daily HWWIseries exceeds 0.3 only for h = 1 and p ∈ {30, 50}, which suggests that the HWWIindex is a useful predictor only for the next month and h > 10. A plausible reasonfor this is that prices need 20 up to 30 days to adjust to changes in commodity (orenergy) prices. Related research (e.g. Kilian 2008) finds a similar response lag.

To test the hypothesis that the lag distribution is flat (i.e. the weights are iden-tical for all 30 lags), the regression is performed by using the simple average of30 days, yielding an R2 of 0.3421 for h = 1. The F-statistic of the joint hypothesisθ1 = θ2 = 0 takes a value of 3.689 which is significant with a p-value less than0.01.

The lag distributions resulting from different estimation methods are pre-sented in figures 4a - 4c for the case of 20, 30 and 50 daily lags. UnrestrictedOLS estimation of the weights already suggest a hump-shaped weight function,

5In our preliminary work (Breitung et al. (2013)) we present results based on a linear interpo-lation of the missing values.

16

Page 17: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

and the parametric and nonparametric estimators smooth out these erratic un-restricted estimates. Moreover, in the preferred specification with 30 daily re-gressors, the lag distribution estimated by SLS follows the unrestricted estimatesslightly more closely than the parametric MIDAS estimates, with a maximumaround 5 lags, while the parametric estimator suggests a maximum around 9lags.

5.4 Out-of-sample forecasting comparison

Following Kuzin et al. (2009), direct forecasts are performed based on a horizon-specific model (see also Marcellino et al., (2006)). For this forecasting approach weregress the future values of the dependent variable (denoted by yt+h) on currentor past values of the regressors. Consequently, all parameters of the model arere-estimated for each forecast horizon.

We now turn to forecasting monthly and quarterly inflation in the Euro area.The samples are split into an estimation sample t = 1, . . . , Te and a forecastingsample t = Te + 1, . . . , T, where n f = T − Te denotes the number of forecasts.The estimation sample runs from January 2000 to December 2006 and the fore-casting exercise covers January 2007 until December 2013. Forecasts are obtainedrecursively with a moving estimation window. Given the estimated lag distri-butions obtained from the estimation sample, the first one-period-ahead forecastis produced. Then the first observation from the estimation sample is droppedwhile the end of the sample is extended to include the observations in the nextperiod. The MIDAS regressions are re-estimated and inflation in the next periodis forecasted. This forecasting scheme continues until the penultimate period inthe sample is reached and inflation is forecasted for the last period in the sample.

We evaluate forecasts according to the root mean squared forecast error

RMSE =

√√√√ 1n f

Te+n f

∑t=Te+1

(yt+h − yt+h|t

).

Table 5 shows the relative forecasting accuracy of the nonparametric (SLS) MI-DAS relative to the parametric approach (NLS) for various choices of the laglength. As a benchmark, we also present the RMSE of the forecast based onthe mean of the regressors x = (p + 1)−1 ∑

pj=0 xt,j. Starting with the one-step

ahead forecasts, the nonparametric approach produces (slightly) more accurate

17

Page 18: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

forecasts than the benchmark and the parametric (NLS) MIDAS regression. Forh = 2 all forecasts are uninformative provided that the RMSE is close to the sam-ple standard deviation σ2

y = 0.1953 of the target variable.

6 Conclusion

Combining variables sampled at different frequencies involves the problem ofestimating a rich autoregressive lag distribution, where the effective sample sizeis governed by the low frequency variable. The MIDAS regressions representa simple, parsimonious, and flexible class of time series models that allow theleft-hand and right-hand side variables of time series regressions to be sampledat different frequencies. In this framework the lag distribution is represented bya class of parametric functions such as exponential Almon lag polynomials. Adrawback of this approach is that the shape of the lag distribution is restricted bya small number of parameters. To allow for a more flexible representation of thelag distribution we only impose a smoothness condition yielding some smoothedversion of the unrestricted autoregressive coefficients. The resulting SLS estima-tor can be seen as a shrinkage estimator that penalizes deviations from a (local)linear functional form. Monte Carlo simulations suggest that the SLS estimatorperforms similarly to the parametric MIDAS regression if the lag distribution isincluded in the parametric class of specification. In other cases the SLS estimatormay yield a substantially lower MSE than the usual parametric MIDAS regres-sions. To assess the predictive power of daily indicators for monthly (Euro area)inflation rates, we apply parametric and nonparametric methods to a sample of14 years. It turns out that the HWWI commodity price index is a useful one-month ahead predictor for inflations rates. The estimated lag-distribution coversaround 30 days and is hump-shaped with a maximum at 5–10 days.

Appendix A: First order conditions for the minimum

of the AIC criterion

We consider the problem

minλ≥0

AIC (λ) ,

18

Page 19: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

where

AIC (λ) = log([y− y (λ)]′ [y− y (λ)]

)+

2 (κλ + 1)T − κλ − 2

.

Assuming a minimum exists in the interior of R+, we consider

ddλ

AIC (λ) =d

(log[(

yh − yhλ

)′ (yh − yh

λ

)]+

2 (κλ + 1)T − κλ − 2

),

where κλ = tr[

X (X′X + λD′D)−1 X′]. First,

ddλ

(log[(

yh − yhλ

)′ (yh − yh

λ

)])=

1SSRλ

ddλ

((yh − yh

λ

)′ (yh − yh

λ

))

with SSRλ =(yh − yh

λ

)′ (yh − yhλ

). We establish two intermediate results: first,

ddλ

Pλ =d

(X(X′X + λD′D

)−1 X′)

= X(

ddλ

(X′X + λD′D

)−1)

X′

= X(−(X′X + λD′D

)−1 (D′D) (X′X + λD′D)−1)

X′

= −XQ−1λ

(D′D

)Q−1

λ X′ (17)

with Qλ = (X′X + λD′D). Second,

ddλ

(P′λPλ

)=

(d

(P′λ))

Pλ + P′λ

(d

dλ(Pλ)

)=

(d

dλ(Pλ)

)′Pλ + P′λ

(d

dλ(Pλ)

)=(−XQ−1

λ

(D′D

)Q−1

λ X′)

Pλ + P′λ(−XQ−1

λ

(D′D

)Q−1

λ X′)

(18)

Using (17) and (18),

ddλ

((yh − yh

λ

)′ (yh − yh

λ

))=

ddλ

(−2yh ′Pλyh + yh ′P′λPλyh

)= −2yh ′

(d

dλ(Pλ)

)yh + yh ′

(d

(P′λPλ

))yh

= 2yh ′XQ−1λ

(D′D

)Q−1

λ X′(

yh − yhλ

)(19)

19

Page 20: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Next,

ddλ

(2 (κλ + 1)T − κλ − 2

)=

2 ddλ (κλ) (T − κλ − 2) + 2 (κλ + 1)

(d

dλ (κλ))

(T − κλ − 2)2 (20)

and

ddλ

(κλ) =d

dλ(tr [Pλ])

= tr[

ddλ

(Pλ)

]= −tr

[(X′X

)Q−1

λ

(D′D

)Q−1

λ

](21)

Inserting (21) into (20) yields

ddλ

(2 (κλ + 1)T − κλ − 2

)= −

2 (T − 1) tr[(X′X) Q−1

λ (D′D) Q−1λ

](T − κλ − 2)2

Taken together, a necessary condition for the minimizer λ?AIC is

(SSRλ?

AIC

)−1 (yh ′XQ−1

λ?AIC

(D′D

)Qλ?

AICX′(

yh − yhλ?

AIC

))=

T − 1(T − κλ?

AIC− 2)2 tr

[(X′X

)Q−1

λ?AIC

(D′D

)Q−1

λ?AIC

]

This condition can be solved numerically. A sufficient condition for a minimizeris obtained by checking the second-order derivative at λ?

AIC. Using the same rulesfor matrix differential calculus as above, we find

d2

dλ2 AIC (λ) = SSR−2λ

{(2yh ′XQ−1

λ

(D′D

)Q−1

λ

(X′X

)Q−1

λ

(D′D

)Q−1

λ X′yh−

4yh ′XQ−1λ

(D′D

)Q−1

λ

(D′D

)Q−1

λ X′(

yh − yhλ

))SSRλ−(

2yh ′XQ−1λ

(D′D

)Q−1

λ X′(

yh − yhλ

))2}+ 4 (T − 1) (T − κλ − 2)−3 ·{

tr[(

X′X)

Q−1λ

(D′D

)Q−1

λ

(D′D

)Q−1

λ

](T − κλ − 2)

+(

tr[(

X′X)

Q−1λ

(D′D

)Q−1

λ

])2}

.

20

Page 21: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

References

Akaike (1974), A new look at the statistical model identification, IEEE Transac-tions on Automatic Control, 19, 716-723.

Andreou, E., Ghysels, E., and Kourtellos, A. (2010), Regression models with mixedsampling frequencies, Journal of Econometrics, 158, 246–261.

Andreou E., Ghysels, E., and Kourtellos, A. (2011), Forecasting with Mixed-frequencydata. Chapter prepared for the Oxford Handbook on Economic ForecastingOxford University Press. (ed.) with David F. Hendry.

Andreou, E., Ghysels, E., and Kourtellos, A. (2013), Should macroeconomic fore-casters use daily financial data and how? Journal of Business and EconomicStatistics, 31, 240-251.

Banbura, M., D. Giannone, M. Modugno and L. Reichlin (2013), Now-Castingand the Real Time Data Flow, in: G. Elliott and A. Timmermann, eds., Hand-book of Economic Forecasting, Vol. 2, Elsevier, North Holland.

Breitung, J., and Schumacher, C. (2008), Real-time Forecasting of German GDPbased on a large Factor Model with Monthly and Quarterly Data, Interna-tional Journal of Forecasting, 24, 386–398.

Breitung, J., Roling, C., and Elengikal, S. (2013), Forecasting inflation rates us-ing daily data: A nonparametric MIDAS approach, Working paper, Univer-sity of Bonn.

Browne, F. and D. Cronin (2010), Commodity prices, money and inflation, Jour-nal of Economics and Business, 62, 331–345.

Bruneau, C., de Bandt, O., Flageollet, A., and Michaux, E. (2007), Forecasting In-flation Using Economic Endicators: the Case of France, Journal of Forecast-ing, 26, 1-22.

Dias, A., and Embrechts, P. (2004), Dynamic Copula Models for Multivariate high-frequency Data in Finance. Discussion Paper.

Edelstein, P. (2007), Commodity Prices, Inflation Forecasts, and Monetary Policy,Working Paper, University of Michigan.

21

Page 22: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Foroni C., Marcellino M., and Schumacher, C. (2014), Unrestricted mixed datasampling (MIDAS): MIDAS regressions with unrestricted lag polynomials,Journal of the Royal Statistical Society: Series A, forthcoming.

Ghysels, E., Santa-Clara, P., and Valkanov R. (2006), Predicting Volatility: Get-ting the most out of return data sampled at different frequencies, Journal ofEconometrics, 131, 59–95.

Ghysels, E., Sinko, A., and Valkanov R. (2007), MIDAS regressions: Further Re-sults and new Directions, Econometric Reviews, 26, 53-90. 131, 59–95.

Gospodinov, N. and S. Ng (2013), Commodity Prices, Convenience Yields andInflation, Review of Economics and Statistics 95, 206–219.

Hardle, W. (1990), Applied Nonparametric Regression, Cambridge UniversityPress.

Hamilton, J.D. (2008), Daily Monetary Policy Shocks and New Home Sales, Jour-nal of Monetary Economics, 55, 1171-1190.

Hodrick, R., and Prescott, E.C. (1997), Post war business cycles: An empirical in-vestigation, Journal of Money Credit and Banking, 1-16.

Hurvich, C.M., J.S. Simonoff, and C.-L. Tsai (1998), Smoothing Parameter Selec-tion in Nonparametric Regression Using an Improved Akaike InformationCriterion, Journal of the Royal Statistical Association, Series B, 60, 271–293.

Kalaba, R. and Tesfatsion, L. (1989), Time-Varying Linear Regression Via Flexi-ble Least Squares, Computers and Mathematics with Applications, 17, 1215–1245.

Kilian, L. (2008), A Comparison of the Effects of Exogenous Oil Supply Shockson Output and Inflation in the G7 Countris, Journal of European EconomicAssociation, 6, 78-121.

Kozicki, S. (1997), Predicting Real Growth and Inflation with the Yield Spread,Fed. Reserve Bank Kansas City Econ. Review, 82, 39-57.

Kuzin, V., Marcellino, M., and Schumacher, C. (2009), MIDAS versus Mixed- Fre-quency VAR: Nowcasting GDP in the Euro Area, Discussion Paper series 1:Economic studies, No. 07/2009, Deutsche Bundesbank.

22

Page 23: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Lutkepohl, H. (1993), The Sources of the U.S. Money Demand Instability, Empir-ical Economics 18, 729–743.

Marcellino, M., Stock, J.H., and Watson, M.W. (2006), A Comparison of Directand Iterated Multistep AR Methods for Forecasting Macroeconomic TimeSeries, Journal of Econometrics, 135, 499-526

Modugno, M. (2013), Now-casting Inflation using High-Frequency Data, Inter-national Journal of Forecasting 29, 664-675.

Monteforte, L., and Moretti, G. (2013), Real Time Forecasts of Inflation: the Roleof Financial Variables, 32, 51-61.

Stock, J.H., and Watson, M.W. (1999), Forecasting Inflation, Journal of MonetaryEconomics, 44, 293-335.

Stock, J.H., and Watson, M.W. (2003), Forecasting Output and Inflation: the Roleof Asset Prices, Journal of Economic Literature, 41, 778-829.

23

Page 24: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Figure 1: From top to bottom: exponentially declining, hump-shaped, linear andcycical, sign-changing weights for p + 1 ∈ {20, 40, 60}

Note: The weighting functions (13) - (16) employed in (9)-(11) are presented here for T =

100, where for the exponentially declining weights, θ1 = 7 · 10−4, θ2 = −6 · 10−3, while forthe humped-shaped weights θ1 = 8 · 10−2 and θ2 = θ1/10, θ2 = θ1/20 and θ2 = θ1/30 forp + 1 = 20, p + 1 = 40, p + 1 = 60, respectively, yielding a maxium at the lag 5, 10 and 15,respectively.

24

Page 25: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Table 1: In-sample MSE-ratios

High-frequency lags p+1 = 20 p+1 = 40 p+1 = 60

T α1 = 0.2 α1 = 0.3 α1 = 0.4 α1 = 0.2 α1 = 0.3 α1 = 0.4 α1 = 0.2 α1 = 0.3 α1 = 0.4

exp. declining weights

λMSE

100 0.26 0.31 0.35 0.59 0.61 0.73 0.63 0.71 0.81200 0.29 0.36 0.42 0.73 0.73 0.88 0.72 0.89 1.10400 0.36 0.47 0.60 0.88 0.92 1.19 0.84 1.09 1.37

λAIC

100 0.38 0.43 0.47 0.77 0.80 0.89 1.05 1.02 1.07200 0.43 0.49 0.56 0.81 0.88 1.00 1.11 1.18 1.33400 0.51 0.60 0.73 0.94 1.09 1.30 1.18 1.34 1.53

hump-shaped weights

λMSE

100 0.41 0.61 0.74 0.73 1.07 1.38 0.89 1.13 1.48200 0.56 0.76 0.92 0.98 1.48 1.74 1.27 1.46 1.69400 0.75 0.98 1.20 1.40 1.67 1.88 1.46 1.69 1.95

λAIC

100 0.47 0.63 0.84 1.13 1.33 1.52 1.58 1.53 1.80200 0.62 0.90 1.28 1.32 1.71 2.04 1.75 1.80 2.04400 0.92 1.45 1.50 1.68 2.12 2.54 2.01 2.17 2.49

linear weights

λMSE

100 0.24 0.26 0.25 0.34 0.34 0.35 0.28 0.21 0.35200 0.25 0.26 0.23 0.38 0.37 0.36 0.35 0.23 0.33400 0.29 0.28 0.22 0.36 0.34 0.30 0.38 0.26 0.31

λAIC

100 0.36 0.38 0.40 0.60 0.60 0.63 0.65 0.48 0.81200 0.37 0.38 0.35 0.64 0.63 0.61 0.87 0.58 0.83400 0.44 0.42 0.35 0.66 0.62 0.55 0.97 0.66 0.79

sign-changing weights

λMSE

100 0.10 0.06 0.04 0.42 0.26 0.17 0.85 0.70 0.54200 0.06 0.03 0.02 0.28 0.16 0.11 0.68 0.46 0.32400 0.04 0.02 0.01 0.17 0.09 0.06 0.50 0.30 0.27

λAIC

100 0.53 0.69 0.06 0.58 0.32 0.20 1.19 0.86 0.63200 0.09 0.05 0.03 0.37 0.20 0.12 0.87 0.55 0.38400 0.05 0.03 0.02 0.22 0.11 0.07 0.63 0.38 0.20

Note: The entries are the in-sample MSE ratios of the nonparametric SLS estimator relative tothe parametric NLS estimator with an exponential Almon lag based on two parameters. Dataare generated as in (9) – (11), where the weights are generated by (13) – (16). The nonparametricestimator is computed using the MSE minimal choice of the smoothing parameter

(λMSE

)and

by using the modified AIC criterion(λAIC

). The number of replications is 1,000.

25

Page 26: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Table 2: Out-of-sample forecast comparison

High-frequency lags p+1 = 20 p+1 = 40 p+1 = 60

T α1 = 0.2 α1 = 0.3 α1 = 0.4 α1 = 0.2 α1 = 0.3 α1 = 0.4 α1 = 0.2 α1 = 0.3 α1 = 0.4

exp. declining weights

λMSE

100 1.04 1.04 1.04 1.03 1.03 1.04 1.18 1.16 1.21200 1.07 1.06 1.06 1.06 1.07 1.08 1.07 1.07 1.07400 1.09 1.09 1.09 1.08 1.08 1.08 1.07 1.07 1.07

λAIC

100 1.04 1.04 1.04 1.04 1.03 1.04 1.01 0.99 1.04200 1.07 1.06 1.06 1.05 1.05 1.05 1.05 1.05 1.04400 1.09 1.09 1.08 1.06 1.06 1.05 1.05 1.05 1.05

hump-shaped weights

λMSE

100 1.04 1.04 1.04 0.98 0.91 0.98 1.11 0.88 0.98200 1.07 1.07 1.06 1.03 0.96 1.03 0.94 0.78 0.89400 1.09 1.09 1.08 1.05 1.01 1.06 0.97 0.77 0.95

λAIC

100 1.04 1.04 1.04 1.00 0.92 0.99 0.95 0.75 0.84200 1.07 1.07 1.06 1.04 0.97 1.03 0.95 0.78 0.89400 1.09 1.09 1.08 1.05 1.02 1.06 0.97 0.77 0.94

linear weights

λMSE

100 1.05 1.05 1.05 1.00 1.02 1.01 1.21 1.20 1.20200 1.06 1.06 1.06 1.05 1.05 1.05 1.04 1.04 1.04400 1.09 1.08 1.08 1.05 1.05 1.05 1.04 1.04 1.04

λAIC

100 1.06 1.05 1.05 1.00 1.01 1.01 1.01 1.00 1.00200 1.07 1.07 1.06 1.05 1.05 1.05 1.05 1.05 1.04400 1.09 1.09 1.08 1.05 1.05 1.05 1.05 1.05 1.05

sign-changing weights

λAIC

100 0.82 0.70 0.61 1.04 1.06 1.07 1.11 0.98 1.15200 0.86 0.72 0.62 1.07 1.08 1.11 1.04 0.97 1.16400 0.86 0.72 0.63 1.07 1.09 1.10 1.08 1.02 1.16

λAIC

100 0.82 0.70 0.61 1.00 0.97 0.92 0.95 0.85 0.99200 0.85 0.71 0.62 1.02 0.98 0.93 1.02 0.90 1.02400 0.85 0.71 0.62 1.02 0.98 0.93 1.04 0.95 1.02

Note: The entries are the MSE ratios of the nonparametric SLS estimator relative to the paramet-ric NLS estimator with an exponential Almon lag based on two parameters for exp. decliningand humped-shaped weights and three parameters for the sign-changing weights. Data aregenerated as in (9) – (11), where the weights are generated by (13) – (16). The nonparametricestimator is computed using the MSE minimal choice of the smoothing parameter

(λMSE

)and

by using the modified AIC criterion(λAIC

). The number of replications is 250.

26

Page 27: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Figure 2: monthly Euro Area HPCI inflation rate

-.6

-.4

-.2

.0

.2

.4

.6

.8

00 01 02 03 04 05 06 07 08 09 10 11 12 13

Figure 3: adjusted R2 as a function of the lag length p

.0

.1

.2

.3

.4

.5

.6

5 10 15 20 25 30 35 40

27

Page 28: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Figure 4a: Estimated lag distributions with p + 1 = 20

Figure 4b: Estimated lag distributions with p + 1 = 30

Figure 4c: Estimated lag distributions with p + 1 = 50

Note: OLS (dashed line), nonparametric SLS (straight line) and parametricNLS (dashed-dotted line) estimates for h = 1 and the full sample based on168 monthly observations.

28

Page 29: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Table 3: Preliminary analysis: in-sample regressions

yt: monthly inflation rate (EU area)regressor (i) (ii) (iii)

yt−1 0.265 0.110 0.140(3.53) (1.81) (1.77)

HWWAt−1 — 0.318 0.318(10.1) (10.0)

HWWAt−2 — — –.0238(-0.59)

const 0.123 0.143 0.139(6.85) (10.0) (8.53)

R2 0.070 0.425 0.426LM(6) 0.599 0.545 0.592

LM(12) 0.522 0.616 0.631Note: t-statistics presented in parentheses. LM(k) reports the p-value of theLM (Breusch-Godfrey) test for autocorrelation up to the lag order k.

29

Page 30: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Table 4: In-sample estimation of MIDAS regressionyt+h = α0 + α1 ∑

pj=0 ωj (θ) xt−j + ut+h

p + 1 = 50 p + 1 = 30 p + 1 = 10

h = 1α0 0.0016 0.0016 0.0017

(16.13) (16.09) 13.90

α1 0.358 0.359 0.169(9.86) (9.94) (5.57)

θ1 0.158 0.151 0.352(2.31) (2.25) (1.15)

θ2 -0.010 -0.009 -0.030(-2.90) (-2.85) (-1.02)

R2 NLS 0.443 0.444 0.202

R2 SLS 0.451 0.479 0.199

h = 2α0 0.0017 0.0016 0.0017

(12.42) (12.34) (12.49)

α1 0.059 0.069 -0.015(1.08) (1.38) (-0.53)

θ1 0.254 0.309 -2.045(0.41) (0.43) (-0.31)

θ2 -0.007 -0.009 0.205(-0.42) (-0.43) (0.28)

R2 NLS 0.019 0.002 0.006

R2 SLS 0.051 0.007 0.000

Note: Estimates based on 168 monthly observations. t-values arereported in parenthesis. R2 NLS is the R2 from the parametricMIDAS regression, while R2 SLS refers to the R2 from the non-parametric approach, where the smoothing parameter is selectedby the AIC criterion.

30

Page 31: Forecasting inflation rates using daily data ... - Statistik · Forecasting inflation rates using daily data: A nonparametric MIDAS approach Jor¨ g Breitung University of Cologne

Table 5: Out-of-sample forecast comparison

p + 1 = 20 p + 1 = 30 p + 1 = 40 p + 1 = 50

h = 1RMSE(x) 0.1516 0.1667 0.1735 0.1736RMSE(NLS) 0.1511 0.1541 0.1543 0.1621RMSE(SLS) 0.1509 0.1521 0.1512 0.1584

h = 2RMSE(x) 0.1955 0.1953 0.1952 0.1982RMSE(NLS) 0.1982 0.1962 0.1971 0.1983RMSE(SLS) 0.1935 0.1966 0.1969 0.1948

Note: RMSE(x) indicates the root mean squared error using the mean of p + 1daily observations as the predictor. RMSE(NLS) and RMSE(SLS) denote the rootmean squared error of the parametric MIDAS based on an exponential Almonlag distribution with k = 2 and the nonparametric MIDAS estimator, where thesmoothing parameter λ is determined by the AIC criterion. Initial estimationsample uses Te = 84 (Te = 83) months. Forecasts are obtained with a rollingwindow of size Te observations by successively dropping the first period in thesample and incorporating the following period.

31