Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert...

52
Stock Market Volatility and Learning 1 Klaus Adam 2 , Albert Marcet 3 Juan Pablo Nicolini 4 April 2, 2012 1 Thanks go to Davide Debortoli, Luca Dedola, George Evans, Katharina Greulich, Seppo Honkapohja, Ricardo Lagos, Erzo Luttmer, Bruce McGough, Bruce Preston, Jaume Ventura, Joachim Voth, IvÆn Werning and Raf Wouters for inter- esting comments and suggestions. We particularly thank Philippe Weil for a very interesting and stimulating discussion. Oriol Carreras, Sofa Bauducco and Davide Debortoli have supported us with outstanding research assistance. Klaus Adam ac- knowledges support from the ERC Grant No. 284262. Albert Marcet acknowledges support from CIRIT (Generalitat de Catalunya), Plan Nacional project ECO2008- 04785/ECON (Ministry of Science and Education, Spain), CREI, Axa Research Fund, Programa de Excelencia del Banco de Espaæa and the Wim Duisenberg fel- lowship from the European Central Bank. The views expressed herein are solely those of the authors and do not necessarily reect the views of the Federal Reserve Bank of Minneapolis or the Federal Reserve System. 2 University of Mannheim and CEPR 3 Institut dAnlisi Econmica CSIC, ICREA, Barcelona GSE, UAB and CEPR 4 Universidad Di Tella and Federal Reserve Bank of Minneapolis

Transcript of Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert...

Page 1: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Stock Market Volatility and Learning1

Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4

April 2, 2012

1Thanks go to Davide Debortoli, Luca Dedola, George Evans, KatharinaGreulich, Seppo Honkapohja, Ricardo Lagos, Erzo Luttmer, Bruce McGough, BrucePreston, Jaume Ventura, Joachim Voth, Iván Werning and Raf Wouters for inter-esting comments and suggestions. We particularly thank Philippe Weil for a veryinteresting and stimulating discussion. Oriol Carreras, Sofía Bauducco and DavideDebortoli have supported us with outstanding research assistance. Klaus Adam ac-knowledges support from the ERC Grant No. 284262. Albert Marcet acknowledgessupport from CIRIT (Generalitat de Catalunya), Plan Nacional project ECO2008-04785/ECON (Ministry of Science and Education, Spain), CREI, Axa ResearchFund, Programa de Excelencia del Banco de España and the Wim Duisenberg fel-lowship from the European Central Bank. The views expressed herein are solelythose of the authors and do not necessarily re�ect the views of the Federal ReserveBank of Minneapolis or the Federal Reserve System.

2University of Mannheim and CEPR3Institut d�Anàlisi Econòmica CSIC, ICREA, Barcelona GSE, UAB and CEPR4Universidad Di Tella and Federal Reserve Bank of Minneapolis

Page 2: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Abstract

We study a standard consumption based asset pricing model with rationalinvestors who entertain subjective prior beliefs about price behavior. Opti-mal behavior then dictates that investors learn about price behavior frompast price observations. We show that this imparts momentum and meanreversion into the equilibrium behavior of the price dividend ratio, similar towhat can be observed in the data. Estimating the model on U.S. stock pricedata using the method of simulated moments, we show that it can quan-titatively account for the observed stock price volatility, the persistence ofthe price-dividend ratio, and the predictability of long-horizon returns. Forreasonable degrees of risk aversion, the model also passes a formal statis-tical test for the overall goodness of �t, provided one excludes the equitypremium from the set of moments to be matched.

JEL Class. No.: G12, D84

Page 3: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

"Investors, their con�dence and expectations buoyed by past priceincreases, bid up speculative prices further, thereby enticing moreinvestors to do the same, so that the cycle repeats again and again.�

Irrational Exuberance, Shiller (2005, p. 56)

1 Introduction

The purpose of this paper is to show that a very simple asset pricing modelis able to quantitatively reproduce a variety of stylized asset pricing facts ifone allows for slight deviations from rational expectations while maintainingthe assumption of rational behavior.

We study a simple variant of the Lucas (1978) model with standard timeseparable preferences. It is well known that the asset pricing implicationsof this model under rational expectations (RE) are at odds with basic facts,such as the observed high persistence and volatility of the price dividendratio, the high volatility of stock returns, the predictability of long horizonexcess stock returns, and the high level of the risk premium.

We stick to Lucas� framework but allow for agents whose prior beliefsabout stock price behavior deviate slightly from those assumed under RE.Investors nevertheless hold a consistent system of subjective beliefs about allpayo¤-relevant random variables that are beyond their control; this includesbeliefs about model endogenous variables, such as competitive stock marketprices, as well as model exogenous variables, such as the dividend and incomeprocesses. Given these subjective beliefs, investors maximize utility subjectto their budget constraints. We call such agents �internally rational�, as theyknow all internal aspects of their individual decision problem and behaveoptimally given this knowledge. They just do not know the exact modelgenerating stock prices or external variables more generally.

We employ this setup to relax the standard assumption that agents haveperfect knowledge about how a certain history of fundamental shocks mapsinto a market outcome for the stock price.1 We assume that agents expressthis lack of knowledge by specifying a subjective joint distribution for thebehavior of stock prices, incomes and dividends over time. In such a setting,

1Such uncertainty may arise from a lack of common knowledge of investors�price anddividend beliefs, as is explained in detail in Adam and Marcet (2011).

1

Page 4: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

agents optimally update their subjective expectations about stock price be-havior in the light of realized market outcomes, so that agents�stock priceexpectations in�uence stock prices and observed stock prices feed back intoagents�expectations. This self-referential aspect of the model turns out tobe key for generating stock price volatility of the kind that can be observedin the data.

We demonstrate the ability of the model to produce data-like behaviorby deriving a number of analytical results about the behavior of stock pricesthat is implied by a general class of belief updating rules. Speci�cally, weshow that learning from market outcomes imparts �momentum� on stockprices around their RE value, which gives rise to sustained deviations of theprice dividend ratio from its mean, as can be observed in the data. Suchmomentum arises because if agents�expectations about stock price growthincrease in a given period, the actual growth rate of prices has a tendency toincrease beyond the fundamental growth rate, thereby reinforcing the initialbelief of higher stock price growth through the feedback from outcomes tobeliefs. At the same time, the model displays �mean reversion�over longerhorizons, so that even if subjective stock price growth expectations are veryhigh (or very low) at some point in time, they will eventually return tofundamentals. The model thus displays price cycles of the kind described inthe opening quote above.

For the quantitative evaluation of the learning model, we consider aspeci�c system of subjective beliefs which allows for subjective prior un-certainty about the average growth rate of stock market prices. Internalrationality then dictates that agents�beliefs optimally react to the realiza-tion of market prices. In particular, agents optimally use a constant gainmodel of adaptive learning to update their conditional expectations of onestep ahead risk-adjusted price growth, while no such updating occurs in theabsence of prior uncertainty. In our empirical section we document that thequantitative �t with the data improves tremendously, if agents place a smallbut strictly positive weight on observed market information.

To evaluate the learning model we �rst consider how well the modelmatches asset pricing moments individually, just as many papers on stockprice volatility have done. We use formal structural estimation based on themethod of simulated moments (MSM) and we derive the asymptotic distri-bution for test statistics adapting results of Du¢ e and Singleton (1993). We�nd that the model can quantitatively match most of the moments we con-sider, including the volatility of stock market returns, the mean, persistenceand volatility of the price dividend ratio and the evidence on excess returnpredictability over long-horizons. With low relative risk aversion (equal to

2

Page 5: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

�ve) the t-statistics for these moments are all below two in one of our esti-mated models (see Table 4 in section 5.2).

We also perform a formal econometric test for the overall goodness of�t of our consumption based asset pricing model. As far as we know, thisis a more stringent test than has been applied in previous papers matchingstock price volatility, but a natural one to explore given our MSM strat-egy. When risk aversion is as high as in Campbell and Cochrane (1999)the model marginally fails the overall goodness of �t test at the 1% level.When considering a lower degree of relative risk aversion (equal to �ve) andwhen we exclude the risk premium by excluding the risk free rate from theestimation, the p-value of the model for the overall goodness of �t increasesto more than 6%.

As we explain, this is a remarkable improvement relative to the perfor-mance of the model under RE, even though our model is extremely simpleand represents a minimal deviation from the standard asset pricing frame-work. We �nd it a striking observation that the quantitative asset pricingimplications of the standard model are not robust to such small departuresfrom rational price expectations and that this non-robustness is empiricallyso encouraging. This suggests that allowing for such minimal departuresfrom a strong assumption (RE) could be a promising avenue for researchmore generally.

The paper is organized as follows. In section 2 we discuss the relatedliterature. Section 3 presents the stylized asset pricing facts we seek tomatch. We outline the asset pricing model in section 4 where we also deriveanalytic results showing how - for a general class of belief systems - our modelcan qualitatively deliver the stylized asset pricing facts described before insection 3. Section 5 presents our MSM estimation and testing strategyand documents that the model with subjective beliefs can quantitativelyreproduce the stylized facts. Readers interested in obtaining a glimpse ofthe quantitative performance of our one parameter extension of the REmodel may directly jump to Table 2 in section 5.2.

2 Related Literature

A large body of literature documents that the basic asset pricing modelwith time separable preferences and RE has great di¢ culties in matchingthe observed volatility of stock returns.2 Models of learning have long beenconsidered as a promising avenue to match stock price volatility.

2See Campbell (2003) for an overview.

3

Page 6: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Stock price behavior under Bayesian learning has been studied in Tim-mermann (1993, 1996), Brennan and Xia (2001), Cecchetti, Lam, and Mark(2000), Cogley and Sargent (2008) and Pastor and Veronesi (2003), amongothers. Agents in these papers learn about the dividend or income processand then set the asset price equal to the discounted expected sum of divi-dends. As explained in Adam and Marcet (2011), this amounts to assumingthat agents know exactly how dividends and income map into prices, so thatthere is a rather asymmetric treatment of the issue of learning: while agentslearn about how dividends and income evolve, they are assumed to knowperfectly the stock price process, conditional on the realization of dividendsand income. As a result, in all these models stock prices represent redun-dant information given what agents are assumed to know and there exists nofeedback from market outcomes (stock prices) to beliefs. Since agents arethen learning about exogenous processes only, their beliefs are �anchored�by the exogenous processes and the volatility e¤ects resulting from learningare generally limited when considering standard time separable preferencespeci�cations. In contrast, we largely abstract from learning about the divi-dend and income processes and focus on learning about stock price behavior.Price beliefs and actual price outcomes then mutually in�uence each other.It is precisely this self-referential nature of the learning problem that impartsmomentum to expectations and is key in explaining stock price volatility.

A number of papers within the adaptive learning literature study agentswho learn about stock prices. Bullard and Du¤y (2001) and Brock andHommes (1998) show that learning dynamics can converge to complicatedattractors and that the RE equilibrium may be unstable under learningdynamics.3 Branch and Evans (2010) study a model where agents�algorithmto form expectations switches depending on which of the available forecastmodels is performing best. Timmermann (1996) analyzes a case with self-referential learning, assuming that agents use dividends to predict futureprice.4 Marcet and Sargent (1992) also study convergence to RE in a modelwhere agents use today�s price to forecast the price tomorrow in a stationaryenvironment with limited information. Cárceles-Poveda and Giannitsarou(2007) assume that agents know the mean stock price and �nd that learningdoes then not signi�cantly alter the behavior of asset prices. Chakrabortyand Evans (2008) show that a model of adaptive learning can account forthe forward premium puzzle in foreign exchange markets.

3Stability under learning dynamics is de�ned in Marcet and Sargent (1989).4Timmerman reports that this form of learning delivers even lower volatility than a

settings with learning about the dividend process only. It is thus crucial for our resultsthat agents use information on past price growth behavior to predict future price growth.

4

Page 7: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

We contribute relative to the adaptive learning literature by deriving thelearning and asset pricing equations from internally rational investor behav-ior. In addition, we use formal econometric inference and testing to showthat the model can quantitatively match the observed stock price volatil-ity. Finally, our paper also shows that the key issue for matching the datais that agents learn about the mean growth rate of stock prices from paststock prices observations.

Other papers have studied stock prices under rational market expec-tations when agents have asymmetric information or asymmetric beliefs,examples include Biais, Bossaerts and Spatt (2010) and Dumas, Kurshevand Uppal (2009). In contrast to the RE literature, the behavioral �nanceliterature seeks to understand the decision-making process of individual in-vestors by means of surveys, experiments and micro evidence, exploring theintersection between economics and psychology, see Shiller (2005) for a non-technical summary. We borrow from this literature an interest in deviatingfrom RE but are keen on making only a minimal deviation from the stan-dard approach: we assume that agents behave optimally given a consistentsystem of subjective beliefs that is assumed to be close to the RE beliefs.

3 Facts

This section describes stylized facts of U.S. stock price data that we seek toreplicate in our quantitative analysis. These observations have been exten-sively documented in the literature, we reproduce them here as a point ofreference using a single and updated data base.5

Since the work of Shiller (1981) and LeRoy and Porter (1981) it has beenrecognized that the volatility of stock prices in the data is much higher thanstandard RE asset pricing models suggest, given the available evidence onthe volatility of dividends. Figure 1 shows the evolution of the quarterlyprice dividend (PD) ratio in the United States. The PD ratio displays verylarge �uctuations around its sample mean (the bold horizontal line in thegraph): in the year 1932, for example, the quarterly PD ratio takes on valuesbelow 30, while in the year 2000 values close to 350. The standard deviationof the PD ratio (�PD) is almost one half of its sample mean (EPD). Wereport this feature of the data as Fact 1 in Table 1.

Figure 1 also shows that the deviation of the PD ratio from its samplemean are very persistent, so that the �rst order quarterly autocorrelation ofthe PD ratio (�PD;�1) is very high. We report this as Fact 2 in Table 1.

5Details on the data sources are provided in Appendix 7.1.

5

Page 8: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

0

100

200

300

400

1925 1930 1935 1940 1945 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 2005

Figure 1: Quarterly U.S. price dividend ratio 1927:1-2005:4

Related to the excessive volatility of prices is the observation that thevolatility of stock returns (�rs) in the data is almost four times the volatilityof dividend growth (��D=D). We report the volatility of returns as Fact 3in Table 1, and the mean and standard deviation of dividend growth at thebottom of the table.

While stock returns are di¢ cult to predict at short horizons, the PDratio helps to predict future excess stock returns in the long run. Moreprecisely, estimating the regression

Xt;n = c1n + c

2n PDt + ut;n (1)

where Xt;n is the observed real excess return of stocks over bonds fromquarter t to quarter t plus n years, and ut;n the regression residual, theestimate c2n is found to be negative, signi�cantly di¤erent from zero, andthe absolute value of c2n and the R-square of this regression, denoted R

2n,

increase with n. We choose to include the OLS regression results for the5-year horizon as Fact 4 in Table 1.6

6We focus on the 5-year horizon for simplicity, but obtain very similar results forother horizons. Our focus on a single horizon is justi�ed because chapter 20 in Cochrane(2005) shows that Facts 1, 2 and 4 are closely related: up to a linear approximation, thepresence of return predictability and the increase in the R2n with the prediction horizon n

6

Page 9: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

U.S. asset pricing facts, 1927:2-2005:4(quarterly real values, growth rates & returns in percentage terms)

Fact 1 Volatility of EPD 113.20PD ratio �PD 52.98

Fact 2 Persistence of �PD;�1 0.92PD ratio

Fact 3 Excessive return �rs 11.65volatility

Fact 4 Excess return c25 -0.0048predictability R25 0.1986

Fact 5 Equity premium Ers 2.41Erb 0.18

Dividend Mean Growth E�DD

0.0035

Behavior Std of Growth ��DD

2.98

Table 1: Stylized asset pricing facts

Finally, it is well known that through the lens of standard models realstock returns tend to be too high relative to short-term real bond returns,a fact often referred to as the equity premium puzzle. We report it as Fact5 in Table 1, which shows that the average quarterly real return on bondsErb is much lower than the corresponding return on stocks Ers :

Table 1 reports ten statistics. As we show in section 5, we can replicatethese statistics using a model that has only four free parameters.

are qualitatively a joint consequence of persistent PD ratios (Fact 2) and i:i:d: dividendgrowth. It is not surprising, therefore, that our model also reproduces the increasing sizeof c2n and R

2n with n: We match the regression coe¢ cients at the 5-year horizon to check

the quantitative model implications.

7

Page 10: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

4 The Model

We describe below a Lucas (1978) asset pricing model with rationally in-vesting agents who hold subjective prior beliefs about stock price behavior.The presence of subjective uncertainty implies that agents optimally updatetheir beliefs about stock price behavior from observed stock price realiza-tions. Using a generic updating mechanism, section 4.2 shows that suchlearning gives rise to oscillations of asset prices around their fundamentalvalue and qualitatively contributes to reconciling the Lucas asset pricingmodel with the empirical evidence. Section 4.3 then introduces a speci�csystem of prior beliefs that gives rise to constant gain learning and that weemploy in our empirical work in section 5.

4.1 Model Description

The Environment: Consider an economy populated by a unit mass ofin�nitely-lived investors, endowed with one unit of a stock that can be tradedon a competitive stock market and that pays dividend Dt consisting of aperishable consumption good. Dividends evolve according to

DtDt�1

= a "dt (2)

for t = 0; 1; 2; :::; where log "dt � iiN (� s2d2 ; s

2d) and a � 1. This implies

E("dt ) = 1; E�DD

� E�

DtDt�1

�= a and �2�D

D

� var�

DtDt�1

�= es

2d � 1.

To capture the fact that the empirically observed consumption process isconsiderably less volatile than the dividend process and to replicate theweak correlation between dividend and consumption growth, we assume thateach agent receives in addition an endowment Yt of perishable consumptiongoods. Total supply of consumption goods in the economy is then given bythe feasibility constraint Ct = Yt + Dt. Following the consumption-basedasset pricing literature, we impose assumptions directly on the consumptionsupply process7

CtCt�1

= a "ct ; (3)

where log "ct � iiN (� s2c2 ; s

2c) and (log "

ct ; log "

dt ) jointly normal. In our em-

pirical application, we follow Campbell and Cochrane (1999) and choosesc = 1

7sd and the correlation between log "ct and log "

dt equal to �c;d = 0:2.

7The process for Yt is then implied by feasibility.

8

Page 11: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Objective Function and Probability Space: Agent i 2 [0; 1] has astandard time-separable expected utility function8

EP0

1Xt=0

�t�Cit�1�

1�

where 2 (0;1) and Cit denotes consumption demand of agent i. Theexpectation is taken using a subjective probability measure P that assigns aconsistent set of probabilities to all external variables, i.e., all payo¤-relevantvariables that are beyond the agent�s control. Importantly, Cit denotes theagent�s consumption demand, while Ct denotes the total supply of consump-tion goods in the economy.

The competitive stock market assumption and the exogeneity of the div-idend and income processes imply that investors consider the process forstock prices fPtg and the income and dividends processes fYt; Dtg as ex-ogenous to their decision problem. The underlying sample (or state) space thus consists of the space of realizations for prices, dividends and in-come. Speci�cally, a typical element ! 2 is an in�nite sequence ! =fPt; Yt; Dtg1t=0. As usual, we let t denote the set of histories from periodzero up to period t and !t its typical element. The underlying probabilityspace is thus given by (;B,P) with B denoting the corresponding �-Algebraof Borel subsets of ; and P is the agent�s subjective probability measureover (;B). Expected utility is then de�ned as

EP0

1Xt=0

�t�Cit�1�

1� �Z

1Xt=0

�tCit(!

t)1�

1� dP(!): (4)

Our speci�cation of the probability space is more general than the oneused in other modeling approaches because we include also price histories inthe realization !t. Standard practice is to assume instead that agents knowthe exact mapping from a history of income and dividends to equilibriumasset prices Pt(Y t; Dt), so that market prices carry only redundant informa-tion. This allows - without loss of generality - to exclude prices from theunderlying state space. This practice is standard in models of rational ex-pectations, models with rational bubbles, in Bayesian RE model, and modelsincorporating robustness concerns. The standard practice amounts to im-posing a singularity in the joint density over prices, income and dividends,which is equivalent to assuming that agents know exactly the equilibrium

8We assume standard preferences so as to highlight the e¤ect of learning on asset pricevolatility.

9

Page 12: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

pricing function Pt(�). While being a convenient modeling device, assumingexact knowledge of this function is at the same time very restrictive, whichmakes it of interest to study the implication of (slightly) relaxing it. Adamand Marcet (2011) show that rational behavior is indeed perfectly com-patible with agents not knowing the exact form of the equilibrium pricingfunction Pt(�).9

Choice Set and Constraints: Agents make contingent plans for con-sumption Cit , bondholdings B

it and stockholdings S

it , i.e., choose the func-

tions �Cit ; S

it ; B

it

�: t ! R3 (5)

for all t � 0. Agent�s choices are subject to the budget constraint

Cit + Pt Sit +B

it � (Pt +Dt)Sit�1 + (1 + rt�1)Bit�1 + Yt (6)

for all t � 0, where rt�1 denotes the real interest rate on riskless bondsissued in period t � 1 and maturing in period t. The initial endowmentsare given by Si�1 = 1 and Bi�1 = 0, so that bonds are in zero net supply.To avoid Ponzi schemes and to insure existence of a maximum the followingbounds must hold

S � Sit � S (7)

B � Bit � B

for some �nite bounds with the property S < 1 < S; B < 0 < B.Maximizing Behavior (Internal Rationality): The investor�s prob-

lem then consists of choosing the sequence of functions fCit ; Sit ; Bitg1t=0 tomaximize (4) subject to the budget constraint (6) and the asset limits (7),where all constraints have to hold for all t almost surely in P. Later on,the probability measure P will be constructed by specifying some perceivedlaw of motion describing the agent�s view about the evolution of (P; Y;D)over time, together with a prior distribution about the parameters governingthis law of motion. Optimal behavior will then entail learning about theseparameters, in the sense that agents update their posterior beliefs about theunknown parameters in the light of new price, income and dividend observa-tions. For the moment, this learning problem remains �hidden�in the beliefstructure P.

9Speci�cally, with incomplete markets, i.e., in the absence of state contingent forwardmarkets for stocks, agents cannot simply learn the equilibrium mapping Pt(�) by observingmarket prices. Furthermore, if the preferences and beliefs of agents in the economy failto be common knowledge, then agents cannot deduce the equilibrium mapping from whatthey know.

10

Page 13: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Optimality Conditions: Since the objective function is concave andthe feasible set is convex, the agent�s optimal plan is characterized by the�rst order conditions�

Cit��

Pt = �EPt

h�Cit+1

�� Pt+1

i+ �EPt

h�Cit+1

�� Dt+1

i(8)�

Cit��

= �(1 + rt)EPt

h�Cit+1

�� i(9)

These conditions are standard except for the fact that the conditional ex-pectations are taken with respect to the subjective probability measure P.

4.2 Asset Pricing Implications: Analytical Results

This section presents analytical results that explain why the asset pricingmodel with subjective beliefs can explain the asset pricing facts presentedin Table 1.

As is well known, under RE the model is completely at odds with theseasset pricing facts. A routine calculation shows that the unique RE solutionof the model is given by

PREt =�a1� �"

1� �a1� �"Dt (10)

where

�" = Eh("ct+1)

� "dt+1

i= e (1+ )

s2c2 e� �c;dscs:

The PD ratio is then constant, return volatility equals approximately thevolatility of dividend growth and there is no (excess) return predictability,so that the model misses Facts 1 to 4 listed in Table 1. This holds inde-pendently of the parameterization of the model. Furthermore, even for veryhigh degrees of relative risk aversion, say = 80, the model implies a fairlysmall risk premium. This emerges because of the low correlation betweenthe innovations to consumption growth and dividend growth in the data(�c;d = 0:2).

10 The model thus also misses Fact 5 in Table 1.

10Under RE, the risk free rate is given by 1+r =��a� e (1+ )

s2c2

��1and the expected

equity return equals Et[(Pt+1 + Dt+1)=Pt] =��a� �"

��1. For �c;d = 0 there is thus no

equity premium, independently of the value for .

11

Page 14: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

We now characterize the equilibrium outcome under learning. One maybe tempted to argue that Cit+j can be substituted by Ct+j for j = 0; 1

in the �rst order conditions (8) and (9), simply because Cit = Ct holdsin equilibrium for all t.11 However, outside of strict rational expectationswe may have EPt

�Cit+1

�6= EPt [Ct+1] even if in equilibrium Cit = Ct holds

ex-post.12 To understand how this arises, consider the following simpleexample: suppose agents know the aggregate process for Dt and Yt. In thiscase, EPt [Ct+1] is a function only of the exogenous variables (Y

t; Dt). Atthe same time, EPt

�Cit+1

�is generally a function of price realizations also,

since in the eyes of the agent, optimal future consumption demand dependson future prices and, therefore, also on today�s prices whenever agents arelearning about price behavior. As a result, we have EPt

�Cit+1

�6= EPt [Ct+1],

so that one cannot routinely substitute individual by aggregate consumptionon the right-hand side of agent�s �rst order conditions (8) and (9).

Nevertheless, if in any given period t the optimal plan for period t+1 fromthe viewpoint of the agent is such that

�Pt+1(1� Sit+1)�Bit+1

�= (Yt +Dt)

is expected to be small according to the agent�s expectations EPt ; thenCt+1=Ct is very close to Cit+1=C

it . Hence, in this case one can rely on the

approximations13

EPt

"�Ct+1Ct

�� (Pt+1 +Dt+1)

#' EPt

"�Cit+1Cit

�� (Pt+1 +Dt+1)

#(11)

EPt

"�Ct+1Ct

�� #' EPt

"�Cit+1Cit

�� #; (12)

The following assumption provides su¢ cient conditions for this to be thecase:

Assumption 1 Given some asset holding constraints S < 1 < S and B <0 < B, we assume Yt to be high enough so that the approximations(11) and (12) hold with su¢ cient accuracy.

Intuitively, for high enough income Yt, the agent�s asset trading decisions

matter little for the agents�stochastic discount factor�Cit+1Cit

�� , allowing us

11Cit = Ct follows from market clearing and the fact that all agents are identical.12This is the case because the preferences and beliefs of agents are not assumed to be

common knowledge, so that agents do not know that Cit = Ct must hold in equilibrium.13Equations (11) and (12) follow from using the budget constraint (6) to express Cit+1

and by exploiting the fact that market clearing in period t implies that equilibrium choicesare such that Cit = Ct, S

it = 1 and B

it = 0.

12

Page 15: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

to approximate individual consumption in t + 1 by aggregate consumptionin t+ 1.14

With assumption 1, the risk-free interest rate solves

1 = �(1 + rt)EPt

"�Ct+1Ct

�� #; (13)

Furthermore, de�ning the subjective expectations of risk-adjusted stockprice growth

�t � EPt

�Ct+1Ct

�� Pt+1Pt

!(14)

and subjective expectations of risk-adjusted dividend growth

�Dt � EPt

�Ct+1Ct

�� Dt+1Dt

!;

the �rst order condition for stocks (8) implies that the equilibrium stockprice under subjective beliefs is given by

Pt =��Dt1� ��t

Dt; (15)

provided �t < ��1. The equilibrium stock price is thus increasing in (subjec-

tive) expected risk-adjusted dividend growth and also increasing in expectedrisk-adjusted price growth.

For the special case when agents know the RE growth rates �t = �Dt =

a1� �" for all t, equation (15) delivers the RE price outcome (10). Fur-thermore, when agents hold subjective beliefs about risk-adjusted dividendgrowth but objectively rational beliefs about risk-adjusted price growth,then with �Dt and �t denoting the respective posterior means of these be-liefs, equation (15) delivers the pricing implications derived in the BayesianRE asset pricing literature, as surveyed at the beginning of section 2. Tohighlight the fact that the improved empirical performance of the presentasset pricing model derives exclusively from the presence of subjective be-liefs about risk-adjusted price growth, we shall entertain assumptions thatare orthogonal to those made in the Bayesian RE literature. Speci�cally, weassume that agents know the true process for risk-adjusted dividend growth:

14Note that independent from their tightness, the asset holding constraints never preventagents from marginally trading or selling securities in any period t along the equilibriumpath, where Sit = 1 and B

it = 0 holds for all t.

13

Page 16: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Assumption 2 Agents know the process for risk adjusted dividend growth,i.e., �Dt � a1� �" for all t.

Under this assumption the asset pricing equation (15) simpli�es to:15

Pt =�a1� �"1� ��t

Dt: (16)

4.2.1 Stock Price Behavior under Learning

We now derive a number of analytical results regarding the behavior of assetprices over time. We start out with a general observation about the volatilityof prices and thereafter derive results about the behavior of prices over timefor a general belief updating scheme.

The asset pricing equation (16) implies that �uctuations in subjectiveprice expectations can contribute to the �uctuations in actual prices. Aslong as the correlation between �t and the last dividend innovation "

dt is

small (as occurs for the updating schemes for �t that we consider in thispaper), equation (16) implies

var

�ln

PtPt�1

�' var

�ln1� ��t�11� ��t

�+ var

�ln

DtDt�1

�: (17)

The previous equation shows that even small �uctuations in subjective pricegrowth expectations can signi�cantly increase the variance of price growth,and thus the variance of stock price returns, if �t �uctuates around valuesclose to but below ��1.

To determine the behavior of asset prices over time, one needs to takea stand on how the subjective price expectations �t are updated over time.The optimal updating rules are thereby dictated by the probability measureP. In section 4.3 we present a speci�c probability measure P that we employin our empirical application. Yet, to illustrate that the results we obtain inour empirical application do not depend on the speci�c measure assumed andto improve our understanding for the empirical performance of the model,we derive the analytical results in this section for a more general nonlinearbelief updating scheme.

15Some readers may be tempted to believe that entertaining subjective price beliefswhile entertaining objective beliefs about the dividend process is inconsistent with indi-vidual rationality. Adam and Marcet (2011) show, however, that there exists no suchcontradiction, as long as the preferences and beliefs of agents in the economy are notcommon knowledge.

14

Page 17: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Given that �t denotes the subjective one-step-ahead expectation of riskadjusted stock price growth, it appears natural to assume that the measureP implies that rational agents revise �t upwards (downwards) if they under-predicted (overpredicted) the risk adjusted stock price growth ex-post. Thisprompts us to consider measures P that imply updating rules of the form16

��t = ft

�Ct�1Ct�2

�� Pt�1Pt�2

� �t�1; �t�1

!(18)

for given non-linear updating functions ft : R2 ! R with the properties

ft(0;�) = 0 (19)

ft (�;�) increasing (20)

0 < � + ft (x;�) < �U (21)

for all (t; x; �) and for some constant �U 2 (a1� �"; ��1). Properties (19)and (20) imply that �t is adjusted in the same direction as the last predic-tion error, where the strength of the adjustment may depend on the currentlevel of beliefs, as well as on calendar time (e.g., on the number of observa-tions available to date). Property (21) is needed to guarantee that positiveequilibrium prices solving (16) always exist.

In section 4.3 below we provide an explicit system of beliefs P in whichagents optimally update beliefs according to a special case of equation (18).Updating rule (18) is more general and nests also other systems of beliefs, aswell as a range of learning schemes considered in the literature on adaptivelearning, such as the widely used least squares learning or switching gainslearning used by Marcet and Nicolini (2003).

To derive the equilibrium behavior of price expectations and price real-izations over time, we �rst use (16) to determine realized price growth

PtPt�1

=

�a+

a� ��t1� ��t

�"dt (22)

Combining the previous equation with the belief updating rule (18) oneobtains

��t+1 = ft+1

�T (�t;��t) ("

ct)� "dt � �t;�t

�(23)

16Note that �t is determined from observations up to period t� 1 only. This simpli�esthe analysis and it avoids simultaneity of price and forecast determination. This lag in theinformation is common in the learning literature. Di¢ culties emerging with simultaneousinformation sets in models of learning are discussed in Adam (2003).

15

Page 18: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

where

T (�;��) � a1� + a1� � ��

1� ��Given initial conditions (D0; P�1), and initial expectations �0, equation (23)completely characterizes the equilibrium evolution of the subjective priceexpectations �t over time. Given that there is a one-to-one relationshipbetween �t and the PD ratio, see equation (16), the previous equation alsocharacterizes the evolution of the equilibrium PD ratio under learning. High(low) price growth expectations are thereby associated with high (low) valuesfor the equilibrium PD ratio.

The properties of the second order di¤erence equation (23) can be il-lustrated in a 2-dimensional phase diagram for the dynamics of (�t; �t�1),which is shown in Figure 2 for the case where the shocks ("ct)

� "dt assumetheir unconditional mean value �".

17 The e¤ects of di¤erent shock realiza-tions for the dynamics will be discussed separately below.

The arrows in Figure 2 indicate the direction in which the vector (�t; �t�1)evolves over time according to equation (23), and the solid lines indicate theboundaries of these areas.18 Since we have a di¤erence equation rather thana di¤erential equation, we cannot plot the evolution of expectations exactly,as the di¤erence equation gives rise to discrete jumps in the vector (�t; �t�1)over time. Yet, if agents update beliefs only relatively weakly in responseto forecast errors, as will be the case for our estimated model later on, thenthese jumps will be correspondingly small. The arrows then suggest thatfor the case where the shocks assume their average values, the expectationsare likely to move in ellipses around the rational expectations equilibrium(�t; �t�1) = (a

1� �"; a1� �").

Consider, for example, point A in the diagram. At this point �t is alreadybelow its fundamental value a1� �", but the phase diagram indicates thatexpectations will fall further. This shows that there is momentum in pricechanges: the fact that agents at point A have become less optimistic relativeto the previous period (�t < �t�1) implies that price growth optimism andprices will fall further. Expectations move, for example, to point B overtime where they will start to revert direction and move on to point C, thendisplay upward momentum and move to point D, thereby displaying meanreversion. The elliptic movements imply that expectations (and thus thePD ratio) are likely to oscillate in sustained and persistent swings aroundthe RE value a1� �".

17Appendix 7.2 explains in detail the construction of the phase diagram.18The vertical solid line close to ��1 is meant to illustrate the restriction � < ��1.

16

Page 19: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

B

A

βt

βt­1

βt = βt­1

a1­γρε(REE belief)

δ­1

βt+1 = βt

C

D

Figure 2: Phase diagram illustrating momentum and mean-reversion

The e¤ects of the stochastic disturbances ("ct)� "dt is to shift the curve

labeled "�t+1 = �t" in Figure 2. Speci�cally, for realizations ("ct)� "dt > �"

this curve is shifted upwards. As a result, beliefs are more likely to increase,which is the case for all points below this curve. Conversely, for ("ct)

� "dt <�" this curve shifts downward, making it more likely that beliefs decreasefrom the current period to the next.

The previous results show that learning causes beliefs and the PD ratioto stochastically oscillate around its RE value. Such behavior will be key inexplaining the observed volatility and the serial correlation of the PD ratio,i.e., Facts 1 and 2 in Table 1. Also, from the discussion around equation(17) it should be clear that such behavior makes stock returns more volatilethan dividend growth, which contributes to replicating Fact 3. As discussedin Cochrane (2005), a serially correlated and mean reverting PD ratio givesrise to excess return predictability, i.e., contributes to matching Fact 4.

The momentum of changes in beliefs around the RE value of beliefs, aswell as the overall mean reverting behavior can be more formally capturedin the following results:

17

Page 20: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Momentum: If ��t > 0 and

�t � a1� ("ct)� "dt , (24)

then ��t+1 > 0. This also holds if all inequalities are reversed.

The result follows from the fact that condition (24) implies that the�rst argument in the f function on the right-hand side of equation (23) ispositive. Note that the expected value of random variables appearing onthe right-hand side of condition (24) is equal a1� �", which is the RE valueof risk-adjusted stock price growth. Therefore, if the updating function f issu¢ ciently close to linear in its �rst argument, the previous result impliesthat

Et�1[��t+1] > 0

whenever ��t > 0 and �t � a1� �e, so that beliefs have a tendency toincrease further following an initial increase, whenever beliefs are at or belowthe RE value. By the same token, beliefs have a tendency to decrease furtherfollowing an initial decrease, provided that beliefs are above or at the REvalue.

The following result shows formally that stock prices would eventuallyreturn to their (deterministic) RE value in the absence of further distur-bances, and that such reverting behavior occurs monotonically:19

Mean reversion: Consider an arbitrary initial belief �t 2 (0; �U ). In theabsence of further disturbances

limt!1

sup�t � a1� � limt!1

inf �t

Furthermore, if �t > a1� ; there is a period t0 � t such that �t isnon-decreasing between t and t0 and non-increasing between t0 andt00; where t00 is the �rst period where �t00 is arbitrarily close to a

1� :Symmetrically, if �t < a

1� .

The previous result implies that - absent any shocks - �t cannot stay awayfrom the RE value forever. Beliefs either converge to the (deterministic) REvalue or stay �uctuating around it forever. Any initial deviation, however,is eventually eliminated with the reversion process being monotonic.

Summing up, the previous results show that for a general set of nonlinearbelief updating rules, stock prices and beliefs �uctuate around their REvalues in a way that helps to qualitatively account for Facts 1 to 4 listed inTable 1.19See Appendix 7.3 for the proof under an additional technical assumption.

18

Page 21: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

4.3 Optimal Belief Updating: Constant Gain Learning

We now introduce a fully speci�ed probability measure P and derive theoptimal belief updating equation it implies. We employ this belief updatingequation in our empirical work in section 5. As we prove below, it guaranteesthat prices are geometrically ergodic, a condition that is required for theMSM estimation approach to be applicable, see Du¢ e and Singleton (1993).In addition, we show that this system of beliefs represents a small deviationfrom RE.

We parameterize the measure P such that the implied expectations aboutrisk-adjusted stock price and dividend growth can be chosen to lie arbitrarilyclose to the RE beliefs of these variables. We shall even assume that agentsknow the objective distributions for the dividend and aggregate consumptionprocesses (or alternatively for the dividend and income processes). In linewith Assumption 2, agents then hold objectively rational expectations aboutrisk-adjusted dividend growth. At the same time, we allow for subjective be-liefs about risk-adjusted stock price growth by allowing agents to entertainthe possibility that risk-adjusted price growth may contain a small and per-sistent time-varying component. This is motivated by the observation thatin the data there are periods in which the PD ratio increases persistently, aswell as periods in which the PD ratio falls persistently, see �gure 1. In anenvironment with unpredictable innovations to dividend growth, this impliesthe existence of persistent and time-varying components in stock market re-turns. For this reason, we consider agents who think that the process forrisk-adjusted stock price growth is the sum of a persistent component bt andof a transitory component "t�

CtCt�1

�� PtPt�1

= bt + "t

bt = bt�1 + �t

where the persistent process bt follows a random walk and "t � iiN(0; �2"),�t � iiN(0; �2�), independent of each other.

Besides the empirical appeal of the model, the previous setup is of the-oretical interest because there is a limiting case in which it gives rise toRE. This happens when agents believe �2� = 0 and assign probability oneto b0 = a1� �". Expectations about risk-adjusted stock price growth thenequals the RE value a1� �" in all periods.

In what follows we relax these beliefs and allow for a non-zero variance�2� , i.e., for the presence of a persistent time-varying component in pricegrowth. The setup then gives rise to a learning problem because agents

19

Page 22: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

observe only the realizations of risk-adjusted price growth, but not the per-sistent and transitory component separately. The learning problem thusconsist of optimally �ltering out the persistent component of price growth.Assuming that agents prior beliefs about b0 are centered at the RE valueand given by

b0 � N(a1� �"; �20)

and setting �20 equal to the steady state Kalman �lter uncertainty about bt,which is given by

�20 =��2� +

r��2�

�2+ 4�2��

2"

2;

agents�posterior beliefs at any time t are given by

bt � N(�t; �0)

with optimal updating implying that �t, de�ned in equation (14), recursivelyevolves according to

�t = �t�1 +1

�CtCt�1

�� PtPt�1

� �t�1

!(25)

The optimal (Kalman) gain is given by 1=� =��20 + �

2�

�=��20 + �

2� + �

2"

�and captures the strength with which agents optimally update their poste-riors in response to surprises.20

In the limiting case with vanishing innovations to the random walkprocess (�2� ! 0), agents� prior uncertainty vanishes (�20 ! 0) and theoptimal gain converges to zero (1=�! 0). As a result, �t ! a1� �" in dis-tribution for all t for any given distribution of prices. We prove below thatthis convergence result also applies when we consider instead the equilibriumprice distribution emerging from the beliefs with vanishing gain.

For our empirical application, we need to slightly modify the updatingequation (25) to guarantee that the bound �t < �U holds for all periodsand equilibrium prices always exist. The exact way in which this boundis imposed matters little for our empirical result, because the moments wecompute do not change much as long as �t is close to �

U only rarely over

20 In line with equation (18) we incorporate information with a lag, so as to eliminatethe simultaneity between prices and price growth expectations. The lag in the updatingequation could be justi�ed by a speci�c information structure where agents observe someof the lagged transitory shocks to risk-adjusted stock price growth.

20

Page 23: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

the sample length considered. To impose this bound, we consider in ourempirical application a concave, increasing and di¤erentiable function w :R+ ! (0; �U ) and modify the belief updating equation (25) to21

�t = w

�t�1 +

1

"�Ct�1Ct�2

�� Pt�1Pt�2

� �t�1

#!(26)

wherew(x) = x if x 2 (0; �L)

for some �L 2 (a1� �"; �U ). Beliefs thus continue to evolve according to(25), as long as they are below the threshold �L, while for higher beliefs wehave that w(x) � x. The modi�ed algorithm (26) satis�es the constraint(21) and can be interpreted as an approximate implementation of a Bayesianupdating scheme where agents have a truncated prior that puts probabilityzero on bt > �U .22 The learning setup will then give rise to a stationary andergodic equilibrium outcome and in the limit 1=� ! 0 to small deviationsfrom rational expectations:

Stationarity, Ergodicity, and Small Deviations from RE: Suppose agents�posterior beliefs evolve according to equation (26) and equilibrium pricesare determined according to equation (16). Then �t is geometricallyergodic for � su¢ ciently large. Furthermore, as 1=� ! 0; we haveE[�t]! a1� �" and V AR(�t)! 0.

The proof is based on results from Du¢ e and Singleton (1993) andcontained in appendix 7.4. Geometric ergodicity implies the existence ofa unique stationary distribution for �t that is ergodic and that is reachedfrom any initial condition. This justi�es the MSM approach to estimationpursued in section 5. The previous result also shows that risk-adjusted stockprice growth expectations have a distribution that is increasingly centeredat the RE value a1� �" as the gain parameter becomes vanishingly small.

21The exact functional form for w can be found in appendix 7.5.22The issue of bounding beliefs so as to insure that expected utility remains �nite is

present in many applications of both Bayesian and adaptive learning to asset prices. Theliterature has typically dealt with this issue by assuming that agents simply ignore obser-vations that would imply updating beliefs beyond the required bound, see Timmermann(1993, 1996), Marcet and Sargent (1989), or Evans and Honkapohja (2001). This approach,however, introduces a discontinuity in the simulated moments and creates di¢ culties forour MSM estimation in section 5, prompting us to pursue the di¤erentiable approach tobounding beliefs described above.

21

Page 24: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

From equation (16) it then follows that actual equilibrium prices also be-come increasingly concentrated at their RE value, so that the di¤erencebetween beliefs and outcomes becomes vanishingly small as 1=� ! 0. Thisproves that for small values of 1=� agent�s conditional expectations aboutrisk-adjusted price growth deviate only slightly from RE.

5 Quantitative Model Performance

This section evaluates the quantitative performance of our asset pricingmodel with subjective price beliefs and shows that it can robustly repli-cate Facts 1 to 4 listed in Table 1. We formally estimate and test the modelusing the method of simulated moments (MSM). This approach to struc-tural estimation and testing helps us focusing on the ability of the model toexplain the speci�c moments of the data described in Table 1.23

While the model gives rise to an equity premium (Fact 5 in Table 1),it tends to fall short of replicating the observed magnitude of this premiumfor reasonable degrees of relative risk aversion. This occurs even though thepredicted equity premium is much higher than under RE. To document thatthe ability of the model to replicate Facts 1 to 4 in Table 1 is not sensitiveto the assumed degree of risk aversion, we consider a setting with a veryhigh degree of risk aversion, asking the model then to replicate also the riskpremium, and a setting with a lower value, in which case we leave the riskpremium out of the estimation targets. For the high value of relative riskaversion we choose = 80, which is the steady state value of relative riskaversion used in Campbell and Cochrane (1999).24 The model then repli-cates all moments in Table 1, including the risk premium, but marginallyfails a formal statistical test for the overall goodness of �t. For low riskaversion we choose = 5. The model then replicates Facts 1 to 4 in Table1. It also passes a formal statistical test for the overall goodness of �t whenwe do not include the risk free rate, and it explains all individual momentswhen we exclude stock returns instead.23A popular alternative approach in the asset pricing literature has been to test if

agents��rst order conditions hold in the data. Hansen and Singleton (1982) pioneeredthis approach for RE models and Bossaerts (2003) provides an approach that can beapplied to models of learning. We pursue the MSM estimation approach here because itnaturally provides additional information on how the formal test for goodness of �t of themodel relates to the model�s ability to match the moments of interest. The results arethen easily interpretable, they point out which parts of the model �t well and which partsdo not, thus providing intuition about possible avenues for improving the model �t.24This value is reported on p.244 in their paper.

22

Page 25: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

The next section explains the method of simulated moments (MSM) ap-proach for estimating the model and the formal statistical test for evaluatingthe goodness of �t. The subsequent section reports on the estimation andtest outcomes.

5.1 MSM Estimation and Statistical Test

This section outlines the MSM approach and the formal test for evaluatingthe �t of the model. All technical details are contained in appendix 7.6.

For a given value of the coe¢ cient of relative risk aversion, there are fourfree parameters left in the model, comprising the discount factor �, the gainparameter 1=�, and the mean and standard deviation of dividend growth,denoted by a and ��D

D, respectively. We summarize these in the parameter

vector� �

��; 1=�; a; ��D

D

�:

Throughout the paper we restrict consideration to discount factors satisfying� < 1. The four parameters will be chosen so as to match the ten samplemoments in Table 1:25� bErs ; bEPD; b�rs ; b�PD; b�PD;�1; bc52; bR25; bErb ; bE�D

D; b��D

D

�(27)

Let bSN 2 Rs denote the subset of sample moments in (27) that will bematched in the estimation, with N denoting the sample size and s � 10.26Furthermore, let eS(�) denote the moments implied by the model for someparameter value �.27 The MSM parameter estimate b�N is de�ned as

b�N � argmin�

h bSN � eS(�)i0 b��1S;N h bSN � eS(�)i (28)

where b�S;N is an estimate of the variance-covariance matrix of the samplemoments bSN . The MSM estimate b�N chooses the model parameter such thatthe model moments eS(�) �t the observed moments bSN as close as possiblein terms of a quadratic form with weighting matrix b��1S;N . Appendix 7.6explains how we estimate b�S;N from the data. Adapting standard results

25Many elements listed in (27) are not sample moments but they are non-linear functionsof sample moments. This generates some technical complications, which are discussed inappendix 7.6. It would be more precise to refer to the elements in (27) as �sample statistics�,as we do in the appendix. For simplicity we avoid this terminology in the main text.26As discussed before, we sometimes exclude the risk premium from the estimation.27Note that the smooth bounding function w in equation (26) guarantees that a Monte-

Carlo approximation to eS(�) is di¤erentiable.23

Page 26: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

from MSM, we have the following results: for a given the list of momentsincluded in bSN , the estimate b�N is consistent and the best estimate amongstthose obtained with di¤erent weighting matrices.

The MSM estimation approach also provides an overall test of the model.Under the null hypothesis that the model is correct, we have

cWN � Nh bSN � eS(b�N )i0 b��1S;N h bSN � eS(b�N )i! �2s�4 as N !1 (29)

where convergence is in distribution. Furthermore, we obtain a proper as-ymptotic distribution for each element of the deviations bSN� eS(b�N ); so thatwe can build t-statistics that indicate which moments are better matched inthe estimation, see appendix 7.6 for details.

All asymptotic distribution results hinge on the assumption that b�S;N isinvertible, as the inverse of this matrix appears in equations (28) and (29).In our practical implementation of the MSM approach, we �nd that theestimate b�S;N obtained from the data happens to be nearly singular. Thedistribution of cWN in small samples is then not well approximated by theasymptotic chi-square statistic, as some moments are almost exact linearcombinations of other moments. These moments carry almost no additionalinformation, while the estimation procedure would ask them to be �ttedexactly. To eliminate this near singularity in b�S;N we drop one additionalmoment from the list of moments (27), using a formal procedure describedin detail in appendix 7.6. The procedure picks the moment that is mosthighly correlated with the remaining moments and suggests that we dropthe coe¢ cient from the �ve year ahead excess return regression bc52. In theempirical section below, the value of the regression coe¢ cient implied by theestimated model is always such that the t-statistic for this moment remainsbelow one. This happens despite the fact that information about bc52 has notbeen used in the estimation.

5.2 Estimation Results

This section presents the estimation results and test outcomes from usingthe MSM approach described in the preceding section.

Table 2 reports the estimation outcomes when we assume a high degreeof risk aversion ( = 80, in line with Campbell and Cochrane (1999)). Wethen use for estimation all asset pricing moments listed in equation (27),with the exception of the coe¢ cient c52 from the excess return regression.28

28As discussed in the previous section, c52 is excluded from the estimation to avoid thenear-singularity of the weighting matrix.

24

Page 27: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

The results in table 2 show that the estimated learning model successfullyreplicates all moments in the data, including the risk-premium. All the t-statistics for the individual moments are below two, with most of them evenassuming values below one. The model also quantitatively replicates thevalue of the regression coe¢ cient c52, which has not been included in theestimation.29

Figure 3 shows some realizations of the time series outcomes for the PDratio generated from simulating the estimated model for the same numberof quarters as numbers of observations in our data sample. The time serieslooks similar to that of the data, see �gure 1, so that the model also passesan informal �eyeball test�.

The discount factor estimated in Table 2 is close to but below one andthe estimated gain coe¢ cient is small. The latter implies that agents�risk-adjusted return expectations respond only 0.21% in the direction of the lastobserved forecast error, suggesting that the system of price beliefs in ourmodel represents indeed only a small deviation from RE beliefs. Understrict RE the reaction to forecast errors is assumed to be zero, but themodel then provides a very bad match with the data. It counterfactuallyimplies �rs � ��D=D, �PD = 0, and R25 = 0.30 The model then also cannotsimultaneously match Ers ; E�D=D and EPD, unlike is the case with thelearning model, see Table 2. This follows from the fact that the PD ratio isconstant under RE so that the mean asset return can be expressed as

E[Pt+1 +Dt+1

Pt] =

PD + 1

PDE[Dt+1Dt

]

If the RE model replicates the sample averages of the PD ratio and dividendgrowth in the data, the previous equation implies an average quarterly realstock return of 0:35%, while the data average for this moment is 2:41%.31

Interestingly, the learning model also gives rise to a signi�cantly larger

29The model also behaves well along other dimensions that have not been included inthe estimation, e.g., the model implied autocorrelation of quarterly returns is 0.14. Thecorresponding value in the data equals -0.06, which is estimated with a standard devationof 0.13, so that the t-statistic for this moment equals 1.59.30Due to the constant PD ratio, the autocorrelation of the PD ratio and the regression

coe¢ cient c52 would not be well de�ned under RE.31Exploiting the fact that the average PD ratio and average dividend growth rate are

estimated with uncertainty does not improve the situation much: assuming a model im-plied RE PD ratio that is two standard deviations below and a mean dividend growthrate that is two standard devations above their respective data averages, delivers an up-per bound for the quarterly real stock return of 0:74%, which still falls signi�cantly shortof the sample average.

25

Page 28: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

risk premium than its RE counterpart.32 For the estimated parameter valuesin Table 2, the quarterly real risk premium under RE is just 0.5%, whichfalls short of the 2.08% emerging in the model with learning.33 The learningmodel generates a higher equity premium than the RE model independentlyof the assumed degree of risk aversion. We provide an explanation for thisoutcome at the end of this section.

US data Estimated model(c52 not included)

Data moment Std. dev. Model moment t-statisticbSN;i b� bSi eSi(b�)Mean stock return Ers 2.41 0.45 2.16 0.57Mean bond return Erb 0.18 0.23 0.08 0.47Mean PD ratio EPD 113.20 15.15 115.85 -0.18Std.dev. stock return �rs 11.65 2.88 16.99 -1.89Std.dev. PD ratio �PD 52.98 16.53 70.93 -1.09Autocorrel. PD ratio �PD;�1 0.92 0.02 0.95 -1.52Excess return reg. coe¢ cient c25 -0.0048 0.002 -0.0063 0.73R2 of excess return regression R25 0.1986 0.082 0.2516 -0.06Mean dividend growth E�D=D 0.35 0.19 0.18 0.98Std. dev. dividend growth ��D=D 2.98 0.84 4.67 -2.02Discount factor b�N 0.9986Gain coe¢ cient 1=b�N 0.0021Test statistic cWN 17.23

Table 2: Estimation outcome for = 80

Our MSM approach naturally provides a measure for the overall goodnessof �t via the test statisticcWN . To the best of our knowledge, previous papersseeking to describe stock price volatility do not consider such a formal test.While the learning model is able to �t all moments individually and alsosigni�cantly improves upon the empirical performance of model under RE,the model marginally fails an overall test of the goodness of �t. Table

32The RE counterpart is the model with the same parameterization, except for 1=� = 0.33The learning model and the RE model imply the same risk free rate, as we assumed

that agents have objective beliefs about the aggregate consumption and dividend process.

26

Page 29: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

2 reports the value for the test statistic cWN . It has an �25 asymptoticdistribution and the p-value for the reported value is 0.41%, so that themodel would marginally fail a test at the 1% signi�cance level. Therefore,even if each asset pricing moment can be matched individually, some of thejoint deviations observed in the data are unlikely to happen if the model istrue. This shows that the cWN test statistic is a much stricter test than thatimposed by matching moments individually.

We now show that the model�s ability to replicate many aspects of thebehavior of stock prices does not hinge on the assumed high degree of riskaversion. Table 3 below reports estimation outcomes when assuming = 5.The fourth and �fth column in the table report the outcome when we againseek to match all moments, except for c52. With the exception of the averagestock return, the t-statistics for the individual moments are then again allbelow two and half of the t-statistics are below one. The failure of themodel to match the equity premium causes the cWN statistic to increasesigni�cantly relative to the case with a high degree of risk aversion. To showthat the equity premium is indeed the source of the di¢ culty for passing theoverall goodness of �t test, columns 6 and 7 in Table 3 report the estimationoutcome obtained when excluding the risk-free rate Erb from the estimation.The t-statistics for the majority of included moments then decreases andthe model comfortably passes the overall goodness of �t test: the p-valuefor the reported cWN statistic is 6.31%.34 We conclude that the model givesa very good �t of all moments in Table 1 even for low risk aversion withthe exception of the equity premium. The model nevertheless generates asizeable equity premium, but has di¢ culties to quantitatively match thisaspect of the data precisely.

34The asymptotic distribution of cWN is now �24, as there is one moment less in theestimation and the statistic is computed using only moments that are included in theestimation.

27

Page 30: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Figure 3: Simulated paths for the PD ratio, estimated model ( = 80)

28

Page 31: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

US data Estimated model Estimated model(c52 not included) (c52, Erb not included)

Data moment Std. dev. Model moment t-statistic Model moment t-statisticbSN;i b� bSi eSi(b�) eSi(b�)Ers 2.41 0.45 1.25 2.58 1.34 2.38Erb 0.18 0.23 0.42 -1.07 0.88 -3.07EPD 113.20 15.15 123 -0.68 116 -0.20�rs 11.65 2.88 10.59 0.37 7.86 1.32�PD 52.98 16.53 67.23 -0.86 53.90 -0.06�PD;�1 0.92 0.02 0.96 -1.84 0.95 -1.68c25 -0.0048 0.002 -0.0067 0.94 -0.0055 0.36R25 0.1986 0.082 0.23 -0.36 0.21 -0.18

E�D=D 0.35 0.19 0.03 1.73 0.17 0.95��D=D 2.98 0.84 2.16 0.99 2.10 1.04Discount factor b�N 0.9972 0.9999Gain coe¢ cient 1=b�N 0.0073 0.0070Test statistic cWN 57.02 8.92

Table 3: Estimation results for = 5

We have performed a range of robustness tests using a number of vari-ations of the model and of the estimation strategy. For example, we usedthe mean bond return in the estimation and dropped the mean stock returninstead, assuming as in Table 3 that = 5. Results for this case are shownin Table 4. All t-statistics then assume values well below 2, including thet-statistic for the mean stock return, which has not been used in the estima-tion. This estimation outcome performs better than that in many previouspapers matching moments on stock price volatility, as it successfully matchesindividual moments with a relatively low degree of risk aversion. Neverthe-less, the more stringent test statistic cWN increases to a value of 35 in this

29

Page 32: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

setting, implying that the overall �t of the model is rejected.

US data Estimated model(c52, Ers not included)

Data moment Std. dev. Model moment t-statisticbSN;i b� bSi eSi(b�)Ers 2.41 0.45 1.61 1.79Erb 0.18 0.23 0.45 -1.19EPD 113.20 15.15 113.74 -0.04�rs 11.65 2.88 12.92 -0.44�PD 52.98 16.53 72.32 -1.17�PD;�1 0.92 0.02 0.94 -1.08c25 -0.0048 0.002 -0.0067 0.96R25 0.1986 0.082 0.1681 0.36

E�D=D 0.35 0.19 0.06 1.55��D=D 2.98 0.84 2.89 0.10Discount factor b�N 0.9986Gain coe¢ cient 1=b�N 0.0080Test statistic cWN 35.7

Table 4: Matching bond returns, = 5

Furthermore, we considered a setting where agents also learn about risk-adjusted dividend growth, using the same weight 1=� for the learning mech-anism about dividend and price growth rates. The moments of the modeland t-statistics are then virtually unchanged relative to a setting withoutlearning about dividend growth. We �nd the same outcome when we usea model of learning that switches between ordinary least squares learningand constant gain learning, as in Marcet and Nicolini (2003). We also useddi¤erent values for the coe¢ cient of relative risk aversion . For lower val-ues of relative risk aversion around 2, we �nd that the model continues togenerate volatile stock prices, but not enough to quantitatively match thedata.

We now brie�y discuss why our model is also able to generate a sizablerisk premium for stocks. Surprisingly, the model generates a small ex-postrisk premium for stocks even when investors are risk neutral ( = 0). Tounderstand this feature, note that the realized gross stock return between

30

Page 33: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

period 0 and period N can be written as the product of three terms

NYt=1

Pt +DtPt�1

=NYt=1

DtDt�1| {z }=R1

��PDN + 1

PD0

�| {z }

=R2

�N�1Yt=1

PDt + 1

PDt| {z }=R3

:

The �rst term (R1) is independent of the way prices are formed, thus cannotcontribute to explaining the emergence of an equity premium in the modelwith learning. The second term (R2), which is the ratio of the terminalover the starting value of the PD ratio could potentially generate an equitypremium but is on average below one in our simulations of the learningmodel, while it is slightly larger than one under RE.35 The equity premium inthe learning model must thus be due to the last component (R3). This termis convex in the PD ratio, so that a model that generates higher volatility ofthe PD ratio (but the same mean value) will also give rise to a higher equitypremium. Therefore, because our learning model generates a considerablymore volatile PD ratio, it also gives rise to a larger ex-post risk premium.

6 Conclusions and Outlook

A very simple consumption based asset pricing model is able to quanti-tatively replicate a number of important asset pricing facts, provided oneslightly relaxes the assumption that agents perfectly know how stock pricesare formed in the market. We assume that agents formulate their doubtsabout market outcomes using a consistent set of subjective beliefs aboutprices which is close - but not equal - to the RE prior beliefs routinely as-sumed in the literature. Optimal behavior then dictates that agents learnabout the equilibrium price process from past price observations. This givesrise to a self-referential model of learning about prices that imparts momen-tum and mean reversion behavior into the price dividend ratio. As a result,sustained departures of asset prices from their fundamental value emerge,even though all agents act rationally in the light of their beliefs.

We submit our consumption based asset pricing model also to a formaleconometric test based on the method of simulated moments. The modelperforms remarkably well despite its simplicity. While the model gives riseto a signi�cant equity premium, it fails to match quantitatively the premiumfound in the data for reasonable degrees of relative risk aversion. When riskaversion is su¢ ciently high, and as high as in some of the previous work, the

35For the learning model we choose the RE PD ratio as our starting value.

31

Page 34: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

model can also replicate the equity premium, but we leave a full treatmentof this issue to future research.

Given the di¢ culties documented in the empirical asset pricing litera-ture in accounting for stock price volatility under RE, our results suggestthat models of learning may be economically more relevant than previouslythought. Indeed, the most convincing case for models of learning can bemade by explaining facts that appear �puzzling�from the RE viewpoint, aswe attempt to do in this paper.

The �nding that large asset price �uctuations can result from individ-ually rational �uctuations in investor optimism and pessimism is also rele-vant from a policy perspective. The desirability of policy responding to assetprice �uctuations will depend to a large extent on whether or not asset price�uctuations are fundamentally justi�ed.

32

Page 35: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

References

Adam, K. (2003): �Learning and Equilibrium Selection in a MonetaryOverlapping Generations Model with Sticky Prices,�Review of EconomicStudies, 70, 887�908.

Adam, K., and A. Marcet (2011): �Internal Rationality, Imperfect Mar-ket Knowledge and Asset Prices,� Journal of Economic Theory, 146,1224�1252.

Biais, B., P. Bossaerts, and C. Spatt (2010): �Equilibrium Asset Pric-ing and Portfolio Choice Under Asymmetric Information,�Review of Fi-nancial Studies, 23, 1503�1543.

Bossaerts, P. (2003): �Filtering Returns for Unspeci�ed Biases in Priorswhen Testing Asset Pricing Theory,� Review of Economic Studies, 70,1�24.

Branch, W., and G. W. Evans (2010): �Asset Return Dynamics andLearning,�Review of Financial Studies, 23, 1651�1680, University of Ore-gon mimeo.

Brennan, M. J., and Y. Xia (2001): �Stock Price Volatility and EquityPremium,�Journal of Monetary Economics, 47, 249�283.

Brock, W. A., and C. H. Hommes (1998): �Heterogeneous Beliefs andRoutes to Chaos in a Simple Asset Pricing Model,�Journal of EconomicDynamics and Control, 22, 1235�1274.

Bullard, J., and J. Duffy (2001): �Learning and Excess Volatility,�Macroeconomic Dynamics, 5, 272�302.

Campbell, J. Y. (2003): �Consumption-Based Asset Pricing,�inHandbookof Economics and Finance, ed. by G. M. Constantinides, M. Harris, andR. Stulz, pp. 803�887. Elsevier, Amsterdam.

Campbell, J. Y., and J. H. Cochrane (1999): �By Force of Habit: AConsumption-Based Explanation of Aggregate Stock Market Behavior,�Journal of Political Economy, 107, 205�251.

Carceles-Poveda, E., and C. Giannitsarou (2007): �Asset Pricingwith Adaptive Learning,�Review of Economic Dynamics (forthcoming).

33

Page 36: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Cecchetti, S., P.-S. Lam, and N. C. Mark (2000): �Asset Pricing withDistorted Beliefs: Are Equity Returns Too Good to Be True?,�AmericanEconomic Review, 90, 787�805.

Chakraborty, A., and G. Evans (2008): �Can Perpetual Learning Ex-plain the Forward Premium Puzzle?,� Journal of Monetary Economics,55(3), 477�490.

Cochrane, J. H. (2005): Asset Pricing (Revised Edition). Princeton Uni-versity Press, Princeton, USA.

Cogley, T., and T. J. Sargent (2008): �The Market Price of Risk andthe Equity Premium: A Legacy of the Great Depression?,� Journal ofMonetary Economics, 55, 454�476.

Duffie, D., and K. J. Singleton (1993): �Simulated Moments Estima-tion of Markov Models of Asset Prices,�Econometrica, 61, 929�952.

Dumas, B., A. Kurshev, and R. Uppal (2009): �Equilibrium Portfo-lio Strategies in the Presence of Sentiment Risk and Excess Volatility,�Journal of Finance, 64, 579�629.

Evans, G. W., and S. Honkapohja (2001): Learning and Expectationsin Macroeconomics. Princeton University Press, Princeton.

Hansen, L. P., and K. J. Singleton (1982): �Generalized InstrumentalVariables Estimation of Nonlinear Rational Expectations Model,�Econo-metrica, 50, 1269�1286.

LeRoy, S. F., and R. Porter (1981): �The Present-Value Relation: TestBased on Implied Variance Bounds,�Econometrica, 49, 555�574.

Lucas, R. E. (1978): �Asset Prices in an Exchange Economy,�Economet-rica, 46, 1426�1445.

Marcet, A., and J. P. Nicolini (2003): �Recurrent Hyperin�ations andLearning,�American Economic Review, 93, 1476�1498.

Marcet, A., and T. J. Sargent (1989): �Convergence of Least SquaresLearning Mechanisms in Self Referential Linear Stochastic Models,�Jour-nal of Economic Theory, 48, 337�368.

(1992): �The Convergence of Vector Autoregressions to Ratio-nal Expectations Equilibria,� in Macroeconomics: A Survey of Research

34

Page 37: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Strategies, ed. by A. Vercelli, and N. Dimitri, pp. 139�164. Oxford Uni-versity Press, Oxford.

Pastor, L., and P. Veronesi (2003): �Stock Valuation and LearningAbout Pro�tability,�Journal of Finance, 58(5), 1749�1789.

Shiller, R. J. (1981): �Do Stock Prices Move Too Much to Be Justi�edby Subsequent Changes in Dividends?,�American Economic Review, 71,421�436.

(2005): Irrational Exuberance: Second Edition. Princeton Univer-sity Press, Princeton, USA.

Timmermann, A. (1993): �How Learning in Financial Markets GeneratesExcess Volatility and Predictability in Stock Prices,�Quarterly Journalof Economics, 108, 1135�1145.

(1996): �Excess Volatility and Predictability of Stock Prices in Au-toregressive Dividend Models with Learning,�Review of Economic Stud-ies, 63, 523�557.

35

Page 38: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

7 Appendix

7.1 Data Sources

Our data is for the United States and has been downloaded from �The GlobalFinancial Database�(http://www.global�nancialdata.com). The period cov-ered is 1925:4-2005:4. For the subperiod 1925:4-1998:4 our data set corre-sponds very closely to Campbell�s (2003) handbook data set available athttp://kuznets.fas.harvard.edu/~campbell/ data.html.

In the calibration part of the paper we use moments that are based onthe same number of observations. Since we seek to match the return pre-dictability evidence at the �ve year horizon (c25 and R

25) we can only use data

points up to 2000:4. For consistency the e¤ective sample end for all othermoments reported in table 1 has been shortened by �ve years to 2000:4. Inaddition, due to the seasonal adjustment procedure for dividends describedbelow and the way we compute the standard errors for the moments de-scribed in appendix 7.6, the e¤ective starting date was 1927:2.

To obtain real values, nominal variables have been de�ated using the�USA BLS Consumer Price Index�(Global Fin code �CPUSAM�). The monthlyprice series has been transformed into a quarterly series by taking the indexvalue of the last month of the considered quarter.

The nominal stock price series is the �SP 500 Composite Price Index(w/GFD extension)�(Global Fin code �_SPXD�). The weekly (up to the endof 1927) and daily series has been transformed into quarterly data by takingthe index value of the last week/day of the considered quarter. Moreover,the series has been normalized to 100 in 1925:4.

As nominal interest rate we use the �90 Days T-Bills Secondary Market�(Global Fin code �ITUSA3SD�). The monthly (up to the end of 1933), weekly(1934-end of 1953), and daily series has been transformed into a quarterly se-ries using the interest rate corresponding to the last month/week/day of theconsidered quarter and is expressed in quarterly rates, i.e., not annualized.

Nominal dividends have been computed as follows

Dt =

�ID(t)=ID(t� 1)IND(t)=IND(t� 1) � 1

�IND(t)

where IND denotes the �SP 500 Composite Price Index (w/GFD extension)�described above and ID is the �SP 500 Total Return Index (w/GFD exten-sion)�(Global Fin code �_SPXTRD �). We �rst computed monthly dividendsand then quarterly dividends by adding up the monthly series. Following

36

Page 39: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Campbell (2003), dividends have been deseasonalized by taking averages ofthe actual dividend payments over the current and preceding three quarters.

7.2 Details on the phase diagram

The second order di¤erence equation (23) describes the evolution of beliefsover time and allows to construct the directional dynamics in the

��t; �t�1

�plane, as shown in Figure 2 for the case ("ct)

� "dt = 1. Here we show thealgebra leading to the arrows displayed in this �gure as well the e¤ectsof realizations ("ct)

� "dt 7 1. De�ne x0t � (x1;t; x2;t) ���t; �t�1

�. The

dynamics can then be described by

xt+1 =

x1;t + ft+1

��a1� +

a1� �(x1;t�x2;t)1��x1;t

�("ct)

� "dt � x1;t; x1;t�

x1;t

!The points in Figure 2 where there is no change in each of the elementsof x are the following: we have �x2 = 0 at points x1 = x2; so that the45o line gives the point of no change in x2, and �x2 < 0 above this line.We have �x1 = 0 for x2 = 1

� �x1(1��x1)

a1� �("c)� "d. For ("c)� "d = �" this is

the curve labelled "�t+1 = �t" in Figure 2, and we have �x1 > 0 belowthis curve. So for ("c)� "d = �", the zeroes for �x1 and �x2 intersectare at x1 = x2 = a1� �" which is the REE value and, interestingly, atx1 = x2 = ��1 which is the limit of rational bubble equilibria. Theseresults give rise to the directional dynamics shown in �gure 2. Finally, for("c)� "d > �" (("

c)� "d < �") the curve "�t+1 = �t" in Figure 2 is shifted

upwards (downwards) as indicated by the function x2 = 1� �

x1(1��x1)a1� �("c)� "d

.

7.3 Proof of mean reversion

To prove mean reversion for the general learning scheme (18) we need thefollowing additional technical assumption on the updating function ft:

Assumption A1 There is a � > 0 such that ft(�; �) is di¤erentiable in theinterval (��; �) for all t and all �Furthermore, letting

Dt � inf�2(��;�);�2(0;�U )

@ft(�; �)

@�,

we have 1Xt=0

Dt =1

37

Page 40: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

This is satis�ed by all the updating rules considered in this paper and bymost algorithms used in the stochastic control literature. For example, it isguaranteed in the OLS case where Dt = 1=(t+�1) and in the constant gainwhere Dt = 1=� for all t; �. The assumption would fail and

PDt <1, for

example, if the weight given to the error in the updating scheme is 1/t2: Inthat case beliefs could get �stuck�away from the fundamental value simplybecause updating of beliefs ceases to incorporate new information for t largeenough. In this case, the growth rate would be a certain constant but agentswould forever believe that the growth rate is another constant, di¤erent fromthe truth. Hence in this case agents would make systematic mistakes forever.Therefore, assumption A1 is likely to be satis�ed by any system of beliefsthat puts a "grain of truth" on the RE equilibrium.

The statement about limsup is equivalent to saying that if �t > a insome period t, then for any � > 0 su¢ ciently small, there is a �nite periodt00 > t such that �t00 < a+ �.

Fix � > 0 such that � < min(�; (�t � a)=2) where � is as in assumptionA1.

We �rst prove that there exists a �nite t0 � t such that

��et � 0 for all et such that t < et < t0, and (30)

��t0 < 0 (31)

To prove this, choose � = ��1� ��U

�. Since �t < �U and � > 0 it is

impossible that ��et � � for all et > t: Let t � t to be the �rst period where��t < �.

There are two possible cases: either i) ��t < 0 or ii) ��t � 0:In case i) we have that (30) and (31) hold if we take t0 = t.In case ii) �t can not decrease between t and t so that

�t � �t > a+ �

Furthermore, we have

T (�t;��t) = a+��t1� ��t

< a+�

1� ��t< a+

1� ��U= a+ �

where the �rst equality follows from the de�nition of T in the main text.The previous two relations imply

�t > T (�t;��t)

38

Page 41: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Therefore��t+1 = ft+1 (T (�t;��t)� �t;�t) < 0

and in case ii) we have that (30) and (31) hold for t0 = t+ 1.This shows that (30) and (31) hold for a �nite t0 as in the �rst part of

the statement of Mean Reversion in the text. Now we need to show thatbeliefs eventually fall below a+ � and do decrease monotonically.

Consider � as de�ned above. First, notice that given any j � 0, if

��t0+j

< 0 and (32)

�t0+j

> a+ � (33)

then

��t0+j+1 = ft0+j+1

�a+

��t0+j1� ��t0+j

� �t0+j ; �t0+j�< ft0+j+1

�a� �t0+j ; �t0+j

�(34)

< ft0+j+1���; �t0+j

�� ��Dt0+j+1 � 0 (35)

where the �rst inequality follows from (32), the second inequality from (33)and the third from the mean value theorem, � > 0 and Dt0+j+1 � 0. Assume,towards a contradiction, that (33) holds for all j � 0: Since (32) holds forj = 0, it follows by induction that ��t0+j � 0 for all j � 0 and, therefore,that (35) would hold for all j � 0 hence

�t0+j =

jXi=1

��t0+i + �t0 � ��jXi=1

Dt0+i + �t0

for all j > 0. Assumption A1 above would then imply �t ! �1 showingthat (33) can not hold for all j. Therefore there is a �nite j such that �t0+jwill go below a+ � and � is decreasing from t0 until it goes below a+ �.

For the case �t < a��, choosing � = � one can use a symmetric argumentto construct the proof.

7.4 Proof of Geometric Ergodicity

De�ning �t � ("ct)� "dt and using (22) and (26) we can write the learning

algorithm that gives the dynamics of �t as��t��t

�= F

��t�1��t�1

; �t�1

39

Page 42: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

where the �rst element of F , denoted F1, is given by the right side of (26)and F2(�;��; �t�1) � F1(�;��; �t�1)� �. Therefore,

F0t �

@F��; �t�1

�@��t ��t

� = w0t � � At; 1� 1� +Bt

At; � 1� +

1�Bt

for At = 1�

a��t�11���t�1

, Bt = 1�

a� ��t�1�t�1

(1���t�1)2 ; with w0t denoting the derivative of

w at period t. The eigenvalues of the matrix in brackets are

�+t ; ��t =

At + 1� 1� +Bt �

q�At + 1� 1

� +Bt�2 � 4At

2

Since At; Bt ! 0 for large � we have that �+t is the larger eigenvalue inmodulus and that the radicand is positive. We wish to �nd a uniformbound for �+t ; because given that jw0tj < 1 this will be a uniform bound forthe largest eigenvalue of F 0t : Such a bound will play the role of ��("t) in thede�nition of the "L2 unit circle condition" on p. 942 in Du¢ e and Singleton(1993) (henceforth DS).

Consider the function fa(x) = x+a+q(x+ a)2 � 4a for some constant

a > 0 and x large enough for the radicand to be positive. For " > 0 themean value theorem implies

fa(x+ ") �

0@1 + xq(x+ a)2 � 4a

1A "+ fa(x)Evaluating this expression at a = At; " = Bt and x = 1� 1=� we have

�+t � Bt +fAt(1� 1

�)

2< Bt + 1�

1

�for ��t�1 � 0 (36)

where we used

fAt(1�1

�) < At + 1�

1

�+

s�At + 1�

1

�2� 4At(1�

1

�) = 2

�1� 1

�Since fAt(�) is monotonic, using the expression for Bt we have

�+t �1

2fAt(1�

1

�+Bt) �

fAt(1� 1�)

2< 1� 1

�for ��t�1 < 0 (37)

40

Page 43: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

From (26) we have

��t �1

��t�1a

1� �1 +

��t�11� ��t�1

�� �t�1

�(38)

So, if ��t�1 � 0; using �t�1 > 0

��t �1

��t�1a

1�

"1 +

����t�1��1� ��t�1

#Therefore, adding the right side of this inequality to (37), and using theinequality for (36) we have that for all ��t�1

�+t � 1

a1� ��1� ��t�1

�2 a1�

��t�2

"1 +

����t�2��1� ��t�2

#!�t�1 + 1�

1

� 1

�2eK�t�2�t�1 + 1� 1

for a constant 0 < eK <1; where we used����t�2�� ; �t�1; �t�2 < �U :

Since w0 � 1, it is clear from the mean value theorem thateK�2�t�2�t�1+

1� 1� plays the role of ��("t) in the De�nition of "L

2 unit circle condition"of DS, where our � plays the role of �, and �t�1�t�2 the role of "t in DS.

Therefore, we need to check that E� eK�2�t�t�1 + 1� 1

�2< 1 for � large

enough. A routine calculation shows that

E

eK�2�t�t�1 + 1�

1

!2= 1� 1

�� 1�

"1� 1

�� 2

�1� 1

� eK�E��t�t�1

��eK2

�3E��2t �

2t�1�#

For � large the term in brackets is positive. Therefore,

E

eK�2�t�t�1 + 1�

1

!2< 1� 1

�< 1

This proves that for large � the variable �t satis�es DS�L2 unit-circle

condition, hence it satis�es DS�AUC condition, and Lemma 3 in DS guar-antees that �t is geometrically ergodic.

Now, adding a1� �t�1 on both sides of (38) and taking expectations atthe ergodic distribution we have

E��t�1 � �t�1a1�

�� E

���t�11� ��t�1

�t�1a1� �

(39)

41

Page 44: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Our previous argument shows that the right side is arbitrarily small for �large, therefore E�t�1 � E�t�1a1� : A similar argument shows that var�tgoes to zero as �!1: Therefore for � large �t � �L with arbitrarily largeprobability so that (38) holds as equality with arbitrarily large probability.Taking expectations on both sides for the realizations where this holds asequality, we have that E�t ! E�t�1a

1� = �RE as �!1 which completesthe proof.�

7.5 Di¤erentiable projection facility

The function w used in the di¤erentiable projection facility is

w(x) = U

(x if x � �L

�L + x��Lx+�U�2�L (�

U � �L) if �L < x � �U (40)

�L +1

2

��U � �L

�Clearly w is continuous, the only point where continuity is questionable isat x = �L; but it is easy to check that

limx%�L

w(x) = limx&�L

w(x) = �L

limx%�L

w0(x) = limx&�L

w0(x) = 1

limx!1

w(x) = �U

In our numerical applications we choose �U so that the implied PD rationever exceeds UPD = 500 and �L = ��1 � 2(��1 � �U ), which implies thatthe dampening e¤ect of the projection facility starts to come into e¤ect forvalues of the PD ratio above 250. Therefore, this dampening is applied infew observations.

7.6 Details of the MSM procedure

7.6.1 Estimation and testing

Here we provide an estimator for the covariance matrix of the statisticsb�S;N ; we give some more details about consistency of the estimator de�nedin (28), and derive the needed asymptotic distribution results.

Let N be the sample size, (y1; :::;yN ) the observed data sample, with ytcontaining m variables. The standard version of the method of simulated

42

Page 45: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

moments (MSM) is to �nd parameter values that make the moments ofthe structural model close to sample moments cMN � 1

N

PNt=1 h(yt) for a

given moment function h : Rm ! Rq. However, many of the statistics thatwe wish to match in Table 1 are not of this form, but they are functionsof moments. Formally, the statistics in table 1 can be written as bSN �S(cMN ) for a statistic function S : Rq ! Rs mapping sample momentscMN into the considered statistics. Explicit expressions for h(�) and S(�) inour particular application are stated in section 7.6.2. In the text we talkedabout "moments" as describing all statistics to be matched in (27). In thisappendix we properly use the term "statistic" as possibly di¤erent from"moment".

We propose to base our MSM estimates and tests on matching the statis-tics bSN . Since this deviates from standard MSM we need to adapt standardproofs and asymptotic distribution results. The proofs follow standard stepsso we provide an outline of the argument and the derivations. The statis-tics to be matched bSN can be all or a subset of the statistics in (27), thereason we sometimes consider a subset will be discussed in detail after re-sult "Asymptotic Distribution of MSM" below. Since we have endogenousstate variables in the model (namely past beliefs �t�1; �t�2) the asymptotictheory result needed is from Du¢ e and Singleton (1993) (DS).

Let yt(�) be the series generated by the structural model at hand forparameter values � and some realization of the underlying shocks. All theresults below are derived under the null hypothesis that the model is true,more speci�cally, that the observed data is generated by the structural modelat hand for a true parameter value �0: Let M(�) � E [ h(yt(�)) ] be thetrue moments for parameter values � at the stationary distribution of yt(�);hence M0 � M(�0) are the true moments, and let eS(�) � S(M(�)) bethe true statistics when the model parameter is �: Denote by M j

0 the j-thautocovariance of the moment function at the true parameter, that is

M j0 � E [h(yt(�0))�M0] [h(yt�j(�0))�M0]

0

De�ne

Sw �1X

j=�1M j0 (41)

We use the following estimate of the variance for the sample statisticsbSN b�S;N � @S(MN )

@M 0bSw;N @S(MN )

0

@M

The asymptotic properties of this estimate are given in the following result:

43

Page 46: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Variance of bSN Suppose that

a) Sw <1; and we have consistent estimators of this matrix bSw;N !Sw a.s. as N !1

b) S is continuously di¤erentiable at M0.

c) the observed process fytg is stationary and ergodic

Then we have that

b�S;N ! �S �@S(M0)

@M 0 Sw@S(M0)

0

@M(42)

and that �S is the asymptotic covariance matrix of bSN :Eh bSN � S(M0)

i h bSN � S(M0)i0! �S (43)

both limits occurring a.s. as N !1:Therefore, b�S;N is a consistent estimator of the asymptotic varianceof the sample statistics.

Proof. Assumptions a), c) imply

cMN !M0 a.s. as N !1

and assumption b) gives (42).Assumptions a), c) imply

E [MN �M0] [MN �M0]0 ! Sw a.s. as N !1

The mean value theorem implies that

S(MN )� S(M0) =@S(fMN )

@M 0 [MN �M0] (44)

for a fMN !M0 a.s. asN !1: Taking expectations of [S(MN )� S(M0)] [S(MN )� S(M0)]0

we have (43).

Conditions a), c) are standard minimal assumptions used in time seriesasymptotic results, condition b) is clearly satis�ed in our application, seethe expression for S in section 7.6.2. We choose consistent estimates bSw;Napplying the Newey West estimator using only the data. Hence the esti-mator b�S;N can be found purely from data, without using the model or itsparameter estimates. We now turn to

44

Page 47: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

Consistency Let b�N be the estimator de�ned in (28), where the maximiza-tion is over a set � � Rn: Assume

a) � is compact, the process fyt(�)g is well de�ned for all � 2 �; eSis continuous in �; and �0 2 �:

b) fyt(�)g is geometrically ergodic for all � 2 �c) �S is invertible

d) h eS(�0)� eS(�)i0��1S h eS(�0)� eS(�)i > 0 for all � 6= �0

Then b�N ! �0 a:s: as N !1:

The proof is easily obtained by adapting the consistency result from DS.Condition a) is standard in GMM applications, the set � should be large

enough to insure that it contains admissible values of the true parameter val-ues. DS emphasize that a strong form of ergodicity is needed as in conditionb), we showed in appendix 7.4 that this holds for � large enough, thereforeb) is guaranteed if � is restricted to large �. Conditions c) and d) are stan-dard identi�cation requirements that the statistics selected are su¢ cient toidentify the true parameter values. A necessary condition for d) is that thenumber of parameters is less than the number of statistics s.

Let

B0�@M 0(�0)

@�

@S 0(M0)

@M

Asymptotic Distribution In addition to all the assumptions in the aboveresults, assume that B0��1S B

00 is invertible. Then

pNhb�N � �0i ! N (0;

�B0��1S B0

��1) (45)

pNh bSN � S(M(b�))i ! N (0;�S � B00(B0�

�1S B

00)�1B0)) (46)cWN ! �2s�n (47)

in distribution as N !1; wherecWN � Nh bSN � eS(b�)i0 b��1S;N h bSN � eS(b�)i

Proof. The central limit theorem and (44) imply

pN [S(MN )� S(M0)] =

@S(fMN )

@M 0

pN [MN �M0]! N (0;�S) (48)

45

Page 48: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

in distribution. Letting

B(�;M)�@M0(�)

@�

@S 0(M)@M

The asymptotic distribution of the parameters is derived as

S(M(b�N ))� S(M(�0)) = B(e�;fM) hb�N � �0iB(b�N ;cMN )b��1S;N hS(M(b�))� S(M(�0))i = B(b�N ;cMN )b��1S;NB(e�;fM)0 hb� � �0iB(b�N ;cMN )b��1S [S(MN )� S(M(�0))] = (49)

B(b�N ;cMN )b��1S B(e�;fM)0 hb� � �0iwhere the last equality follows because B(b�N ;cMN )b��1S h

S(MN )� S(b�N )i =0 at the maximum of (28): This implies (45).

To obtain (46) we use mean value theorem and (49) to conclude

S(cMN )� S(M(b�N )) = S(cMN )� S(M(�0)) + B(e�;fM) h�0 � b�i =�I �

hB(b�N ;cMN )�

�1S B(e�;fM)0i�1 B(b�N ;cMN )b��1S;N�hS(cMN )� S(M(�0))

iThis gives (46).

(47) follows from (46) and that��S � B00

�B0��1S B00

��1 B0� is an idem-potent matrix.

We mentioned in the main text and earlier in this appendix that wedrop some statistics from bSN when b�S;N is nearly singular: The reason isthat, as stated above, we need invertibility of �S both for consistency andasymptotic distribution. In practice, a nearly singular b�S;N creates manyproblems. First, the results for the test W change very much with smallchanges in the model or testing procedure and the maximization algorithmis nearly unstable, making it di¢ cult to �nd a maximum numerically. Thishappens because in this case the formula for cWN nearly divides zero byzero, hence the objective function is nearly unde�ned, and the asymptoticdistribution is not necessarily a good approximation to the true distributionof the test statistic. But this is not a bad situation for an econometrician,it just means that one of the statistics is redundant, so it makes sense tosimply drop one statistic from the estimation.

To decide which statistic to drop we compute the variability of eachstatistic that can not be explained by a linear combination of the remaining

46

Page 49: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

statistics: This is analogous to the R2 coe¢ cient of running a regression ofeach statistic on all the other statistics when the regression coe¢ cients arecomputed from b�S;N : We drop the statistics for which this R2 is less than1%, as it turns out this only occurs for bc52 with an R2 equal to 0.006. Afterwe drop bc52 the estimation results become su¢ ciently stable.36 As we explainin our discussion around Table 2, the model is in any case able to match bc52even when we drop it from the statistics used in the estimation.

There are various ways to compute the moments of the model eS(�) fora given � 2 Rn:We use the following Monte-Carlo procedure. Let !i denotea realization of shocks drawn randomly from the known distribution thatthe underlying shocks are assumed to have and

�y1(�; !

i); :::yN (�; !i)�the

random variables corresponding to a history of length N generated by themodel for shock realization !i and parameter values �. Furthermore, let

MN (�; !i) � 1

N

NXt=1

h(yt(�; !i))

denote the model moment for realization !i. We set the model statisticseS(�) equal to1

K

KXi=1

S(MN (�; !i))

for large K: In other words, eS(�) is an average across a large number ofsimulations of length N of the statistics S(MN (�; !

i)) implied by each sim-ulation. We use K of the order of 1000, therefore the model moments arecomputed with KN observations. These are the averages reported as modelmoments in tables 2 to 4 of the main text.

Many papers on MSM emphasize the dependence of the estimates onthe ratio of number of observations in simulations to N: Since this is 1000in our application this adds a negligible factor to the asymptotic variance-covariance matrices computed and we entirely ignore it in our results.

7.6.2 The statistic and moment functions

This section gives explicit expressions for the statistic function S(�) andthe moment functions h(�) that map our estimates into the framework just36An alternative to dropping one moment would be to match the linear combinations

corresponding to the principal components of bSw. While this has some advantages fromthe point of view of asymptotic theory, we �nd it less attractive from the economic point ofview, since the economic meaning of these principal components would be unclear, whileeach individual moment does have clear economic interpretation.

47

Page 50: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

discussed in this appendix.The underlying sample moments needed to construct the statistics of

interest are

MN �1

N

NXt=1

h(yt)

where h(�) : R42 ! R11 and yt are de�ned as

h(yt) �

26666666666666666664

rstPDt(rst )

2

(PDt)2

PDt PDt�1rs;20t�20�rs;20t�20

�2rs;20t�20PDt�20

rbtDt=Dt�1(Dt=Dt�1)

2

37777777777777777775

; yt �

266666666666664

PDtDt=Dt�1PDt�1

Dt�1=Dt�2...

PDt�19Dt�19=Dt�20PDt�20rbt

377777777777775

where rs;20t denotes the stock return over 20 quarters, which can be computedusing from yt using (PDt; Dt=Dt�1; :::; PDt�19; Dt�19=Dt�20).

The ten statistics we consider can be expressed as function of the mo-ments as follows:

S(M) �

2666666666666664

E(rst )E(PDt)�rst�PDt�PDt;�1c52R25E(rbt )

E(�Dt=Dt�1)�Dt=Dt�1

3777777777777775=

26666666666666666664

M1

M2qM3 � (M1)

2qM4 � (M2)

2

M5�(M2)2

M4�(M2)2

c52(M)R25(M)M9

M10qM11 � (M10)

2

37777777777777777775where Mi denotes the i�th element of M and the functions c52(M) andR25(M) de�ne the OLS and R

2 coe¢ cients of the excess returns regressions,more precisely

48

Page 51: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

c5(M) ��1 M2

M2 M4

��1 �M6

M8

�R25(M) � 1�

M7 � [M6;M8] c5(M)

M7 � (M6)2

7.6.3 Derivatives of the statistic function

This appendix gives explicit expressions for @S=@M 0 using the statistic func-tion stated in appendix 7.6.2. Straightforward but tedious algebra shows

@Si@Mj

= 1 for (i; j) = (1; 1); (2; 2); (8; 9); (9; 10)

@Si@Mj

=1

2Si(M)for (i; j) = (3; 3); (4; 4); (10; 11)

@Si@Mj

=�Mj

Si(M)for (i; j) = (3; 1); (4; 2); (10; 10)

@S5@M2

=2M2(M5 �M4)

(M4 �M22 )2;

@S5@M5

=1

M4 �M22

;@S5@M4

= � M5 �M22�

M4 �M22

�2@S6@Mj

=@c52(M)

@Mjfor i = 2; 4; 6; 8

@S7@Mj

=[M6;M8]

@c5(M)@Mj

M7 �M26

for j = 2; 4

@S7@M6

=

hc51(M) + [M6;M8]

@c5(M)@M6

i �M7 �M2

6

�� 2M6 [M6;M8] c

5(M)�M7 �M2

6

�2@S7@M7

=M26 � [M6;M8] c

5(M)�M7 �M2

6

�2@S7@M8

=c52(M) + [M6;M8]

@c5(M)@M8

M7 �M26

Using the formula for the inverse of a 2x2 matrix

c5(M) =1

M4 �M22

�M4M6 �M2M8

M8 �M2M6

49

Page 52: Stock Market Volatility and Learning1 · Stock Market Volatility and Learning1 Klaus Adam2, Albert Marcet3 Juan Pablo Nicolini4 April 2, 2012 1Thanks go to Davide Debortoli, Luca

we have

@c5(M)

@M2=

1

M4 �M22

�2M2c

5(M)��M8

M6

��@c5(M)

@M4=

1

M4 �M22

��c5(M) +

�M6

0

��@c5(M)

@M6� 1

M4 �M22

�M4

�M2

�@c5(M)

@M8� 1

M4 �M22

��M2

1

�All remaining terms @Si=@Mj not listed above are equal to zero.

50