Endogeneity Bias

38
Endogeneity Bias Module 4 Day 1 Session 1

Transcript of Endogeneity Bias

Page 1: Endogeneity Bias

Endogeneity Bias

Module 4 Day 1 Session 1

Page 2: Endogeneity Bias

Meaning• In traditional usage, a variable is said to be endogenous if it is

determined within the context of the model

• An Explanatory Variable is said to be endogenous if it is correlated with u

• If, however, E(u/x) = 0 holds, then the explanatory variables are exogenous

• Intuitive definition of Endogeneity• A situation where the “ceteris paribus condition” is not fulfilled

whenever the independent variable of interest is changed.

Page 3: Endogeneity Bias

Source• In applied Econometrics, endogeneity can arise in three possible

ways (Wooldridge, 2002):

– Omitted Variable

– Measurement Error

– Simultaneity

• Note: the distinction between the three possible sources of bias is not sharp, generally an equation can exhibit more than one source of endogeneity

Page 4: Endogeneity Bias

Omitted Variable Bias

• Omitted Variables appear when we would like to control for one or more additional variables but because of data unavailability we cannot include them in the regression model

• Omitted variable could be

– Observable

– Unobservable

Page 5: Endogeneity Bias

Example• Impact of Monetary Policy shocks on Stock Returns

• When estimating the effect of monetary policy on stock prices, estimates may be severely biased due to omitted variable issues

• Caused by factors that influence both policy rates and stock prices which are commonly excluded from regression analysis

• Changes in policy rates and stock prices may be caused by other important news released earlier in the period which are omitted from regression, giving rise to the omitted variable bias.

• E.g. employment, inflation or economic activity reports etc.

Page 6: Endogeneity Bias

Omitted Observable Variables

• Solution : Include all factors that are I. Important in explaining the dependent variable II. Associated with one of the explanatory variables, into the structural

equation. • In practical terms, this will usually require the researcher to conduct a thorough

review of the extant theoretical and empirical literatures to identify these joint determinants. (Note :Factors that are associated with both dependent and one or more explanatory variables are said to be ‘joint determinants’.)

• Once included in the structural model, the disturbance term is purged from the source of its correlation with the explanatory variables and the estimation of the parameters of interest should no longer be affected by endogeneity.

Page 7: Endogeneity Bias

Omitted Unobservable Variables

• Since the researcher will not be able to gather data on omitted variables that are unobservable, the solution of including any joint determinants will no longer work.

• Two Ways to address to the problem

(1) Self-Selection(2) Firm-Specific Heterogeneity

Page 8: Endogeneity Bias

Self-Selection• Self or sample-selection arises if the probability that a firm is included into the sample

and the dependent variable are both affected by an (omitted unobservable) variable.

• Sample is no longer random

• Example: whether a commitment to increased disclosure, as required under international standards, has tangible benefits in the form of lower cost-of-capital?

• Increased disclosure is equivalent to switching reporting regime

• Firms will decide on disclosure based on the expected consequences with regard to their cost-of-capital

• the factors that determine the disclosure choice are likely to also affect the dependent variable, current cost-of-capital

Page 9: Endogeneity Bias

Example

• In most corporate governance studies, researchers study the largest listed companies

• this induces a sample selection bias in the model• The average characteristics of the sample will differ from that

of the population• Leads to incorrect measure of the relationship between

corporate governance and performance as the impact of corporate governance is confounded with the impact of the parameters that determine selection

Page 10: Endogeneity Bias

Firm-Specific Heterogeneity.

• Unobserved omitted variables often represent features of the firm that are given and do not change over the period in question.

• Specifically, firm characteristics like managerial ability, structural arrangements, and employee skills can be thought of as roughly constant over time.

• If these firm characteristics impact on both the dependent variable and one or more explanatory variables, the structural disturbance will be correlated with those explanatory variables.

Page 11: Endogeneity Bias

Example of Unobserved Firm Heterogeneity

• Consider two firms that are identical except that the owner of one of the firms has access to a superior monitoring technology.

• Under the optimal contracting regime, the owners with access to the superior monitoring technology will choose a lower level of managerial ownership to align incentives

• This firm will have a higher valuation because fewer resources will be diverted to managerial perquisites.

• If measures of the quality of the monitoring technology are omitted from the specification, a regression of firm value on managerial ownership will spuriously (and falsely) indicate a negative relation, because ownership is a negative proxy for the quality of monitoring technology.

Page 12: Endogeneity Bias

Example of Unobserved Firm Heterogeneity

• Intangible assets provide an example of unobserved firm heterogeneity.

• Suppose two firms are identical except that one of the firms operates with a higher fraction of its assets in the form of intangibles.

• Under the optimal contracting regime, the owners of this firm will require a higher level of managerial ownership to align incentives because the intangible assets are hard to monitor

• This firm will also have a higher Q value because the market will value intangibles in the numerator (market value), but the book value of assets in the denominator will understate the value of intangibles (because Tobin's Q is measured as the ratio of the market value of the firm's outstanding debt and equity divided by the book value of assets).

• In this example, the unobserved level of intangibles induces a positive correlation between managerial ownership and Tobin's Q, but this relation is spurious, not causal.

Page 13: Endogeneity Bias

Solution

• Proxy variable: for the firm characteristic and plug this into the structural equation

• Instrumental variable (IV) estimation: instruments might be available for those explanatory variables that are correlated with the unobservable firm characteristic

• Panel Data Techniques: fixed effects to account for firm-specific heterogeneity

Page 14: Endogeneity Bias

Proxy Variable

• In corporate finance literature, moral hazard is a factor to be accounted for in a model measuring performance

• Why?• How do we measure moral hazard?• Common proxies for moral hazard include • Firm size• Capital intensity- hard capital• R&D intensity• Advertising intensity• Cash flow• Investment rate

Soft Capital

Page 15: Endogeneity Bias

Instrumental Variable• The instrument used should have a high correlation with the explanatory

variable• The instrument and the error term should be uncorrelated• Slightest correlation between them would lead to a bias > bias in case of

OLS with endogeneity• In practise, it is extremely difficult to find such an instrument• Most empirical studies use “imperfect instruments”• Imperfect Instruments:

– either exogenous but having low correlation with the endogenous variable, termed as weak instruments

– Or having a high correlation with the endogenous variable, but not exogenous, termed as semi-exogenous or quasi-instrumental

Page 16: Endogeneity Bias

Types of Instruments• Both instruments have their disadvantages

• Weak instruments often lead to inefficient as well as inconsistent estimators

• Example of weak instruments??

• Semi-exogenous instruments include lagged value of the endogenous variable, e.g. in the corporate governance model, for instance, lagged corporate rating could be used as an instrument

• Problem: performance in period (t-1) is related to corporate governance in period (t-1) and performance in period (t-1) is related to performance in period t. Degree of exogeneity depends on how much past performance affect current performance

Page 17: Endogeneity Bias

Characteristics of a Good Instrument• Uncontroversial instruments are those which are determined from outside the

firm but affect the firms (Borsh-Supan and Koke 2002)

• Again using the corporate governance example, a good candidate would be country-level factors, like, country score on corporate governance practices, number of recommendations published for corporate governance practices in a year

• This measure should be highly correlated with the endogenous variable, assuming that implementation in the country is strong

• Exogeneity ensured by the fact that companies do not generally influence such recommendations

• Size or wealth effect may have an effect though

Page 18: Endogeneity Bias

Example

• Is nominal exchange rate affected by the sterilized interventions in foreign exchange market?

• The answer to this question cannot be conclusively obtained due to endogeneity problems

• Source 1 : simultaneity between the intervention decision and contemporaneous exchange rate

• Intervention affects current spot rate, current spot rate triggers intervention

• Source 2: omitted variable• What should be the valid controls when the dependent variable is daily

nominal exchange rate?– Macroeconomic variables are unlikely to have relevance on a daily horizon

Page 19: Endogeneity Bias

Instrument

• To avoid simultaneity, one may model an interventionary reaction function

• Explanatory factors include: change in nominal exchange rate, other factors that explain intervention

• Same problem: valid instruments may not be available

• Plausible valid instrument: customer trades data from the market microstructure finance data

Page 20: Endogeneity Bias

Another Solution• Recall the omitted variable bias problem in the problem of monetary

shock effecting stock returns• Solution: • the omitted variable issue can be resolved by introducing a new high

frequency data set consisting of intraday changes around policy announcement time in, for instance, S&P500 futures and spot-month federal funds futures.

• S&P500 futures can be used to capture the stock market response to policy actions,

• Federal funds futures can be used to measure policy shocks. • The new data set decreases the likelihood that other common factors

impact both policy rates and stock prices during the tight intervals around announcements, thus reducing the omitted variable bias.

Page 21: Endogeneity Bias

Caution

• The mere fact that some variable represents a decision (or choice) to the firm or, more generally, an economic agent, is not in itself sufficient for ‘econometric endogeneity’ to arise.

• Only if the factors that impact on the decision by the

economic agent, whether observable or not, are also inter-related with the dependent variable will endogeneity exist.

Page 22: Endogeneity Bias

Measurement Error

• We would like to measure the effect of a variable, but we can observe only an imperfect measure of it

• Replacing the actual measure by the imperfect measure creates a measurement error

• Magnitude of the measurement error depends on the correlation between the actual and imperfect measure

• Measurement error can arise due to misreporting

Page 23: Endogeneity Bias

Measurement Error & Proxy Variable Problem

Proxy Variable• One is looking for a variable

which is somehow related to the unobserved variable

• The partial effect of the omitted variable is rarely of central interest

Measurement Error• The variable we do not

observe has well-defined, quantitative meaning

• Mismeasured independent variable is of central interest

Page 24: Endogeneity Bias

Measurement Error in Dependent Variable

Page 25: Endogeneity Bias

Saving-Income Example• Suppose savings is a function of income, size, education and age

• Actual Saving may deviate from reported saving measurement error

• Will the measurement error be correlated with the explanatory variables ?

• In general not

• However, one might expect families with higher incomes or more education to report their saving more accurately

Page 26: Endogeneity Bias

Measurement Error in Independent Variable

Page 27: Endogeneity Bias

Example: Beta Estimation

Page 28: Endogeneity Bias

Measurement Error in Beta

Page 29: Endogeneity Bias

Bias in Beta Estimation

Page 30: Endogeneity Bias

Solution to Measurement Error

Page 31: Endogeneity Bias

Simultaneity• Simultaneity arises when at least one of the explanatory variables is

determined simultaneously along with the dependent variable

• The concerned explanatory variable and the disturbance term would generally be correlated

• Example: In Modelling financial intermediary development and sources of growth, endogenity may arise due to simultaneous determination of financial intermediary development and

• (i) private savings rates, • (ii) capital accumulation, • (iii) total factor productivity growth, and • (iv) overall real per capita GDP growth.

Page 32: Endogeneity Bias

EXAMPLE• Revisiting the corporate governance and performance relation• The direction of causation is important• Board Composition could affect firm performance and firm performance could also

cause a change in board composition• This reverse causality leads to biased OLS estimators• Direction of bias depends on whether performance positively or negatively impact

governance• Expected Relationship: poor performing companies increase the number of

independent board members to improve performance• Reverse causality has a negative impact• Coefficient of corporate governance is underestimated• OLS estimate will lead to downward biased estimates

Page 33: Endogeneity Bias

Structural Equation

• To capture instantaneous feedback relations, researchers write a system of equations

• It consists of separate structural equations for each endogenous variable.

• When variables y1 impacts on y2 and vice versa, – y2 would be included as an explanatory variable in the

structural– equation for y1; y1, in turn, is an explanatory variable in

the structural equation of y2.

Page 34: Endogeneity Bias

Rank and Order Condition

Page 35: Endogeneity Bias

General Order Condition

Page 36: Endogeneity Bias

Estimation

• Single Equation Methods– Indirect least squares– Two-stage least squares– LIML

• System Methods– Three-stage least squares– FIML

Page 37: Endogeneity Bias

Example• Estimation in the Corporate governance and performance problem

• Instrument variable method in 3 stages:

• First stage: Selection Equation• Probit (in the sample) = f(variable in selection process)

• Second stage: Reduced form Regression• Endogenous Variable = f(instruments, inverse mills ratio, control variables)

• Third stage: Structural Equation• Dependent Variable = f (stage 2 predicted value, inverse mills ratio, control

variables)

Page 38: Endogeneity Bias

Shifting Supply Function to trace Demand Function

S1

S2

S3Price

Quantity

D