Why Macroeconomics? J. Bradford DeLong U.C. Berkeley Econ 101b First Lecture Notes January 23, 2008.
2014 02 03 and 05 econ 141 uc berkeley
-
Upload
seema-soleja -
Category
Documents
-
view
54 -
download
2
description
Transcript of 2014 02 03 and 05 econ 141 uc berkeley
Ch.4: Simple Linear Regression
Econ 141 Spring 2014
Lecture: February 02 and 05, 2014
Bart Hobijn
2/03&05/2014 Econ 141, Spring 2014 1
The views expressed in these lecture notes are solely those of the instructor and do not necessarily
reflect those of the UC Berkeley, or other institutions with which he is affiliated.
Example: Estimate MPC
β’ MPC: Marginal Propensity to Consume
Suppose householdsβ pre-tax income
increases by a dollar, what fraction of this
dollar would they end up spending versus
paying in taxes or saving?
2/03&05/2014 Econ 141, Spring 2014 2
Example: Estimate MPC
β’ Basic equation
ππ = π½0 + π½1ππ + π’π β π MSA, (unit of observation)
β ππ Average consumption expenditures per household
β ππ Average pre-tax income per household
β π½0 Average consumption level at zero income
β π½1 Marginal propensity to consume (MPC)
β π’π MSA-specific deviation from average linear
relationship between income and spending
β’ How can we estimate value of MPC, i.e. π½1?
2/03&05/2014 Econ 141, Spring 2014 3
Income and spending by MSA
MSA
(π) Spending
(ππ)
Income
(πΏπ)
MSA
(π) Spending
(ππ)
Income
(πΏπ)
Chicago 57.7 74.4 Atlanta 51.9 71.2
Detroit 50.5 79.8 Miami 40.6 58.9
Minneapolis-
St. Paul 56.7 66.8
Dallas-
Fort Worth 57.1 71.0
Cleveland 48.0 65.9 Houston 58.2 73.5
New York 58.7 80.2 Los
Angeles 55.3 69.6
Philadelphia 53.5 71.7 San
Francisco 73.6 98.2
Boston 65.0 79.8 San Diego 56.2 76.4
Washington,
D.C. 77.9 111.9 Seattle 60.7 74.1
Baltimore 62.3 96.9 Phoenix 53.7 63.2
2/03&05/2014 Econ 141, Spring 2014 4
Note: Spending and income are annual average across households in thousands of dollars
Source: Consumer Expenditure Survey
Data scatterplot
2/03&05/2014 Econ 141, Spring 2014 5
Chicago
Detroit
Minneapolis-St. Paul
Cleveland
New York
Philadelphia
Boston
Washington,D.C.
Baltimore
Atlanta
Miami
Dallas-Fort Worth
Houston
LosAngeles
SanFrancisco
San Diego
Seattle
Phoenix
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
Data scatterplot
2/03&05/2014 Econ 141, Spring 2014 6
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
Estimate of MPC (π·π)?
2/03&05/2014 Econ 141, Spring 2014 7
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
Estimate of MPC (π·π)?
2/03&05/2014 Econ 141, Spring 2014 8
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
What is best estimate of line
defined by π·π and π·π?
Ordinary Least Squares
2/03&05/2014 Econ 141, Spring 2014 9
Simple linear regression model
ππ = π½0 + π½1ππ + π’π
β π observation number
β ππ dependent variable (regressand)
β ππ independent (explanatory) variable (regressor)
β π½0 intercept / constant
β π½1 slope coefficient
β π’π error term / residual
2/03&05/2014 Econ 141, Spring 2014 10
Simple linear regression model
ππ = π½0 + π½1ππ + π’π
β π observation number
β ππ dependent variable (regressand)
β ππ independent (explanatory) variable (regressor)
β π½0 intercept / constant
β π½1 slope coefficient
β π’π error term / residual
2/03&05/2014 Econ 141, Spring 2014 11
Population regression line /
Population regression function
Average linear relationship between
dependent and independent variable.
Simple linear regression model
ππ = π½0 + π½1ππ + π’π
β π observation number
β ππ dependent variable (regressand)
β ππ independent (explanatory) variable (regressor)
β π½0 intercept / constant
β π½1 slope coefficient
β π’π error term / residual
2/03&05/2014 Econ 141, Spring 2014 12
Error term / Residual
Observation-specific deviation from
average linear relationship between
dependent and independent variable.
Simple linear regression model
ππ = π½0 + π½1ππ + π’π
β π observation number
β ππ dependent variable (regressand)
β ππ independent (explanatory) variable (regressor)
β π½0 Intercept / constant
β π½1 Slope coefficient
β π’π error term / residual
2/03&05/2014 Econ 141, Spring 2014 13
Why we need to estimate
Observed: Sample ππ , ππ for π = 1, β¦ , π.
Unobserved: Parameters π½0 and π½1 as well
as error terms π’π for π = 1, β¦ , π.
Ordinary Least Squares (OLS)
β’ OLS estimates: Choose π½ 0 and π½ 1 to minimize the sum of squared
residuals (SSR)
π½ 0, π½ 1 = argminπ1,π2
ππ β π0 β π1ππ2
π
π=1
β’ Properties:
β What is solution for π½ 0, π½ 1 ?
β Are π½ 0, π½ 1 consistent estimates of true π½0 and π½1?
β Are π½ 0, π½ 1 unbiased estimates of true π½0 and π½1?
β What is their asymptotic distribution? 2/03&05/2014 Econ 141, Spring 2014 14
Solution for π· π, π· π
First order necessary condition for π· π
0 =π
ππ½ 0 ππ β π½ 0 β π½ 1ππ
2π
π=1
= π
ππ½ 0ππ β π½ 0 β π½ 1ππ
2π
π=1
= β2 ππ β π½ 0 β π½ 1ππ
π
π=1
β 0 =1
π ππ β π½ 0 β π½ 1ππ
π
π=1
.
Solving for π½ 0
0 =1
π ππ β π½ 0 β π½ 1ππ
π
π=1
= π β π½ 0 β π½ 1π
Such that
π½ 0 = π β π½ 1π
2/03&05/2014 Econ 141, Spring 2014 15
Solution for π· π, π· π
First order necessary condition for π· π
0 =π
ππ½ 1 ππ β π½ 0 β π½ 1ππ
2π
π=1
=π
ππ½ 1 ππ β π β π½ 1 ππ β π
2π
π=1
= β2 ππ β π ππ β π β π½ 1 ππ β π
π
π=1
β 0 =1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 16
Solution for π· π, π· π
First order necessary condition for π· π
0 =π
ππ½ 1 ππ β π½ 0 β π½ 1ππ
2π
π=1
=π
ππ½ 1 ππ β π β π½ 1 ππ β π
2π
π=1
= β2 ππ β π ππ β π β π½ 1 ππ β π
π
π=1
β 0 =1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 17
This implies that
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
ππ=1
Solution for π· π, π· π
First order necessary condition for π· π
0 =π
ππ½ 1 ππ β π½ 0 β π½ 1ππ
2π
π=1
=π
ππ½ 1 ππ β π β π½ 1 ππ β π
2π
π=1
= β2 ππ β π ππ β π β π½ 1 ππ β π
π
π=1
β 0 =1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π ππ β π
π
π=1
β π½ 11
π ππ β π 2
π
π=1
= π ππ βπ½ 1π π2 .
2/03&05/2014 Econ 141, Spring 2014 18
Solution for π· π, π· π
First order necessary condition for π· π
0 =π
ππ½ 1 ππ β π½ 0 β π½ 1ππ
2π
π=1
=π
ππ½ 1 ππ β π β π½ 1 ππ β π
2π
π=1
= β2 ππ β π ππ β π β π½ 1 ππ β π
π
π=1
β 0 =1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π ππ β π
π
π=1
β π½ 11
π ππ β π 2
π
π=1
= π ππ β π½ 1π π2 .
2/03&05/2014 Econ 141, Spring 2014 19
π· π =π ππ
π π2
Simple linear regression with OLS
β’ OLS estimators of π·π and π·π:
β π½ 1 =1
π ππβπ ππβπ π
π=11
π ππβπ 2π
π=1
=π ππ
π π2
β π½ 0 = π β π½ 1π
β’ Derived estimates for each π = π, β¦ , π
β π π = π½ 0 + π½ 1ππ, predicted/fitted value of ππ
β π’ π = ππ β π π, residual
2/03&05/2014 Econ 141, Spring 2014 20
Estimating the MPC
2/03&05/2014 Econ 141, Spring 2014 21
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
Estimating the MPC
2/03&05/2014 Econ 141, Spring 2014 22
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
Estimating the MPC
2/03&05/2014 Econ 141, Spring 2014 23
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
Estimated MPC out of
pre-tax income is 56
cents on the dollar
73.6
69.6
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
SF predicted value and residual
2/03&05/2014 Econ 141, Spring 2014 24
Expenditures in
SF higher than
predicted by
regression
SF predicted value and residual
2/03&05/2014 Econ 141, Spring 2014 25
73.6
69.6
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
Residual:
Goodness of fit
β’ Main question
What fraction of the variance of the
dependent variable, ππ, is explained by the
regression line rather than unexplained? (unexplained means part of the residuals)
β’ Variance accounting β πππ = ππ β π 2π
π=1 total sum of squares
β πΈππ = π π β π 2π
π=1 , estimated sum of squares
a.k.a. model sum of squares
β πππ = ππ β π π2π
π=1 = π’ π2 π
π=1 sum of squares residuals
a.k.a. residual sum of squares
2/03&05/2014 Econ 141, Spring 2014 26
TSS = ESS + SSR decomposition
πππ = ππ β π 2
π
π=1
= π π β π + ππ β π π2
π
π=1
= π π β π + π’ π
2=
π
π=1
π½ 0 + π½ 1ππ β π + π’ π
2π
π=1
= π β π½ 1π + π½ 1ππ β π + π’ π
2π
π=1
= π½ 1 ππ β π + π’ π2
π
π=1
= π½ 1 ππ β π 2
π
π=1
+ 2π½ 1 ππ β π π’ π
π
π=1
+ π’ π2
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 27
TSS = ESS + SSR decomposition
πππ = ππ β π 2
π
π=1
= π π β π + ππ β π π2
π
π=1
= π π β π + π’ π
2=
π
π=1
π½ 0 + π½ 1ππ β π + π’ π
2π
π=1
= π β π½ 1π + π½ 1ππ β π + π’ π
2π
π=1
= π½ 1 ππ β π + π’ π2
π
π=1
= π½ 1 ππ β π 2
π
π=1
+ 2π½ 1 ππ β π π’ π
π
π=1
+ π’ π2
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 28
ππ β π π’ πππ=1 = π according to first-order
necessary condition derived on slide 17.
TSS = ESS + SSR decomposition
πππ = ππ β π 2
π
π=1
= π π β π + ππ β π π2
π
π=1
= π π β π + π’ π
2=
π
π=1
π½ 0 + π½ 1ππ β π + π’ π
2π
π=1
= π β π½ 1π + π½ 1ππ β π + π’ π
2π
π=1
= π½ 1 ππ β π + π’ π2
π
π=1
= π½ 1 ππ β π 2
π
π=1
+ 2π½ 1 ππ β π π’ π
π
π=1
+ π’ π2
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 29
π½ 1 ππ β π = π π β π
TSS = ESS + SSR decomposition
πππ = ππ β π 2
π
π=1
= π π β π + ππ β π π2
π
π=1
= π π β π + π’ π
2=
π
π=1
π½ 0 + π½ 1ππ β π + π’ π
2π
π=1
= π β π½ 1π + π½ 1ππ β π + π’ π
2π
π=1
= π½ 1 ππ β π + π’ π2
π
π=1
= π½ 1 ππ β π 2
π
π=1
+ 2π½ 1 ππ β π π’ π
π
π=1
+ π’ π2
π
π=1
= π π β π 2
π
π=1
+ π’ π2
π
π=1
= πΈππ + πππ .
2/03&05/2014 Econ 141, Spring 2014 30
Goodness of fit for MCP regression
2/03&05/2014 Econ 141, Spring 2014 31
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
πΉπ: measure of goodness of fit
Measure Equation Value Share (percentage)
ESS π π β π 2π
π=1 944.0 74.9
SSR π’ π2
π
π=1 316.4 25.1
TSS ππ β π 2π
π=1 1260.3 100.0
2/03&05/2014 Econ 141, Spring 2014 32
πΉπ: measure of goodness of fit
Measure Equation Value Share (percentage)
ESS π π β π 2π
π=1 944.0 74.9
SSR π’ π2
π
π=1 316.4 25.1
TSS ππ β π 2π
π=1 1260.3 100.0
2/03&05/2014 Econ 141, Spring 2014 33
πΉπ fraction of the variation in
the dependent variable, i.e.
of π»πΊπΊ, explained by the
regression line. πΉπ = π. πππ.
Standard error of the regression
Measure Equation Value Share (percentage)
ESS π π β π 2π
π=1 944.0 74.9
SSR π’ π2
π
π=1 316.4 25.1
TSS ππ β π 2π
π=1 1260.3 100.0
2/03&05/2014 Econ 141, Spring 2014 34
πΊπ¬πΉ = ππ , where ππ π =
π
πβπ π π
π ππ=π is
an unbiased estimate of variance of
residuals πππ ππ
Standard error of the regression
ππΈπ = π π’ , where π π’ 2 =
1
πβ2 π’ π
2 ππ=1
β’ Degrees of freedom correction If we had only two observations we would be able to
perfectly fit a straight line and residuals would be
zero.
2/03&05/2014 Econ 141, Spring 2014 35
Standard error of the regression
ππΈπ = π π’ , where π π’ 2 =
1
πβ2 π’ π
2 ππ=1
β’ Degrees of freedom correction If we had only two observations we would be able to
perfectly fit a straight line and residuals would be
zero.
β’ Measure of spread around regression line Estimate of standard deviation of deviation from the
regression line.
2/03&05/2014 Econ 141, Spring 2014 36
MCP regression in Excel
2/03&05/2014 Econ 141, Spring 2014 37
30
40
50
60
70
80
90
50 60 70 80 90 100 110 120
Source: Consumer Expenditure Survey by MSA
Annual income and expenditures by household; 000's dollars; 2012
Average Income and Expenditures by major MSA
Income
Expenditures
San Francisco
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 38
π· π estimated
intercept
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 39
π· π estimated slope
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 40
π sample size
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 41
π¬πΊπΊ Explained sum of squares
πΊπΊπΉ Sum of squared residuals
π»πΊπΊ Total sum of squares
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 42
πΉπ
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 43
π π’ 2 =
1
π β 2 π’ π
2 π
π=1
SUMMARY OUTPUT
Dependent variable: Expenditures
Regression Statistics
Multiple R 0.865434015
R Square 0.748976034
Adjusted R Square 0.733287037
Standard Error 4.446724217
Observations 18
ANOVA
df SS MS F Significance F
Regression 1 943.9589518 943.9589518 47.73893411 3.51406E-06
Residual 16 316.3737002 19.77335626
Total 17 1260.332652
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 14.69690817 6.304502449 2.331176535 0.033146025 1.331960021 28.061856 1.33196002 28.06185632
Income 0.558872878 0.080886618 6.909336734 3.51406E-06 0.387400909 0.7303448 0.38740091 0.730344847
Example Excel regression output
2/03&05/2014 Econ 141, Spring 2014 44
ππΈπ = π π’
Why use OLS?
β’ Most common estimation method
β Implemented in many different applications
β Most common methodology. Thus important to
understand.
β’ OLS has very desirable properties
Under relatively general conditions OLS estimates
are
β Consistent
β Unbiased
β Have tractable asymptotic distribution 2/03&05/2014 Econ 141, Spring 2014 45
Why use OLS?
β’ Most common estimation method
β Implemented in many different applications
β Most common methodology. Thus important to
understand.
β’ OLS has very desirable properties
Under relatively general conditions OLS estimates
are
β Consistent
β Unbiased
β Have tractable asymptotic distribution 2/03&05/2014 Econ 141, Spring 2014 46
Conditions listed in book
ππ = π½0 + π½1ππ + π’π, where π = 1, β¦ , π
β’ No information in πΏπ about ππ
πΈ π’π = πΈ π’π ππ = 0
2/03&05/2014 Econ 141, Spring 2014 47
Conditions listed in book
ππ = π½0 + π½1ππ + π’π, where π = 1, β¦ , π
β’ No information in πΏπ about ππ
πΈ π’π = πΈ π’π ππ = 0 Suppose not and instead πΈ π’π ππ = πΎππ, then we can write
ππ = π½0 + π½1ππ + π’π = π½0 + π½1ππ + πΎππ + π’π β πΎππ
= π½0 + π½1 + πΎ ππ + π’π β πΎππ = π½0 + π½ 1ππ + π’ π
where πΈ π’ π ππ = πΈ π’π β πΎππ ππ = πΈ π’π ππ β πΎππ = 0
So, in this case there is an alternative representation of the
linear regression line with a different slope parameter,
π½ 1 = π½1 + πΎ, that satisfies this assumption.
2/03&05/2014 Econ 141, Spring 2014 48
Conditions listed in book
ππ = π½0 + π½1ππ + π’π, where π = 1, β¦ , π
β’ No information in πΏπ about ππ
πΈ π’π = πΈ π’π ππ = 0
Note, this implies that
πΈ π’πππ = πΈ πππΈ π’π ππ = 0
Which, given πΈ π’π = 0 implies that
cov ππ , π’π = πΈ π’πππ β πΈ π’π πΈ ππ = 0
2/03&05/2014 Econ 141, Spring 2014 49
Conditions listed in book
ππ = π½0 + π½1ππ + π’π, where π = 1, β¦ , π
β’ No information in πΏπ about ππ
πΈ π’π = πΈ π’π ππ = 0
β’ πΏπ, ππ well-behaved random variables
β ππ , ππ , π = 1, β¦ , π, are independently drawn from
identical joint distribution.
β Large outliers are unlikely.
2/03&05/2014 Econ 141, Spring 2014 50
Conditions listed in book
ππ = π½0 + π½1ππ + π’π, where π = 1, β¦ , π
β’ No information in πΏπ about ππ
πΈ π’π = πΈ π’π ππ = 0
β’ πΏπ, ππ well-behaved random variables
β ππ , ππ , π = 1, β¦ , π, are independently drawn from
identical joint distribution.
β Large outliers are unlikely.
Last two assumptions are made such that we can apply LLN
and CLT to derive properties of π½ 0 and π½ 1.
2/03&05/2014 Econ 141, Spring 2014 51
Properties of π½ 0 and π½ 1
Properties of π½ 0 and π½ 1 are derived by manipulating
the first-order necessary conditions from slides 15 and
17.
0 =1
π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π π’ π
π
π=1
and
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π π’ π
π
π=1
2/03&05/2014 Econ 141, Spring 2014 52
Properties of π½ 0 and π½ 1
Properties of π½ 0 and π½ 1 are derived by manipulating
the first-order necessary conditions from slides 15 and
17.
0 =1
π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π π’ π
π
π=1
and
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π π’ π
π
π=1
These are sample approximations of condition
πΈ π’π = πΈ π’πππ = 0
2/03&05/2014 Econ 141, Spring 2014 53
Consistency of π½ 1
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½0 + π½1ππ + π’π β π½0 β π½1ππ β π’ β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½1 β π½ 1 ππ β π + π’π β π’
π
π=1
= π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 54
Consistency of π½ 1
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½0 + π½1ππ + π’π β π½0 β π½1ππ β π’ β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½1 β π½ 1 ππ β π + π’π β π’
π
π=1
= π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
2/03&05/2014 Econ 141, Spring 2014 55
π πππ πΏπ > π π
πππ πΏπ, ππ = π
Consistency of π½ 1
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½0 + π½1ππ + π’π β π½0 β π½1ππ β π’ β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½1 β π½ 1 ππ β π + π’π β π’
π
π=1
= π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
π 0 = π½1 β π½ 1 var ππ .
Such that
π½1 β π½ 1π 0, that is π½ 1
π π½1
2/03&05/2014 Econ 141, Spring 2014 56
Consistency of π½ 1
0 =1
π ππ β π ππ β π½ 0 β π½ 1ππ
π
π=1
=1
π ππ β π ππ β π β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½0 + π½1ππ + π’π β π½0 β π½1ππ β π’ β π½ 1 ππ β π
π
π=1
=1
π ππ β π π½1 β π½ 1 ππ β π + π’π β π’
π
π=1
= π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
π 0 = π½1 β π½ 1 var ππ .
Such that
π½1 β π½ 1π 0, that is π½ 1
π π½1
2/03&05/2014 Econ 141, Spring 2014 57
As the sample size π gets arbitrarily
large, i.e. π β, our estimate of the
slope coefficient, π· π, gets arbitrarily
close to the true parameter value
π·πfrom the population regression line
But, in real life, π is finite
Small sample properties of OLS estimators
β’ Unbiasedness
On average OLS estimate equals true
parameter value of interest.
β’ Asymptotic distribution
OLS assumptions imply we can use CLT to
derive asymptotic normal distribution of OLS
estimates, that can be used as approximation
when π is big.
2/03&05/2014 Econ 141, Spring 2014 58
Unbiasedness of π½ 1
0 = π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
Such that
π½ 1 = π½1 +
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 +
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
Taking expectations yields
E π½ 1 = E π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π E π’π β π’ π1, β¦ , ππππ=1
1π
ππ β π 2ππ=1
= π½1 +E
1π
ππ β π E π’π β π’ ππππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π 0ππ=1
1π
ππ β π 2ππ=1
= π½1
2/03&05/2014 Econ 141, Spring 2014 59
Unbiasedness of π½ 1
0 = π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
Such that
π½ 1 = π½1 +
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 +
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
Taking expectations yields
E π½ 1 = E π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π E π’π β π’ π1, β¦ , ππππ=1
1π
ππ β π 2ππ=1
= π½1 +E
1π
ππ β π E π’π β π’ ππππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π 0ππ=1
1π
ππ β π 2ππ=1
= π½1
2/03&05/2014 Econ 141, Spring 2014 60
Here is where we
apply second
condition from slide 43
Unbiasedness of π½ 1
0 = π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
Such that
π½ 1 = π½1 +
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 +
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
Taking expectations yields
E π½ 1 = E π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π E π’π β π’ π1, β¦ , ππππ=1
1π
ππ β π 2ππ=1
= π½1 +E
1π
ππ β π E π’π β π’ ππππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π 0ππ=1
1π
ππ β π 2ππ=1
= π½1
2/03&05/2014 Econ 141, Spring 2014 61
Implied by first
condition from slide 43
Unbiasedness of π½ 1
0 = π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
Such that
π½ 1 = π½1 +
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 +
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
Taking expectations yields
E π½ 1 = E π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π E π’π β π’ π1, β¦ , ππππ=1
1π
ππ β π 2ππ=1
= π½1 +E
1π
ππ β π E π’π β π’ ππππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π 0ππ=1
1π
ππ β π 2ππ=1
= π½1
2/03&05/2014 Econ 141, Spring 2014 62
Even as the sample size π is not large, on
average our estimate of the slope
coefficient, π· π, will equal the true parameter
value π·πfrom the population regression line.
π π· π = π·π. Thus, π· π unbiased.
Unbiasedness of π½ 1
0 = π½1 β π½ 11
π ππ β π 2
π
π=1
+1
π ππ β π π’π β π’
π
π=1
.
Such that
π½ 1 = π½1 +
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 +
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
Taking expectations yields
E π½ 1 = E π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π E π’π β π’ π1, β¦ , ππππ=1
1π
ππ β π 2ππ=1
= π½1 +E
1π
ππ β π E π’π β π’ ππππ=1
1π
ππ β π 2ππ=1
= π½1 + E
1π
ππ β π 0ππ=1
1π
ππ β π 2ππ=1
= π½1
2/03&05/2014 Econ 141, Spring 2014 63
Note that the first condition that
E π’π β π’ ππ = 0
is crucial for OLS to be unbiased.
If this condition is not true the average OLS
estimate will deviate from
the true parameter value.
Asymptotic distribution of π½ 1
π½ 1 β π½1 =
1π
ππ β π π’π β π’ ππ=1
1π
ππ β π 2ππ=1
=
1π
ππ β π π’πππ=1
1π
ππ β π 2ππ=1
See slide 48
3 steps to deriving asymptotic distribution
1. Apply CLT to numerator
2. Apply LLN to denominator
3. Combine using Slutskyβs theorem (S&W page 676)
2/03&05/2014 Econ 141, Spring 2014 64
Apply CLT to 1
π ππ β π π’π
ππ=1
β’ Define random variable
π£π = ππ β π π’π
β’ Condition 1 from slide 47: E π£π = 0
β’ Conditions 2&3 from slide 50 imply that
β var π£π exists and is finite.
β CLT applies to sample mean of π£π.
π£ =1
π ππ β π π’π
π
π=1
β’ Note that: π½ 1 β π½1 = π£ π π2
2/03&05/2014 Econ 141, Spring 2014 65
Apply CLT to 1
π ππ β π π’π
ππ=1
Apply the Central Limit Theorem
π =π£ π β E π£π
var π£π π =
π£ π
var π£π π π π 0,1
Such that
π£ ππ π 0, var π£π π
Thus, the numerator of
π½ 1 β π½1 = π£ π π2
has an asymptotic distribution that is normal
with a mean equal to zero. 2/03&05/2014 Econ 141, Spring 2014 66
Apply LLN to π π2
β’ Conditions 2&3 from slide 50 imply that we
can apply the Law of Large Numbers to π π2
and that π π2 converges in probability to the
variance of ππ, i.e. to var ππ .
β’ Thus
π π2
π var ππ
β’ Now that we know asymptotic behavior of
numerator and denominator of π½ 1 β π½1 = π£ π π2
the only thing left is to combine them.
2/03&05/2014 Econ 141, Spring 2014 67
Apply Slutskyβs theorem (S&W page 676)
Slutskyβs theorem implies that we can combine the
asymptotic properties of
π£ ππ π 0, var π£π π
and
π π2
π var ππ
such that
π½ 1 β π½1 = π£ π π2
π π 0,
var π£π π
var ππ
and thus
π½ 1 = π½1 + π£ π π2
π π π½1,
var π£π π
var ππ
2/03&05/2014 Econ 141, Spring 2014 68
π½ 1 has tractable asymptotic distribution
Thus as sample size π get large then estimated
slope coefficient has approximately a normal
distribution, such that
π½ 1~π ππ½ 1, ππ½ 1
where
ππ½ 1= π½1
ππ½ 1
2 =var π£π π
var ππ
2
=1
π
var π£π
var ππ2
2/03&05/2014 Econ 141, Spring 2014 69
π½ 1 has tractable asymptotic distribution
Thus as sample size π get large then estimated
slope coefficient has approximately a normal
distribution, such that
π½ 1~π ππ½ 1, ππ½ 1
where
ππ½ 1= π½1
ππ½ 1
2 =var π£π π
var ππ
2
=1
π
var π£π
var ππ2
2/03&05/2014 Econ 141, Spring 2014 70
Remember:
π π· π = π·π. Thus, π· π unbiased.
π½ 1 has tractable asymptotic distribution
Thus as sample size π get large then estimated
slope coefficient has approximately a normal
distribution, such that
π½ 1~π ππ½ 1, ππ½ 1
where
ππ½ 1= π½1
ππ½ 1
2 =var π£π π
var ππ
2
=1
π
var π£π
var ππ2
2/03&05/2014 Econ 141, Spring 2014 71
Remember:
ππ½ 1
2 π as π β. Thus, π· π consistent.
π½ 1 has tractable asymptotic distribution
Thus as sample size π get large then estimated
slope coefficient has approximately a normal
distribution, such that
π½ 1~π ππ½ 1, ππ½ 1
where
ππ½ 1= π½1
ππ½ 1
2 =var π£π π
var ππ
2
=1
π
var π£π
var ππ2
2/03&05/2014 Econ 141, Spring 2014 72
This term is like a noise to signal ratio.
The numerator is related to variance of the
residual
The denominator is related to the variance of the
explanatory variable
The large the variation in the explanatory variable
relative to the residual the more accurate the
OLS estimate.
Virtues of asymptotic normality
β’ Asymptotic normality of OLS coefficients
allows us to
β Do hypothesis tests
β Calculate confidence intervals
β’ Simple generalizations of same techniques
applied to the population mean.
β’ Chapter 5! Next week.
2/03&05/2014 Econ 141, Spring 2014 73
Summary
β’ Linear regression
model
β’ OLS estimators
β’ π 2 and ππΈπ
β’ OLS conditions
β’ Consistency of π½ 1
β’ Unbiasedness of π½ 1
β’ Asymptotic distribution
of π½ 1
β’ Even-numbered
problems:
4.2, 4.4, 4.6, 4.10,
4.12, 4.14
β’ Study STATA tutorial
β’ Empirical exercises
next week.
2/03&05/2014 Econ 141, Spring 2014 74