Intergenerational mobility Cheti Nicoletti ISER, University of Essex 2009.
-
Upload
gabriella-hart -
Category
Documents
-
view
222 -
download
1
Transcript of Intergenerational mobility Cheti Nicoletti ISER, University of Essex 2009.
Motivation
• Studies on intergenerational (im)mobility try to examine the association between children’s and parents’ socio-economic outcome (usually income, earnings, occupational class or prestige).
• It is believed that low intergenerational mobility is indicative of unequal opportunities between people born in advantaged and disadvantaged families and that policy should improve opportunities for those from disadvantaged backgrounds.
Motivation
• Notice that two societies could have the same level of inequality in earnings within a generation but a completely different level of intergenerational transmission of earnings.
• A society where the relative position of a person in the earnings distribution is exactly inherited from the parents’ one is considered unfair.
Previous studies• A first study of intergenerational transmission can be dated back to Galton
(1886) who regress children’s height on parents’ one. • More recently sociologists have studied intergenerational mobility and have
especially focused on the association between class positions of parents and children (see Erikson and Goldthorpe 1992).
• Economists have begun to be extensively interested in this topic after Becker and Tomes (1979, 1986) developed a model of the transmission of earnings, assets and consumptions from parents to children. Atkinson (1981) and Atkinson, Maynard and Trinder (1983) are the first papers to study intergenerational earnings mobility in the UK (men born in York in 1950 and traced to the late 1970s).
• Review article: Solon, G. (1999) Intergenerational mobility in the labor market, In O. Ashenfelter and D.Card (eds.) Handbook of Labor Economics, Volume 3, Chapter 29, 1761-1800. Amsterdam: Elsevier.
• Look for forthcoming chapter “Intergenerational income mobility and the
role of family background” in Handbook of Economic Inequality Markus Jäntti and Anders Björklund
Intergenerational mobility equation
)var(/),cov(
elasticity tionalInterngera
iii xyx
),0( 2uiiii iiduuxy
)var()var(
),cov(
ncorrelatioationalIntergener
ii
ii
yx
yx
2)(
))((
xx
yyxx
i
iiOLS
fathersfor (earnings)log
sonfor (earnings)log
i
i
x
sy
Data requirement• We need to observe a long run permanent measure (income,
earnings, occupational prestige) for both fathers and their children• Possible data sources:1. Panel data which follow individual across time (for example the
Panel Study of Income dynamic in US, the British Household Panel Survey)
2. Cohort studies which follow people born in a specific cohort from when born or other longitudinal studies (for example the British Cohort Studies 1958 and 1970, National Longitudinal Survey in the US)
3. Cross-sectional data with retrospective questions asked to children about their fathers (for example Oxford Mobility Study, which provides data on occupational class for children and their fathers when they, the sons, were 14 years old.)
Measurement error on father’s earnings
• We observe current earnings xit instead of a permanent measure of earnings xi
xit=xi +εit
where εit iid (0,σε2 ) and independent of xi and yi
• This implies that we underestimate β ( for β>0)
• Notice that a similar error for yit would not bias β, but it would bias the intergenerational correlation
)var(
),cov(
)var()var(
),cov(
)var(
),cov(~
i
ii
iti
ii
it
iit
x
yx
x
yx
x
yx
Solution suggested by Solon (1992) and Zimmerman (1992)
• If xit is observable for t=1,…T, we can consider
)(
),()(
)(
),(Tfor
~
)()(
),(
)(
),()(
)(
),(
)(
),(
)()()(
1 11
i
ii
iti
ii
iti
ii
it
ii
iti
ii
i
ii
itii
T
t
T
t
itT
ti
itiiti
xVar
yxCov
T
VarxVar
yxCov
VarxVar
yxCov
xVar
yxCov
T
VarxVar
yxCov
xVar
yxCovT
VarxVarxVar
Tx
T
x
T
xx
Life cycle bias
• Earnings (even if averaged) can be a bad proxy for permanent earnings if measured when people are too old or too young (Life cycle bias).
• Solution: Control with a polynomial in age or age dummies in the intergenerational equation
iifatherifather
isonisonii
uageage
ageagexy
2,4,3
2,2,1
Life cycle bias
1. Controlling for sons and fathers age in the intergenerational mobility equation can help in reducing the measurement error bias. But this correction is not enough if the earnings growth is heterogeneous across individuals.
2. Imposing and upper and a lower bound for sons and fathers age can be a solution (ex. Blanden et al 2004 and 2007, 30-33; Gershuny 2002, 34-36; Ermsich and Nicoletti 2007, 31-45).
3. Lee and Solon (2005) suggest to estimate an intergenerational mobility equation using sons observed at any age but allowing the elasticity to vary across cohort and son’s age. (See for example Ermsich and Nicoletti 2007)
Comparing Intergenerational across cohorts in Britain
• Comparing measure of intergenerational mobility across children (sons) born in different cohorts is very difficult because of data comparability and data availability issues.
• It is for example difficult to observe earnings for both children and their parents for very long cohort period
• Blanden, Gregg, MacMillan (2007, The Economic Journal) compared NCDS1958 and BC1970 (but the two datasets have a lot comparability issues) and found a negative trend in mobility
• Nicoletti, Ermisch (2007, The B.E. Journal of Economic Analysis & Policy) compared children born from 1950 to 1972 using the BHPS (but they use a two sample two stage procedure which usually provide an overestimation of the intergenerational mobility) and found no trend for the period 1950-72 and a slight negative but not significant trend for the 1960-1972 period
• Breene and Goldthorpe (2001, European Sociological Review) and Goldthrope and Jackson (2007, British Journal of Sociology) studied class mobility and found little change across the two generations when considering measures of exchange mobility, in contrast to the negative trend in mobility found in Blanden et al. (2004, 2007)) using the same two cohorts.
Advantages and disadvantages of panel data
Advantages• Repeated observations are available so that it is
easier to measure the long run economic status. • Panel surveys are usually based on representative
samples of the full population.
Disadvantages • Coresidence selection: Son must be living together
with his father in at least one wave of the panel.
1991 2003
BHPS 1991-2003
Child age
3
8
13
18
23
28
33
Child age
15
20
25
30
35
44
45
1997
Child age
9
14
19
24
29
34
39
Cohort
1988
1983
1978
1973
1968
1963
1958
How can the BHPS help us?
All BHPS respondents are asked to report occupational characteristics of their parents when they were 14
THEREFORE • We know the occupational prestige, say HG, even for
sons and fathers living apart during the panel.• We can estimate the correlation of HG without any
coresidence selection.• We can consider the subsample of sons coresident with
the fathers at least once during the panel and assess the relevance of the coresidence selection.
• We can then compare different methods to correct for the coresidence selection.
Taking account of coresidence selection
Francesconi and Nicoletti (Journal of Applied Econometrics, 2006) find that the intergenerational mobility in occupational prestige is underestimated when using the subsample of sons born between 1966 and 1985.
They try different estimation methods to correct for sample selection and find that only the inverse propensity score is able to attenuate the selection problem
This sample selection evaluation is possible because all BHPS respondents are asked to report occupational characteristics of their parents when they were 14
Coresidence
• If we select only children coresident with their parents in at least one wave of the panel, we can have a problem of sample selection. The subsample is probably not a random sample.
• This can cause a bias in the estimation of β
otherwise 0
coresidentchildren for 1id
iii xy fathersfor (earnings)log
sonfor (earnings)log
i
i
x
sy
g weightinscore propensity usecan we
then ,| iii xZdyIfi
Selection model (coresidence model)
Z=dummies for education, age, regions, ethnicity, religiosity and two house price indexes
iuZdii
'*
otherwise 0
coresidentchildren for 1)0( *
iiiddId
Normaliidif)()|1Pr(
logistic iid if)exp(1
)exp()|1Pr(
)()Pr()|0Pr()|1Pr(
score Propensity
'
'
'
''*
i
i
i
uZZd
uZ
ZZd
ZFZuZdZd
iii
i
i
ii
iiiiii
Selection problem
• Y=log son earnings X= log father earnings• Z=dummies for education, age, regions, ethnicity, religiosity and
two house price indexes• If ε depends of u Selection due to unobservables • Ex. unobserved work abilities which are related to y and to the
probability to leave parental home earlier• If ε depends of Z Selection due to observables • Ex: Y|X depends on Z (education and age) then ε depends on Z
and we have selection due to observables
iuZdii
'*
iii xy
otherwise 0
coresidentchildren for 1id
Selection due to observables:Propensity score weighting estimation
• Y=log son earnings X= log father earnings• Z=dummies for education, age, regions, ethnicity,
religiosity and two house price indexes• If ε and u are independent but ε depends on some of
the Z (for education, age, etc.) Weighted least square estimation of β with weights given by
iuZdii
'*
1'1 )()|1Pr( iiiZFZd
iii xy
otherwise 0
coresidentchildren for 1id
Coresidence selection modelVariable Coeff SE Variable Coeff SE
No qualification -0.684 0.391 Catholic -0.186 0.214
Other qualification 0.209 0.18 Protestant 0.149 0.149
GSCE/O level 0.142 0.131 Other denominations 0.079 0.369
First degree 0.072 0.129
South East -0.59 0.248
Age before leaving home 0.04 0.114 Rest of South East -0.458 0.201
Age square 0.004 0.003 Anglia/Midlands -1.285 0.254
North West -1.324 0.274
house price index LAD -4.074 0.431 Rest of North -1.656 0.27
house price index region 1.503 0.269 Wales -1.449 0.31
Scotland -1.586 0.278
Black -0.294 0.522
Indian 0.182 0.629 Constant 26.389 4.533
Pakistani/Bangladeshi 0.423 0.704 No. Obs 1090
Other Ethnicity 0.239 0.508 Pseudo R-square 0.333
Intergenerational mobility mediated by education
y=α+xβ+εIf ε and u are independent but ε depends on education
• Then to take account that ε is not independent of education (omitted variable) we can consider
y=αN+xβN+edu δ +ω Regression adjustment • βN is consistently estimated (because ω is independent of
u and of education)• β=Cov(x,y)/Var(x)=βN+Cov(x,edu)Var(x)-1δ• βN is the intergenerational association (elasticity) in
earnings net of the effect (or not mediated by) the education.
• (β-βN) is the intergenerational mobility in earnings mediated by the education
Another solution for the coresidence selection problem
using the BHPS
• PROBLEM: We can observe both sons’ and their fathers’ earnings only if they have been living together in at least 1 wave during the panel. This is possible in 12% of the cases in our sample (cohort period 1950-1972).
• SOLUTION: we use a TS2SLS estimator which combines two separate samples from the BHPS
The two-sample two-stage least squares (plim TS2SLS = plim TS2SIV)
The TS2SLS estimator combines two separate samples from the BHPS:
• 1st dataset (Full sample) containing information on sons’ earnings, and fathers’ education and occupational characteristics when sons were 14 years (collected through retrospective questions to sons);
• 2nd dataset (Supplemental sample) containing information on earnings and occupational characteristics of potential fathers (Hope-Goldthorpe index; the Cambridge scale; dummies to distinguish occupations in professional, managerial and technical, skilled non-manual, skilled manual and unskilled; 19 dummies for socio–economic groups).
References 2SIV: Angrist and Krueger (1992), Arellano and Meghir (1992), Ridder and Moffitt (2005), Inoue and Solon (2005)
Two-sample two-stage least squares estimator (TS2SLS) Combining the two samples
1. Estimation of the log earnings equation for fathers using the supplemental sample (imputation regression)
x=Z+v2. Estimation of the main equation using
the full sample and replacing (imputing) x
xPxpZx zpp lim̂
Previous studies on intergenerational mobility using TS2SLS estimator
The choice of instruments in previous studies is quite often dictated by the few variables available:• Bjorklund and Jantti (1997) use education level and occupation in
Sweden,• Fortin and Lefebvre (1998) use 16 occupational groups in Canada• Grawe (2004) uses education levels for Ecuador, Nepal, Pakistan
and Peru.
Our potential IV are instead given by:• Hope-Goldthorpe index; the Cambridge scale; dummies to
distinguish occupations in professional, managerial and technical, skilled non-manual, skilled manual and unskilled; 19 dummies for socio–economic groups, education level and age. (similarly to Lefranc A, Trannoy A. 2005)
Intergenerational earnings mobility(sons 31-45 and fathers 31-55)
First step: We estimate the imputation regressionWe regress x on the following IV • the log Hope-Goldthorpe index, • 4 dummies for managerial duties (self-employed, 1
manager, 2 foreman/supervisor, 3 not foreman/supervisor)
• 3 education level dummies (no qualification or some qualification, 1 further education qualification, 2 first degree or higher)
• age and age square • 2 cohort groups (1930-1938, 1939-1946)
Variable Coeff S.E. Variable Coeff S.E.
Log (Hope-Goldthorpe score) cohort 1930-38 0.51 0.123
Manager, cohort 1930-38 0.737 0.106
Log (Hope-Goldthorpe score) cohort 1939-46 0.452 0.112
Manager, cohort 1930-46 0.313 0.078
Education1, cohort 1930-38 0.126 0.073
Foreman/supervisor cohort 1930-38 0.437 0.11
Education1, cohort 1939-46 0.079 0.058
Foreman/supervisor cohort 1939-46 0.078 0.081
Education2, cohort 1930-38 0.395 0.145
No managerial duties cohort 1930-38 0.459 0.089
Education2, cohort 1939-46 0.256 0.096
No managerial duties cohort 1939-46 0.108 0.071
Cohort 1939-46 0.565 0.611 Age 0.259 0.097
Constant -1.599 2.537 Age2 -0.003 0.001
Number of observations 896
R2 0.259 Adjusted R-squared 0.246
Intergenerational earnings mobility(sons 31-45 and fathers 31-55)
Second step: y= + x + u
y = son’s log earnings; x = father’s log earnings; , and are coefficients; u is i.i.d (0, σ2)We estimate separately for rolling cohort groups1950-55, 1951-56, 1952-57, …, 1967-1972
Elasticities and correlations for single year earnings
-0.2
00
0.0
00
0.2
00
0.4
00
0.6
00
0.8
00
1950 1955 1960 1965cohort
Elasticity CorrelationLower CI Upper CI
Elasticities and correlations for average earnings
-0.2
00
0.0
00
0.2
00
0.4
00
0.6
00
0.8
00
1950 1955 1960 1965cohort
Elasticity CorrelationLower CI Upper CI
Another sample selection problem: Labour market selection
• The intergenerational occupational mobility can be estimated only for people who are employed.
• This means the we can use only the subsample of people who are employed and this can cause a selection bias.
• This selection can be especially relevant when estimating the association between earnings of daughters and fathers because women have usually more work interruptions.
Employment selection model
y=α+xβ+ε d*=Z γ+u d=l(d*>0)where y is the daughter’s occupational prestige (log Hope-
Goldthorpe score) x is her father’s occupational prestige
d=1 for daughters are employed at least in at least one wave and 0 otherwise
Z=occupational prestige father, dummies for education, age, regions, ethnicity, religiosity, marital status and number of children aged between 0-2, 3-4.
Intergenerational mobility and employment selection
If εi and ui are not independent then we have selection due to unobservables (For example unobserved work skills or cognitive skills which affect both the probability of employment and earnings for daughters)
If εi depends on Zi then we have selection due to observables (for example education, age and work experience which can affect both the probability of employment and the daughters’ earnings)
iii xy
iuZdii
'*)0( *
iidId where
Intergenerational mobility and employment selection
• Since εi (yi) depends on some of the variables Zi (for example education, age, etc.)
• We have first of all control for selection due to observables
• This is possible using for example propensity score weighting or regression adjustment.
iii xy iuZdii
'*
Propensity score weighting
• First step: Estimation of the binary model for the employment probability
• Second step: Weighted least square estimation(notice that the weights are estimated, so a proper estimation should
be adopted to estimate the standard errors)
)ˆ()|1r(P̂ ''* iiiZFZduZd iii
12
)|1Pr()(
))((
iiiii
iiiWLS Zdwhere
xx
yyxx
Employment selectionVariable Coeff SE Variable Coeff SE
No qualification -1.274 0.234 No. children 0-2 -0.69 0.149
Other qualification -0.473 0.22 No. children 3-4 -0.5 0.161
GSCE/O level -0.312 0.173 No. children 5-11 -0.25 0.101
First degree 0.004 0.206 No. children 12-15 -0.39 0.143
No. children 16-18 -0.25 0.443
Average age 0.562 0.155 South East 0.275 0.32
Average age square -0.01 0.003 Rest of South East 0.26 0.265
Anglia/Midlands 0.159 0.241
Fathers’s status 0.044 0.203 North West 0.378 0.314
Married 0.673 0.172 Rest of North -0.2 0.248
Wales 0.558 0.395
Black -5.953 2.15 Scotland -0.13 0.28
Indian -1.537 0.461 Constant -5.95 2.15
Pakistani/Bangladeshi -1.252 0.521 No. Obs 1090
Other Ethnicity -0.419 0.513 Pseudo R-square 0.265
Selection due to unobservables
If εi and ui are not independent then we have selection due to unobservables (For example unobserved work skills or cognitive skills which affect both the probability of employment and earnings for daughters)
Usually Heckman procedures are adopted, but these procedure are not very adequate in the case of intergenerational models because εi depends also on Z (omitted variables problems)
We need to control for selection on observables and unobservables too by for example combining Heckman and propensity score methods
iii xy iuZdii
'*
Correlation
Average 10th 25th 50th 75th 90th
Ignoringselection
0.303 0.285 0.447 0.310 0.312 0.246 0.084
(0.000) (0.028) (0.050) (0.048) (0.057) (0.037) (0.032)
Correcting for employment selection
Weighting 0.305 0.301 0.357 0.327 0.333 0.294 0.222
(0.000) (0.032) (0.058) (0.047) (0.050) (0.039) (0.041)
Heckman two-step
0.224 0.301 0.229 0.233 0.204 0.080
(0.029) (0.062) (0.043) (0.046) (0.033) (0.028)
WeightingHeckman
0.230 0.337 0.258 0.254 0.268 0.185
(0.030) (0.064) (0.065) (0.054) (0.046) (0.056)
Intergenerational mobility
Some references for regressions with sample selection
• Buchinski, M. (2001) Quantile regression with sample selection: Estimation women return to education in the U.S., Empirical Economics, 26, 86-113.
• Ibrahim, J.G., Chen, M.-H., Lipsitz, S.R., Herring, A.H. (2005) Missing-data methods for generalized linear models: A comparative review, Journal of the American Statistical Association, 100, 469, 332-346.
• Lipsitz, S.R., Fitzmaurice, G.M., Molenberghs, G., Zhao, L.P. (1997), Quantile regression methods for longitudinal data with drop-outs, Applied Statistics, 46, 463-476.
• Robins, J. M., Rotnitzky, A. (1995), Semiparametric Effciency in Multivariate Regression Models With Missing Data, Journal of the American Statistical Association, 90, 122-129.
• Vella F. (1998), Estimating models with sample selection bias: a survey', The Journal of Human Resources, vol. 3, 127-169.
• Wooldridge, J.M. (2007) Inverse probability weighted M-Estimation for General missing data problems, Journal of Econometrics, 141, 2, 1281-1301.
• Wooldridge, J.M. (2007) Inverse probability weighted M-Estimation for General missing data problems, Journal of Econometrics, 141, 2, 1281-1301.