The Hazards of Unemployment: A Macroeconomic Model of Job ...

The Hazards of Unemployment: A Macroeconomic Model of Job

Search and Résumé Dynamics

Ross Doppelt∗

August 14, 2016

Abstract

I introduce a dynamic general-equilibrium model to investigate the relationship between the duration

of unemployment and the probability of nding a job. Specically, I analyze the hypothesis that a long

unemployment spell sends a negative signal about a worker's quality, thereby aecting her probability

of being hired. In the model, skills are unobservable. I refer to a worker's posterior probability of

being highly skilled, conditional on her labor-market history, as the worker's résumé. Spending time in

unemployment damages a worker's résumé, which alters her job-nding rate: Because high-skill workers

are more likely to form matches, prospective employers infer that unmatched workers are less likely to

be highly skilled. The match surplus incorporates the fact that hiring a worker improves her résumé,

and the theory illustrates how the résumé value of being hired is priced into wages. I calibrate the model

to match data on job-nding rates as a function of duration. The average worker experiences signicant

negative duration dependence, and incomplete information also generates considerable heterogeneity in

job-nding rates. I extend the model to discuss how informational concerns interact with human capital

decay as a source of duration dependence. Finally, I discuss the theory's empirical predictions and

econometric implications.

∗Department of Economics, Penn State University. Contact: [email protected]. This paper previously circulatedunder the shorter title, The Hazards of Unemployment. Please download the most recent version on my website:https://sites.google.com/site/rossdoppelt/. This work is based on my doctoral thesis at New York University; I am grate-ful to my thesis advisor, Tom Sargent, and committee members, Ricardo Lagos and Gianluca Violante, for their feedback andcriticisms. I also beneted from discussions with many colleagues and numerous seminar participants. Any imperfections arepurely my own.

1

mailto:[email protected]

https://sites.google.com/site/rossdoppelt/

1 Introduction

In this paper, I build a theoretical macroeconomic model to investigate the relationship between the duration

of unemployment and the probability of nding a job. Figure A.1, constructed with data from the Current

Population Survey, shows that the probability of nding a job in the coming month is about 30 percentage

points lower for someone who has been looking for a year, relative to a newly unemployed worker.1 Recent

evidence from the empirical microeconomic literature, which I will discuss below, suggests that some of this

correlation is causal: Firms are less interested in hiring applicants who have been unemployed for a long

time. Economists have oered several hypotheses to explain this relationship; my focus will be on stigma

eects, the notion that the length of an unemployment spell sends a signal about worker quality. A job

seeker may fail to be hired because no rms saw her application, or because she was rejected by all of the

rms that did see her application. Prospective employers cannot observe an applicant's past rejections, but

rms will draw inferences from the duration of unemployment. If good workers are more likely to nd jobs,

then there is reason to think that someone who fails to nd a job is less likely to be a good worker. In

turn, an individual worker's job-nding rate will change as she spends time in unemployment, because rms'

demand for her services will change as the length of her unemployment spell reveals information about her

quality. Jobless spells therefore inict a kind of résumé damage, which aects a worker's future employment

and earnings prospects.

I explore this hypothesis by introducing a dynamic general-equilibrium model with heterogeneous agents,

incomplete information, and frictional labor markets. Workers in the model can have one of two skill levels,

either high or low, and this true type is not observed by anyone in the economy. Whereas skill is a permanent

feature of workers, productivity is stochastic and match-specic. The highly skilled are not necessarily more

productive, but they do have a more advantageous productivity distribution. Consequently, people's labor-

market outcomes reveal information about their skills at each stage of their working lives. For the unemployed

to be hired, they not only need to be seen by a rm; they need to realize a productivity draw good enough

to justify forming a match. After being hired, a worker's productivity continues to evolve, with the highly

skilled being more likely to enjoy productivity increases, less likely to suer productivity losses, and less

likely to experience separations.

Of central importance is a worker's probability of being highly skilled, conditional on her history of

unemployment and on-the-job performance. I will refer to this probability as a worker's résumé: Each

period, beliefs about a worker's skill level get updated according to Bayes's rule, so the résumé summarizes

a worker's background and experience, as they pertain to her future productivity. Search markets are

1Data details are in Appendix D.

2

segmented by expected skill level. When rms post vacancies, they solicit applications from workers with

a particular résumé, and the unemployed apply to the openings commensurate with their résumés. In the

market for résumé-r workers, a Pissarides-style matching function determines how many applications land

in the hands of rms. Prospective employers have access to a screening technology, which allows them to see

the applicant's productivity in her rst period on the job. If the productivity draw is good, then the rm

contacts the worker, and they begin production in the following period. If the draw is bad, the rm never

contacts the worker, who remains unemployed with a résumé updated to reect her failure to nd a job.

The job-nding hazard therefore depends on two things: (1) the relative demand for workers with dierent

résumés, and (2) the speed with which a worker's résumé deteriorates during unemployment. Moreover,

the amount of résumé damage associated with joblessness depends on how easy it is to be hired, which

underscores the importance of analyzing this problem in general equilibrium.

1.1 Main Results

The model allows me to make progress on three fronts. I begin by developing a novel theoretical framework

for analyzing informational dynamics in frictional labor markets. Then, I use a calibrated version of the

model to provide a quantitative exploration of the economic forces at work. Finally, I ush out the theory's

empirical predictions and econometric implications.

Informational concerns play an important role in determining match surpluses and, by extension, wages

and job-nding rates. The total surplus from a match equals the expected discounted value of production,

minus the worker's value of search. Both of these values depend on the worker's résumé. The match surplus

incorporates the fact that hiring a worker makes her résumé look better, and not hiring her makes her

résumé look worse. An employee's pay is determined by a linear surplus-splitting condition. Consequently,

wages include a compensating dierential for the résumé value of employment. In a sense, rms pay workers

for their time and output, while workers pay for the résumé improvement they get from being employed.

Using the model, I derive a wage decomposition that quanties the contribution of information to a worker's

take-home compensation.

Understanding the theory of how information is priced into wages is necessary for understanding the

association between résumés and job-nding rates. The model suggests that a worker's probability of being

hired is a hump-shaped function of her résumé. With full information, the most productive workers are in

the highest demand: The market for workers who denitely have high skills is tighter than the market for

workers who denitely have low skills. With incomplete information, however, the tightest markets are for

workers with good, but not pristine, résumés. Workers of uncertain quality are willing to take substantial

3

pay cuts to improve their résumés, whereas workers who are known to have high skills need to be paid high

wages. For rms posting vacancies, recruiting workers with mid-level résumés may result in lower expected

productivity, but higher expected prots. Thus, market tightness is a non-monotonic function of résumé.

After expositing the theoretical framework, I use the calibrated model to analyze the relationship between

job-nding rates and unemployment durations, on both the aggregate and individual levels. Despite the clear

pattern in Figure A.1, the drop in average job-nding rates does not directly imply that an individual's prob-

ability of being hired actually changes over the course of an unemployment spell. An alternative explanation

is that people have heterogeneous but time-invariant job-nding rates: In that case, fast job nders would

be hired after a short amount of time, so the long-term unemployed would comprise the people who had

the lowest job-nding rates all along. Disentangling the relative importance of dynamic selection, due to

unobserved heterogeneity, and true duration dependence, due to time-varying job-nding rates, has been

a contentious issue in the literature.2 The model features both of these mechanisms. Not only is duration

dependence endogenous, so is the distribution of résumés across workers, which determines the amount of

heterogeneity in job-nding rates. For the key parameters governing learning during unemployment and

skill heterogeneity, I calibrate the model to match the moments of the aggregate data that are of primary

interest, namely, average job-nding rates, as a function of duration. Because I do not target the hazard

curves of individual workers, this approach is fairly agnostic about what causes the pattern in Figure A.1.

On the individual level, the results paint a rich picture of true duration dependence. A worker's résumé

is a decreasing function of unemployment duration, but the job-nding rate is a hump-shaped function of

the worker's résumé. Individuals will have heterogeneously shaped hazards, depending on the résumés they

had when they entered unemployment. The baseline calibration suggests that the tightest market is for

workers with probability .80 of being highly skilled. The vast majority of unemployed workers have résumés

below this level, and for them, duration dependence is negative. That is, their job-nding rates decline

as their unemployment spells progress. For a worker entering unemployment with the average résumé, the

job-nding probability drops by nearly half after 52 weeks. As I discuss below, some of the most recent

and persuasive empirical studies nd negative duration dependence; in this respect, the model is broadly

consistent with the microeconomic evidence. But for a minority of workers with the highest résumés, duration

dependence is positive: Their job-nding rates actually go up from one period of search to the next, even

though expectations about their skills are revised downward. This result makes sense in light of how résumé

dynamics inuence wages. As high-résumé workers spend more time searching, they are eectively willing

to take larger pay cuts in order to stanch the stigma from joblessness. Nevertheless, after a suciently long

2Throughout, when discussing the role of composition eects in shaping the aggregate job-nding hazard, I will use theterms dynamic selection and unobserved heterogeneity interchangeably. I will reserve the term duration dependence torefer to actual changes in an individual's job-nding rate over the course of an unemployment spell.

4

unemployment spell, beliefs about a worker's skill will eventually deteriorate to the point that her job-nding

rate declines. In that sense, all workers will eventually experience negative duration dependence.

On the aggregate level, informational dynamics are necessary for the calibrated model's ability to ac-

count for the observed correlation between unemployment durations and job-nding rates. Partly, this is

a consequence of the fact that most workers have downward-sloping hazards. The model allows us to look

separately at the eects of true duration dependence and dynamic selection. Mechanically, the model fea-

tures a lot of dispersion in job-nding rates, and this heterogeneity can generate much of the drop o in

average job-nding rates, even in the absence of true duration dependence. Economically, though, stigma

and heterogeneity are not orthogonal phenomena. Instead, I argue that information eects are responsible

for creating much of the dispersion in job-nding rates. The dierence in job-nding rates between workers

with known skill levels is much smaller than the range of job-nding rates across workers with unknown skill

levels. When workers have heterogeneous skills, but there is full information about those skills, the aggregate

correlation between job-nding rates and durations is much less pronounced.

Human capital decay is the leading alternative explanation for negative duration dependence, and the

model can be extended to incorporate skill loss during unemployment and skill gain during employment.

Relative to the baseline model, this modication can actually attenuate the drop in job-nding rates. When

skills change over time, information about an unemployed worker's current skill level is less valuable. Ulti-

mately, the value of forming a match depends less on the résumé of the worker when she is hired. In turn,

job-nding rates become more uniform across the markets for workers with dierent résumés, so individual

hazards become atter.

Finally, the theory has useful implications for empirical research. Because information is revealed about

a worker's skill level every period, a worker's type will eventually be revealed after a suciently long time in

the marketplace.3 Consequently, older workers have less information revealed about them during an unem-

ployment spell. I document that the correlation between durations and job-nding rates is, in fact, weaker for

more experienced workers. That being said, dierent age groups will have dierent amounts of heterogeneity

in job-nding rates, as well as dierent amounts of true duration dependence. Thus, it's still necessary to have

a careful econometric strategy for disentangling true duration dependence from unobserved heterogeneity.

Fortunately, the theory also provides guidance for the specication and identication of econometric models,

in particular the mixed proportional hazard model. This framework provides a theoretical justication for

tting a mixed proportional hazard model, but only when looking at workers who enter unemployment with

comparable résumés. Moreover, to obtain identication of the proportional hazard model, the theory places

3In the model, the résumé distribution will never degenerate because workers die and are replaced by new-born agents, whoseskills are unknown.

5

restrictions on the information sets of econometricians, relative to agents in the economy.

1.2 Advancement of the Theoretical Literature

Other papers have looked at learning during unemployment as a source of duration dependence from a theo-

retical macroeconomic perspective. The most recent and closely related examples are Jarosch and Pilossoph

[2016] and Fernandez-Blanco and Preugschat [2015]; earlier contributions include Acemo§lu [1995] and Lock-

wood [1991].4 Those models unanimously conclude that learning during unemployment leads to negative

duration dependence, although the calibration exercises by Jarosch and Pilossoph [2016] and Fernandez-

Blanco and Preugschat [2015] suggest that dynamic selection eects are quantitatively more important.

A shared limitation of these theories is that they break the connection between a worker's current job

search and her future career outcomes and this modeling choice is crucial for the way agents assess the

value of information. In each of the above papers, a worker's type is perfectly revealed after a match is

made and production commences. This feature is coupled with one of two assumptions: Either a worker

remains employed with the same job until she dies, or all information about a worker is reset once she

reenters unemployment.5 Such simplications allow authors to make progress on other fronts. However,

these assumptions restrict the value of information: If a worker's planning horizon does not extend beyond

her next job, then her résumé becomes irrelevant for her subsequent employment and wage prospects. In

contrast, my results suggest that the résumé value of employment can be large when workers face the specter

of repeat unemployment spells. Consequently, employees accept lower wages in exchange for the résumé value

of being hired. In that respect, the model relates to the literature on career concerns, initiated by Holmström

[1999]. That line of work typically abstracts from unemployment in order to focus on eort provision.6

Instead, I show how preventing the résumé damage from unemployment can contribute signicantly to the

surplus generated by forming a match.

To the best of my knowledge, Gonzalez and Shi [2010] provide the only other model in which learning

takes place across multiple unemployment spells, but those authors nd that individuals have unambiguously

4Jarosch and Pilossoph [2016] propose a model with two-sided heterogeneity and positive assortative matching betweenworkers and rms. All workers are assumed to meet rms at a constant and exogenous rate, but as a worker's search dura-tion increases, fewer rms are willing to pay the cost of screening her. Fernandez-Blanco and Preugschat [2015] construct acontract-posting model in which rms oer wages contingent on unemployment duration. Acemo§lu [1995] combines informationasymmetry with human capital depreciation by positing that workers have to exert eort, which is unobservable, to keep theirskills from decaying during unemployment. The earliest general-equilibrium treatment of this problem appears in Lockwood[1991], who proposes a random-matching model in which rms can test candidates' abilities; because these tests are imperfect,search duration provides an additional signal of worker quality.

5Mathematically, these are almost identical. Lockwood [1991] and Acemo§lu [1995] make the former assumption; Jaroschand Pilossoph [2016] and Fernandez-Blanco and Preugschat [2015] make the latter.

6Gibbons and Murphy [1992], for instance, say that career concerns arise whenever the (internal or external) labor marketuses a worker's current output to update its belief about the worker's ability and then bases future wages on these updatedbeliefs (p. 469). The present model highlights the fact that the labor market updates its belief about the worker's ability onemployment status, as well as output, and these updated beliefs aect job-nding rates, as well as wages.

6

upward-sloping hazards. Gonzalez and Shi construct a directed-search model in which workers learn about

their ability while searching for a job.7 Qualitatively, those authors tell a story about discouragement

during unemployment: As spells drag on, workers infer that their ability is poor, so they target lower-wage

jobs in order to increase the probability of being hired. However, as I discuss below, the most convincing

microeconomic evidence suggests that true duration dependence is largely negative, which is dicult to

reconcile with Gonzalez and Shi's prediction of upward-sloping individual hazards.8 Nevertheless, on a

theoretical level, their result demonstrates that negative duration dependence is not a trivial outcome when

unemployment duration provides a signal of worker quality. I also nd that some workers will have increasing

job-nding rates because they are willing to accept lower wages; however, this outcome manifests in only a

minority of job seekers.

Relative to existing theories, my model incorporates richer dynamics for individuals' skills, as well as

information about those skills. One novelty of my model is the analysis of how skill change interacts with

informational dynamics. Of the above papers, only Acemo§lu [1995] looks at both skill loss and incomplete

information. In his model, skill change is necessary for the existence of duration dependence; in mine,

skill change can make duration dependence less pronounced. Finally, all of the above models assume that

learning takes place exclusively in unemployment. The framework that I propose allows beliefs to evolve

following hirings, separations, and changes in productivity. These features of the model respect the large

body of evidence suggesting that information about a worker is revealed throughout her career.9 It also

becomes easier to interpret the posterior beliefs about a worker's skill as a résumé when those beliefs reect

information from previous jobs.10

1.3 Relation to the Empirical Literature

This model speaks to the large empirical literature that tries to measure duration dependence using microe-

conomic data. The complete set of observational studies is too large to catalogue here, but the results are, in

the words of Ljungqvist and Sargent [1998], mixed and controversial.11 One reason for this discord is that

true duration dependence cannot be separated from dynamic selection eects in observational data without

imposing identifying assumptions, which are usually driven more by statistical practicality than economic

7In Gonzalez and Shi's model, skill corresponds to the probability that a worker can form a productive match, conditionalon meeting a rm. They assume that, conditional on being employed, all workers produce the same amount of output.

8To be fair, much of this evidence came out after Gonzalez and Shi [2010] was published. Also, because of dynamic selectioneects, Gonzalez and Shi's model can generate a negative correlation between job-nding rates and unemployment durationsin the aggregate, even though true duration dependence is positive.

9Seminal papers along these lines include Altonji and Pierret [2001] and Gibbons and Katz [1991].10Morchio [2016], who cites an earlier version of this paper, adopts the rhetorical device of referring to a worker's expected

skill level as her résumé. However, he focuses on learning from on-the-job productivity and separations throughout a worker'scareer. He abstracts from learning during unemployment, which is my main focus.

11See Machin and Manning [1999] and Van den Berg [2001] for reviews of the labor and econometrics literatures.

7

theory. Nevertheless, there exists good evidence for negative duration dependence.

Recently, authors have used eld experiments to circumvent the identication issues inherent in using

observational data, and the results are compelling. Oberholzer-Gee [2008], Kroft et al. [2013], and Eriksson

and Rooth [2014] sent out fake applications to real help-wanted ads, and the applications with the longest

gap since the last job were the least likely to elicit responses from rms. In addition to conducting a eld

experiment, Oberholzer-Gee [2008] conducted a survey of managers in charge of hiring decisions at Swiss

rms. For managers who prefer not to hire the long-term unemployed, the most important reason is the

belief that a productive worker would have already found a job. Exploiting the variation in labor-market

conditions across localities, Kroft et al. [2013] nd that duration dependence is more strongly negative in

places that have low unemployment.12 Those authors interpret their results as a sign of learning: Failing to

nd a job in a tight market sends a stronger signal about an individual's inability to get hired.

There is also direct evidence to support the search protocol in my model: Unemployment duration is

one hiring criterion that rms mention in help-wanted ads, and workers systematically apply to dierent

openings as their spells become longer. The news media has documented that some rms advertise vacancies

while stating an explicit preference for applicants who are currently or recently employed.13 These stories

conjecture that businesses expect the long-term unemployed to be lower quality workers. However, not all

rms that discriminate on the basis of unemployment duration are exclusionary. Although they draw less

attention from the popular press, there are also help-wanted ads that specically encourage applications from

the unemployed and underemployed.14 By welcoming unemployed applicants, some rms are likely trying

to nd people who can be recruited easily or paid cheaply. This demonstrates that rms target dierent

types of workers, with dierent histories of joblessness. Besides current unemployment status, most job

postings also specify desired levels of education and experience. Analogously, a rm in the theoretical model

solicits applications from people with a specic résumé, which is a composite of all past experiences of both

employment and unemployment. Furthermore, Kudlyak et al. [2014] provide evidence that workers apply for

lower-skill jobs as their unemployment spells grow longer. Exploiting longitudinal data from a job-posting

website, they nd that after several weeks of search, well-educated workers eventually apply to the same jobs

to which less-educated workers apply at the outset. As those authors point out, their results are consistent

with the hypothesis that agents use unemployment duration to learn about worker quality.15

I will proceed as follows. Section 2 describes the model environment. Section 3 denes an equilibrium

12Using U.S. data, Imbens and Lynch [2006] also nd that hazard curves are steeper in tight labor markets, although Biewenand Stees [2010] nd no such correlation in German data. Both of these studies use observational data, so they are subject tothe same qualications mentioned above.

13See, e.g., Rampell [2011] and Banjo [2012].14Searching for the words unemployed or unemployment on a job-search website will turn up many such postings.15Kudlyak et al. [2014] cite Gonzalez and Shi [2010] as one theory consistent with their results; my model is also consistent

with these ndings.

8

and discusses aggregation. In Section 4, I calibrate the model and choose parameters to match the aggregate

data on job-nding rates and search durations. Section 5 contains the results, including decompositions of

hazards and wages to quantify the importance of information eects. Sensitivity analysis is in Section 6.

Empirical implications are discussed in Section 7. Figures are in Appendix A, computations are discussed

in Appendix B, proofs are in Appendix C, and data details are in Appendix D.

2 Model Environment

The following subsections delineate: (1) the agents who inhabit the model, (2) the distinction between a

worker's skill level and her productivity, (3) the information structure of the economy, and (4) the details of

the matching process. Figures 2.1 and 2.2 show how these pieces t together. More specically, Figure 2.1

shows the timing of events for unemployed workers, and Figure 2.2 shows the timing of events for workers

in existing jobs. After describing technology, information, and matching, I present the agents' Bellman

equations and discuss wage determination.

2.1 Preferences and Demographics

There is a unit mass of risk-neutral workers. During periods of unemployment, workers get leisure value λ.

A worker also has a constant probability µ of dying from one period to the next. Each period, a measure µ

of workers die and are replaced by a measure µ of new workers. Agents discount the future at rate ρ; hence,

the eective discount factor is β ≡ 1−µ1+ρ . Firms seek to maximize the present discounted value of prots, and

the measure of rms is determined by free entry.

2.2 Skills and Productivity

There are two types of workers, those with high skills and those with low skills. A worker's skill level is

not the same as her productivity level; rather, high-skill workers take random productivity draws from a

more favorable distribution. A worker's output is given by zx, where z is aggregate productivity, and x is

match-specic. Match-specic productivity x evolves according to a Markov process that depends on the

worker's type. In particular, x has support X ≡ 0, x1, . . . , xnx, and in all states of the world, the process

governing x for high-skill workers stochastically dominates that for low-skill workers. Upon encountering a

rm, a high-skill worker draws a value of match-specic productivity from a distribution with probability

mass function Ω (x | 1); for low-skill workers, this probability mass function is Ω (x | 0). For the duration of

a job, the probability of a high-skill worker transitioning from x to x′ is given by Ω (x, x′ | 1); for low-skill

9

workers, this probability is given by Ω (x, x′ | 0). Conditional on a worker's type, x is independent across

matches: When a worker starts a new job, she takes a fresh draw from Ω (· | 0) or Ω (· | 1), independent

of her productivity on her last job. The distinction between high- and low-skill workers is captured by the

stochastic-dominance assumptions:

∑x≤X

Ω (x | 1) <∑x≤X

Ω (x | 0) . (2.1)

∑x′≤X

Ω (x, x′ | 1) <∑x′≤X

Ω (x, x′ | 0) . (2.2)

I will assume that there is always positive probability of realizing x = 0, which is an absorbing state.16 The

fact that 0 < Ω (0 | 1) < Ω (0 | 0) means that no one can perform all possible jobs, so even highly skilled

workers may fail to be hired. Similarly, the fact that 0 < Ω (x, 0 | 1) < Ω (x, 0 | 0) means that everyone has

some risk of entering unemployment.

2.3 Information and Filtering

In reality, rms cannot perfectly predict how fruitful an employment relationship will be; instead, prospective

employers draw inferences based on a candidate's history. Likewise, in the model, a worker's skill level will

not be directly observable, but rms will engage in a ltering problem to discern her ability using Bayes's

rule. To this end, I will adopt the following denitions.

Denition. For each worker, public information is the complete history of when the worker was unem-

ployed, when the worker was employed, and all realizations of x in jobs where the worker was employed.

Public information is available to workers and all prospective employers. Although information is incom-

plete, it is symmetric: Workers are also learning about their skills through the progression of their careers.17

Notice that public information includes realizations of x in jobs where a worker has been employed not

realizations of x that result in an unemployed worker's application being rejected. Also, if a worker holds a

job with productivity x, and a transition to x′ induces her to enter unemployment, then the realization x′ is

included in public information. That is, a prospective employer can see why an applicant left her last job.

For each newborn worker, there is a prior probability of that worker's being highly skilled. This probability

16To be clear, x dropping to zero does not make a worker permanently unproductive; it just makes her permanently unpro-ductive at her current job.

17It's fair to ask whether an employee's on-the-job productivity is publicly observable. The evidence from empirical microstudies is somewhat mixed: Schönberg [2007] and Kim and Usui [2013] suggest that this assumption is realistic for at least someworkers, though Kahn [2013] suggests that it is not. Nevertheless, the employer-learning literature commonly assumes thatsomeone's idiosyncratic productivity is observable to the entire marketplace, so current and prospective employers are using thesame noisy signals to form inferences about a worker's skill. For example, Altonji and Pierret [2001], Lange [2007], and Kahnand Lange [2014] take this approach.

10

is drawn from an exogenous distribution F (·). I will maintain the assumption that, amongst the workers

born with prior probability r of being highly skilled, a fraction r is, in fact, highly skilled.

Denition. A worker's résumé, denoted r ∈ [0, 1], is the posterior probability of that worker's being highly

skilled, conditional on public information.

A worker's initial résumé is just the prior probability of her being highly skilled. This value corresponds to

someone's qualications upon entering the labor market, based on features such as academic performance.18

Each subsequent period, a worker's résumé gets updated, depending on her labor-market experience. From

one period to the next, there are three things that can happen to a worker:

1. An unemployed worker can remain unemployed.

2. An unemployed worker can be hired by a rm with match-specic productivity x.

3. An employed worker's productivity can transition from x to x′, possibly resulting in separation.

Each of these transitions sends a dierent signal, so there are three belief-updating functions. If an unem-

ployed worker with résumé r fails to nd a job, then her updated résumé is denoted Bu (r). If an unemployed

worker with résumé r is hired with productivity x, then her updated résumé is denoted Bh (r | x). If an

employed worker has résumé r and her productivity transitions from x to x′, then her updated résumé is

denoted Be (r | x, x′).19 Concisely:

Bu (r) ≡ P [highly skilled | unemployed→ unemployed, résumé r] (2.3)

Bh (r | x) ≡ P [highly skilled | unemployed→ x, résumé r] (2.4)

Be (r | x, x′) ≡ P [highly skilled | x→ x′, résumé r] . (2.5)

The updating of beliefs will depend on the endogenous probability of making an employment transition.

Consequently, I will defer presenting the exact expressions for these functions until Section 2.9, after I have

described the matching process. When appropriate, I will denote the inverses of these functions by lowercase

letters; e.g., the function bh (r | x) satises r = Bh(bh (r | x) | x

).20

18If employers make inferences about worker quality on the basis of time-invariant characteristics, such as race or sex, thesefeatures would also be subsumed into the prior.

19Recall that a transition from x to x′ may or may not result in the continuation of a match. Regardless, beliefs will beupdated according to Be (r | x, x′) because the transition from x to x′ is assumed to be observable.

20It will become apparent in Section 2.9 that Bh (· | x) and Be (· | x, x′) are guaranteed to be invertible in their rst arguments.In the numerical exercises, Bu (·) is invertible as well.

11

Figure 2.1: Timing for Unemployed Workers

It will be convenient to dene for r ∈ [0, 1]:

Ω (x | r) ≡ rΩ (x | 1) + (1− r) Ω (x | 0) . (2.6)

Ω (x, x′ | r) ≡ rΩ (x, x′ | 1) + (1− r) Ω (x, x′ | 0) . (2.7)

Notice that each of the above quantities is a conditional probability. In particular, Ω (x | r) is the posterior

predictive distribution of productivity for a newly hired worker with résumé r. Likewise, Ω (x, x′ | r) is the

probability of productivity transitioning from x to x′ for an employee with résumé r.

2.4 Matching

The economy features a continuum of labor markets, indexed by r ∈ [0, 1]. There is one search market

for each résumé value. When a rm posts a vacancy, it solicits applications from workers with a specic

résumé r; in turn, workers with résumé r respond to the rms seeking to hire employees like them. With

some probability, a prospective rm sees a worker's application; below, I will detail the matching technology

for determining this probability. Firms are assumed to have some screening ability, which allows them to

discern whether a particular worker will be a good match. Specically, the prospective employer can observe

the worker's initial value of x. If the employer decides the match is worthwhile, they contact the worker,

12

Figure 2.2: Timing for Employed Workers

and they begin production in the following period. The timing of the matching process, depicted in Figure

2.1, ensures that information between employers and employees will be symmetric: A worker will only be

contacted by a rm if her realization of x makes for a viable match, so a worker who remains unemployed

never nds out if she had a bad x draw, or if her application was simply never seen.

This search protocol has both conceptual and technical justications. Segmenting the search market

along the dimension of expected skill is realistic: Help-wanted ads typically specify that a job is intended for

people with a certain level of experience, set of credentials, or history of unemployment, as I have already

documented. The setup also implies that workers with dierent résumés do not congest one another's search.

In a number of industries, rms have separate recruitment processes for junior and senior positions, so this

assumption provides a reasonable approximation of the hiring process. From a model-making perspective,

I am assuming that search markets are segmented in order to make the solution tractable.21 Without

segmented search markets, a rm posting a vacancy would have to know the full distribution of résumés in

the unemployed population. The state space is reduced dramatically when all workers within a given market

have the same résumé.

In each market r ∈ [0, 1], the number of meetings is determined by a constant-returns-to-scale function

of vacancies and searchers. Denote by θ (r) the market tightness, or the ratio of vacancies to unemployed

workers, in market r. If a worker is searching in market r, then the probability of her application being seen

21Elsewhere, in the directed-search literature, authors focus on the endogenous segmentation of markets. Examples includeGonzalez and Shi [2010] and Menzio and Shi [2010, 2011]. In addition to delivering tractability, the search protocol in thepresent model is clearly a normal-form Nash equilibrium: Workers with résumé r play the strategy of searching in marketr, and rms posting vacancies in market r play a strategy of considering only those applicants who have résumé r. Giventhe recruitment strategies of rms, there's no reason for workers to deviate from their search strategies, and given the searchstrategies of workers, there's no reason for rms to deviate from their recruitment strategies.

13

by a rm is:

p (r) ≡ minζθ (r)

1−ε, 1. (2.8)

The probability of a rm encountering a worker in market r is:

q (r) ≡ minζθ (r)

−ε, 1. (2.9)

2.5 The Firm's Problem

As will become clear in Section 2.7, the surplus of a match will depend not only on the worker's résumé,

but also on the résumé the worker would have if she were not employed. This counterfactual résumé, and

therefore wages, will depend on whether the worker was employed in the previous period. Let s ∈ e, h

denote a worker's tenure status: s = h signies that a worker is newly hired, and s = e signies that a worker

is in a continuing employment relationship.

Let Gs (r, x) be the value associated with owning a rm matched with a worker in state (r, x, s). Let

ws (r, x) be the wage paid to a worker in state (r, x, s); I will detail the wage determination process in Section

2.7. Firm owners have an outside option of zero. An active rm's Bellman equation is:

Gs (r, x) = zx− ws (r, x) + β∑x′

Ω (x, x′ | r) max Ge (r′, x′) , 0 (2.10)

s.t.: r′ = Be (r | x, x′) .

Let V (r) be the value associated with posting a vacancy in market r. For a potential rm, there is a constant

cost κ to maintaining a vacancy in any market in a given period. With probability q (r), the rm encounters

a worker. The potential rm's Bellman equation is:

V (r) = −κ+ β

([1− q (r)]V (r) + q (r)

∑x

Ω (x | r) max Gh (r′, x) , V (r)

)(2.11)

s.t.: r′ = Bh (r | x) .

Free entry requires that:

V (r) = 0, ∀r. (2.12)

Hence, the value of posting a vacancy in any market must satisfy:

κ = βq (r)∑x

Ω (x | r) maxGh(Bh (r | x) , x

), 0. (2.13)

14

2.6 The Worker's Problem

Let Hs (r, x) be the value associated with being an employed worker in state (r, x, s); let U (r) be the value

associated with being an unemployed worker with résumé r. The employed worker's Bellman equation is:

Hs (r, x) = ws (r, x) + β∑x′

Ω (x, x′ | r) max He (r′, x′) , U (r′) (2.14)

s.t.: r′ = Be (r | x, x′) .

The unemployed worker's Bellman equation is:

U (r) = λ+ β

([1− p (r)]U (r′u) + p (r)

∑x

Ω (x | r) max Hh (r′h, x) , U (r′u)

)(2.15)

s.t.: r′h = Bh (r | x) and r′u = Bu (r) .

2.7 Wage Determination

I will assume that wages are determined each period by a linear surplus-splitting rule, with a fraction η of

the total match surplus going to the worker. This value will depend on the counterfactual résumé that the

worker would have if a new match were not formed or if an extant match were not maintained. Consequently,

there are separate surplus-spitting conditions for new hires (s = h) and continuing employees (s = e).

First, consider an unemployed worker searching in market r. If she is not hired, then her résumé is updated

to Bu (r), and she receives value U (Bu (r)). If she is hired with productivity x, then her résumé is updated to

Bh (r | x), and she receives value Hh

(Bh (r | x) , x

). The worker's surplus is therefore Hh

(Bh (r | x) , x

)−

U (Bu (r)). A rm that successfully hires a worker in market r gets value Gh(Bh (r | x) , x

); free entry

ensures that rm owners get nothing if they fail to make a hire. Hence, the surplus-splitting condition for

newly hired workers is:

Hh

(Bh (r | x) , x

)− U (Bu (r)) = η

[Gh(Bh (r | x) , x

)+Hh

(Bh (r | x) , x

)− U (Bu (r))

]. (2.16)

The left-hand side of the above is the worker's surplus, and the term in square brackets on the right-hand

side is the combined surplus of the worker and the rm.

Now, consider a worker in an existing employment relationship who, in the previous period, produced

x− with résumé r−. If productivity transitions to x, then the worker's résumé becomes r = Be (r− | x−, x).

Because the complete history of x is assumed to be observable in any consummated match, the worker's

résumé is unaected by staying on the job for an additional period. Consequently, the worker gets value

15

He (r, x) from continuing employment, and she would get value U (r) if she entered unemployment. The

rm's value from maintaining the match is Ge (r, x), and the rm's value from dissolving the match is zero.

Hence, the surplus-splitting condition for continuing employees is:

He (r, x)− U (r) = η [Ge (r, x) +He (r, x)− U (r)] . (2.17)

Again, the left-hand side of the above is the worker's surplus, and the term in square brackets on the

right-hand side is the combined surplus of the worker and the rm.

More succinctly, we can evaluate (2.16) at résumé bh (r | x) and consolidate it with equation (2.17) to

obtain:

ηGs (r, x) = (1− η) [Hs (r, x)− U (rc)] (2.18)

rc ≡

Bu(bh (r | x)

)if s = h

r if s = e.

In the above, rc represents the counterfactual résumé that the worker would have if unmatched.

When interpreting this wage-determination process, some caveats are in order. In many models, linear

surplus-splitting conditions can be supported in two dierent ways: (1) as the axiomatic solution to a

cooperative game, or (2) as the subgame-perfect equilibrium of a non-cooperative game. The former is

associated with Nash [1950]; the latter, with Rubinstein [1982]. Some authors, such as Pissarides [1994],

directly assert a surplus-splitting rule as a primitive assumption, without invoking any specic game-theoretic

foundations.22 In the present model, any of the standard justications are valid for the surplus-splitting

condition in continuing matches (2.17). For new matches, however, Rubinstein's alternating-oers protocol

does not apply: If a worker were contacted by a rm, but (inexplicably) declined to accept any oer, then

she would continue searching with the knowledge that she was worth hiring. In that case, the worker would

have a private signal of her own skill, relative to her next employer. Non-cooperative bargaining under

asymmetric information is an interesting problem, but it's a non-trivial one that falls outside the scope of

the current endeavor.23

Alternatively, the Nash [1950] solution species a fair bargain (p. 158), dened as an allocation that

satises a collection of axioms. This approach takes as parameters the feasible payos when agents cooperate,

22In the environment developed by Pissarides [1994], the space of payos is non-convex, which creates complications for eitherNash [1950] or Rubinstein [1982] bargaining. See Shimer [2006] for a discussion.

23Firms have no reason to contact workers unless they expect to reach a bargain, so information would continue to be sym-metric in an equilibrium. Nevertheless, solving the non-cooperative bargaining game requires computing the payos associatedwith deviations from equilibrium outcomes, including deviations that lead to information asymmetries.

16

along with utility oors that agents obtain without cooperating. Equation (2.16) species that an unemployed

worker's payo from not forming a match is U (Bu (r)), the value of continuing in unemployment with

public information. This modeling choice sidesteps the complications that would arise with information

asymmetries. As a plausible defense, one can imagine that workers and rms send their applications and

openings to employment agencies. In the event of a compatible match, the agency applies the Nash axioms

to determine a fair bargain and connects the trading parties. An unemployed worker would want to be

contacted with any contract worth more than U (Bu (r)), and the rm would want to be contacted with any

contract worth more than zero. Those conditions will both be satised if wages are determined according to

equation (2.18).

2.8 Policies

Agents decide which matches are worth forming and maintaining. In light of the surplus-splitting rules, a

match is worthwhile for a worker if, and only if, the match is worthwhile for the rm. The set of match-specic

productivities that will result in a worker with résumé r being hired is:

Xh (r) =x ∈ X | Gh (r′h, x) +Hh (r′h, x) ≥ U (r′u) , r′h = Bh (r | x) , r′u = Bu (r)

. (2.19)

In subsequent periods, agents decide whether to continue or allow the employment relationship to dissolve.

The set of match-specic productivities that will perpetuate a match with a worker who has résumé r and

current productivity x is:

Xe (r | x) = x′ ∈ X | Ge (r′e, x′) +He (r′e, x

′) ≥ U (r′e) , r′e = Be (r | x, x′) . (2.20)

2.9 Beliefs

Having dened the meeting rates and the policy functions, it is now possible to compute the belief-updating

rules. Applying Bayes's rule yields:

Bu (r) =1− p (r)

∑x∈Xh(r) Ω (x | 1)

1− p (r)∑x∈Xh(r) Ω (x | r)

r (2.21)

Bh (r | x) =Ω (x | 1)

Ω (x | r)r (2.22)

Be (r | x, x′) =Ω (x, x′ | 1)

Ω (x, x′ | r)r. (2.23)

17

Notice that the amount of information revealed by an unemployment spell is endogenous, because Bu (r)

depends on the equilibrium job-nding rate. All workers searching in market r, regardless of their actual

skill level, have the same probability p (r) of being seen and screened by a rm. A genuinely high-skill worker

with résumé r has a job-nding rate of p (r)∑x∈Xh(r) Ω (x | 1), whereas the job-nding rate for the average

worker with résumé r is p (r)∑x∈Xh(r) Ω (x | r). Provided that Xh (r) consists of all values of x meeting or

exceeding a reservation productivity, high-skill workers will have a greater probability of being hired than

low-skill workers with the same résumé, because Ω (· | 1) stochastically dominates Ω (· | 0). This feature has

two implications. First, Bu (r) < r, so a worker's résumé will, in fact, deteriorate during unemployment.

Second, the information-updating process will occur more rapidly for workers searching in tight markets:

Holding Xh (r) constant, Bu (r) is decreasing in p (r). Although unemployment will always send a bad signal,

the strength of that signal depends on aggregate conditions. Failing to nd a job in a slack market is more

indicative of rms' tepid appetite for hiring, whereas failing to nd a job in a tight market is more indicative

of a worker's ability to make a match.

3 Equilibrium

3.1 Denition

We are now prepared to dene an equilibrium.

Denition. A recursive search equilibrium comprises:

1. Value functions Gs (r, x), V (r), Hs (r, x), and U (r)

2. Policies Xh (r) and Xe (r | x)

3. Contact rates p (r) and q (r)

4. A market-tightness function θ (r)

5. A wage function ws (r, x)

6. Belief functions Bu (r), Bh (r | x), and Be (r | x, x′)

subject to:

1. Bellman equations: (2.10), (2.11), (2.14), and (2.15)

2. Optimal match formation and dissolution: (2.19) and (2.20)

3. Matching technology: (2.8) and (2.9)

4. Free entry (2.12)

5. Surplus splitting (2.18)

6. Bayesian updating: (2.21), (2.22), and (2.23).

18

Several comments are in order. I deliberately constructed the equilibrium so that it has a tractable

solution. In particular, the assumption of segmented search markets ensures that policies, wages, and worker

ows do not depend on the distribution of workers over states.24 However, with the equilibrium objects in

hand, the distributional dynamics become a matter of accounting, which I will lay out in Section 3.2. The

model's structure makes it manageable to solve using the method described in Appendix B. That appendix

reduces the equilibrium conditions to a system of functional equations for the market-tightness function,

the worker's value of unemployment, and the joint value of employment to the worker and the rm. The

equilibrium objects have to be the xed point of a functional operator, so this line of argument suggests an

iterative algorithm for solving the model numerically.

Even though such a computational approach only establishes the existence of an equilibrium, I conjecture

that the equilibrium is unique. In other models of endogenous information revelation, such as Lockwood

[1991] and Acemo§lu [1995], there can be multiple equilibria.25 In those models, as in mine, the probability

of nding a job determines the informational content of a worker's joblessness and the skill composition of

the unemployed workforce. The crucial dierence is, in those models, workers with all dierent expected

skill levels apply to the same vacancies, so prospective employers care about the distribution of information

and skills in the unemployed population. In such environments, self-fullling equilibria are possible: Firms

are reluctant to hire, which makes workers spend more time in unemployment, which makes the pool of

unemployed applicants look worse, which makes rms reluctant to hire. That mechanism for multiplicity

is absent in my model because a rm can choose to seek a worker with a specic expected skill level.

Consequently, rms' hiring decisions alter the distribution of unemployed workers across markets, but the

skill composition of workers within each market remains unchanged. Moreover, there are two markets in this

economy that have full information, corresponding to r = 0 and r = 1. Both of these markets are essentially

discrete-time versions of the environment created by Mortensen and Pissarides [1994], which is known to

have a unique solution. It would be somewhat surprising if equilibrium outcomes were uniquely determined

in markets r ∈ 0, 1, but not in markets r ∈ (0, 1).

3.2 Aggregation

As demonstrated above, we can solve for the model's main ingredients independently of the distribution of

workers across states. However, to compute aggregate statistics, it's necessary to know this distribution,

which is determined endogenously. Let F e (r | x) be the measure of employed workers in continuing employ-

ment relationships with productivity x whose résumé is less than or equal to r. Let Fh (r | x) be the measure

24Shi [2009] refers to this property as block recursivity.25Like me, Gonzalez and Shi [2010], Fernandez-Blanco and Preugschat [2015], and Jarosch and Pilossoph [2016] cannot

provide armative proof of uniqueness, but they cannot nd instances of more than one equilibrium, either.

19

of newly hired workers with productivity x whose résumé is less than or equal to r. Let Fu (r) be the mea-

sure of unemployed workers whose résumé is less than or equal to r. Keep in mind that these functions are

measures, though not proper probability distributions; i.e., Fu (1) +∑x F

h (1 | x) + F e (1 | x) = 1. Recall

that dF (r) is the measure of workers born with résumé r, and a fraction r of these dF (r) workers were

actually born with high skills. Because the prior is assumed to be rational and agents engage in Bayesian

updating, r will be the fraction of résumé-r workers who are indeed highly skilled.

I will dene a functional operator that characterizes the law of motion for these measures. We want an

operator that maps(F e, Fh, Fu

)today into

(F e, Fh, Fu

)′tomorrow, given the components of the equilibrium

dened in Section 3.1. Let Fe, Fh, and Fu be the respective spaces in which these measures must reside.

Dene the mapping Υ : Fe ×Fh ×Fu → Fe ×Fh ×Fu by:

Υ1

(F e, Fh, Fu

)(r′, x′) = (1− µ)

∑x

ˆBe(r′|x,x′)

Ω (x, x′ | r) d[F e (r | x) + Fh (r | x)

](3.1)

Υ2

(F e, Fh, Fu

)(r′, x′) = (1− µ)

ˆBh(r′|x′)

p (r) Ω (x′ | r) dFu (r) (3.2)

Υ3

(F e, Fh, Fu

)(r′) = (1− µ)

∑x,x′

ˆBd(r′|x,x′)

Ω (x, x′ | r) d[F e (r | x) + Fh (r | x)

]

+ (1− µ)

ˆBu(r′)

1− p (r)∑

x∈Xh(r)

Ω (x | r)

dFu (r) + µF (r′) , (3.3)

where I have dened:

Be (r′ | x, x′) ≡ [0, be (r′ | x, x′)] ∩ r | x′ ∈ Xe (r | x) (3.4)

Bh (r′ | x) ≡[0, bh (r′ | x)

]∩r | x′ ∈ Xh (r)

(3.5)

Bd (r′ | x, x′) ≡ [0, be (r′ | x, x′)] ∩ r | x /∈ Xe (r | x) (3.6)

Bu (r′) ≡ r | Bu (r) ≤ r′ . (3.7)

The measures(F e, Fh, Fu

)then evolve according to:

(F e, Fh, Fu

)′= Υ

(F e, Fh, Fu

). (3.8)

To understand the logic of (3.1), notice that dF e (r | x) + dFh (r | x) is the measure of employed workers

with résumé r and productivity x. Of these workers, a fraction (1− µ) Ω (x, x′ | r) will survive to the

following period with productivity x′ and résumé Be (r | x, x′). These workers will continue in their jobs

if x′ ∈ Xe (r | x), and their updated résumés will be weakly less than r′ if r ∈ [0, be (r′ | x, x′)]. Thus,

20

integrating (1− µ) Ω (x, x′ | r) with respect to F e (r | x) + Fh (r | x) over r ∈ Be (r′ | x, x′) and summing

over x yields F e (r′ | x′), the measure of workers in continuing jobs with productivity x′ and résumé less

than or equal to r′. Concisely, (F e)′

= Υ1

(F e, Fh, Fu

). Similar reasoning accounts for the components of

(3.2) and (3.3). Aggregate employment is given by:

e ≡∑x

[F e (1 | x) + Fh (1 | x)

]. (3.9)

Likewise, aggregate output is given by:

y ≡ z∑x

x[F e (1 | x) + Fh (1 | x)

]. (3.10)

Consequently, the evolution of unemployment and output will always depend on the full distribution of

workers across states.

4 Calibration

One time period in the model corresponds to one week. Some the model's parameters are common in

search models, so I will take values used elsewhere in the literature. The rate of time preference is ρ = .0452 ,

corresponding to an annual real interest rate of approximately four percent. The probability of an agent

dying from one period to the next is 140×52 , implying that the average working life lasts forty years. Workers

and rms have equal surplus shares, with η = 12 . Aggregate productivity is normalized as z = 1. The ow

value of leisure is λ = 12 . As will be clear in a moment, the consumption of unemployed workers will equal

half the output of a newly hired worker; this value is in line with other papers in the literature (c.f. Shimer

[2005] and Hall [2005]). Following much of the literature, I will set the elasticity of the matching function to

ε = 12 . The matching function's scaling factor ζ will be chosen to match the steady-state unemployment rate

using a procedure I will describe below. Hall and Milgrom [2008] set the per-period cost of maintaining a

vacancy to 43% of one period's productivity. As will become clear, a newly hired worker produces one unit

of output, so I will set κ = .43.

The details of the productivity processes are as follows. Recall that the support of x is X = 0, x1, . . . , xnx.

I will assume that log (x1) , . . . , log (xnx) constitute a 21-point, equally spaced grid from -1 to 1. There

are two stochastic processes for x, one for each skill type. The parameters governing the evolution of x for

high-skill workers are indexed by i = 1, and the parameters governing the evolution of x for low-skill workers

are indexed by i = 0.

21

For workers in continuing employment relationships, I will assume that log (x) follows a discrete approx-

imation to a random walk with drift, the parameters of which depend on the worker's skill level. There is

a constant, type-specic probability δi of productivity dropping to zero. Conditional on remaining positive,

x has a constant, type-specic probability αi of taking a step up; there is also a constant, type-specic

probability χi of x taking a step down. That is, for 1 < k < nx:

Ω (xk, x′ | i) =

(1− δi)αi if x′ = xk+1

(1− δi) (1− αi − χi) if x′ = xk

(1− δi)χi if x′ = xk−1

δi if x′ = 0.

(4.1)

At the endpoints of the positive support (i.e. x = x1 and x = xnx) productivity can only move in one

direction, so I will specify:

Ω (x1, x′ | i) =

(1− δi)αi if x′ = x2

(1− δi) (1− αi) if x′ = x1

δi if x′ = 0

Ω (xnx , x′ | i) =

(1− δi) (1− χi) if x′ = xnx

(1− δi)χi if x′ = xnx−1

δi if x′ = 0.

(4.2)

The stochastic dominance of Ω (x, x′ | 1) over Ω (x, x′ | 0) is captured by the parameter restrictions α1 >

α0, χ0 > χ1, and δ0 > δ1. In other words, high-skill workers have a higher probability of experiencing

productivity gains and a lower probability of experiencing productivity losses.

A diculty in choosing specic values for αi, χi, δii=0,1 is that these parameters quantify the dierence

between types, and we're interested in a situation where a worker's type is not directly observable. In models

with only one type of worker, or with worker types corresponding to observable categories in the data, authors

typically use parameters that are considered reasonable in light of existing microeconomic estimates. My

calibration strategy will be to choose two sets of parameters within the range of reason, but with one on the

high side, and the other on the low side. In Section 6, I will conduct sensitivity analysis to assess the impact

of various parameter choices. Conditional on x remaining positive (and not being at an endpoint of X ), the

mean and variance of log productivity growth are:

E[log

(x′

x

)| x1 < x < xnx

, 0 < x′, i

]= (αi − χi) dx. (4.3)

V[log

(x′

x

)| x1 < x < xnx , 0 < x′, i

]=

[(αi + χi)− (αi − χi)2

]d2x, (4.4)

22

where dx ≡ log (xnx/x1) / (nx − 1) is the increment size of the log productivity grid. For each productivity

process i, the above two moments will guide my choice of the two parameters αi and χi. In the model, wages

and productivity are not identical, yet the quantitative results will show that productivity is highly correlated

with wages for workers who are in continuing employment relationships. Consequently, my calibration for

αi, χii=0,1 will be motivated by existing estimates for wage and earnings processes. I will set the conditional

mean of log(x′

x

)to .20

52×10 for high-skill workers and .1052×10 for low-skill workers. So, over ten years, match-

specic productivity is expected to grow by about 20% for a high-skill worker and by about 10% for a low-skill

worker. The rationale for these numbers comes from the empirical literature on the return to tenure, which

labor economists typically interpret as the growth rate of match-specic human capital. The calibrated

values fall within the range suggested by existing studies: Altonji and Shakotko [1987] nd a cumulative

ten-year growth rate of 6.6%, whereas Topel [1991] nds a ten-year growth rate in excess of 25%.26 For both

skill types, I will set the variance of log(x′

x

)to .0313

52 . The justication for this choice comes from Meghir

and Pistaferri [2004], who estimate the variance of permanent innovations to log earnings; they obtain an

unconditional variance of .0313 when using annual data pooled across dierent education groups. I will set

δ0 = 152×2.25 and δ1 = 1

52×2.75 . These values imply that the average low-skill worker lasts 2.25 years before

suering a terminal productivity shock, whereas the average high-skill worker lasts 2.75 years.27 As a point

of reference, Shimer [2005] calibrates separation rates so that the average job lasts 2.5 years.

Whereas x follows a random walk with drift over the course of a job, we need to specify how x is initialized

when a match is formed. For newly hired employees, I will assume that there is a type-specic probability

ωi that x = 1; with probability 1 − ωi, x = 0. That is, the initial value of match-specic productivity has

the following distribution:

Ω (x | i) =

ωi if x = 1

1− ωi if x = 0.

(4.5)

Because x = 0 is an absorbing state, we can interpret ωi as the probability of a worker being able to do a job,

conditional on her true type. For Ω (x | 1) to stochastically dominate Ω (x | 0), it's necessary and sucient

that ω1 > ω0. An advantage of this parsimonious specication is that the dierence in job-nding rates

between high- and low-skill workers can be summarized by the dierence between ω1 and ω0; that is, high-

skill workers with résumé r nd jobs with probability ω1p (r), whereas low skill workers with résumé r nd

jobs with probability ω0p (r). The parameters ω0 and ω1 will be important for determining the informational

26For a more recent review of the literature and evidence, see Altonji and Williams [2005].27The model features endogenous separation because x can gradually drift downward to the point that a match is no longer

desirable. Strictly speaking, δi is not a type-specic separation rate, but a lower bound on the separation rate. In the numericalexercises I perform, the majority of separations will come from x dropping to zero immediately, as opposed to declining gradually.

23

content of unemployment spells because the belief-updating function is:

Bu (r) =1− ω1p (r)

1− ωrp (r)r, (4.6)

where I have dened ωr ≡ (1− r)ω0 + rω1. For a given meeting rate p (r), the dierence between ω1 and ω0

governs how quickly a worker's résumé is updated while she is searching for a new job: Bu (r) is decreasing

in ω1 − ω0, and Bu (r) = r when ω1 = ω0.

It remains to to pick numerical values for ω1, ω0, the matching eciency ζ, and the résumé distribution

for newly born agents F (r). To accomplish this, I will select parameters to match the primary moments of

interest, namely, the steady-state unemployment rate and the population job-nding rate as a function of

duration. To economize on free parameters, I will assume that F (r) is Beta(αF , 10− αF ). This implies that

the fraction of high-skill workers in the economy is αF

10 . I will also adopt the normalization that ω0 +ω1 = 1.

This normalization comes with minimal loss of generality: When p (r) is strictly less than one, doubling ζ is

observationally equivalent to cutting both ω0 and ω1 in half. What matters is the dierence ω1 − ω0. I will

conduct a grid search over values of ω1 − ω0 and αF in order to minimize the quadratic deviation between

the population job-nding hazard implied by the model and the empirical job-nding hazard constructed

using CPS data.28 I exclude workers who report being on layo as their reason for unemployment because

Fujita and Moscarini [2015] present evidence that many workers on layo are recalled by the employers from

whom they were initially separated. Only 13% of unemployed workers in my sample were on layo; excluding

them slightly diminishes the drop in job-nding rates at low durations. The candidate values of ω1 − ω0

consist of an equally-spaced grid of 51 points between zero and 12 ; the candidate values of αF consist of an

equally-spaced grid of 10 points between 1.5 and 8.5. For each pair (ω1 − ω0, αF ), I nd a value for ζ that

results in a 6% steady-state unemployment rate.29 This procedure points to parameter values of ω0 = .40,

ω1 = .60, αF = 4.0, and ζ = .2612.

Figure A.2 shows the duration-specic job-nding probabilities implied by the model, along with the

duration-specic job-nding probabilities from the CPS data. Overall, the t is good. It's worth noting

that this procedure does not place any ex ante restrictions on the shapes of individuals' hazard curves.

Recall from Section 1.2 that most existing theories imply declining hazards, but Gonzalez and Shi [2010] nd

28More precisely, the criterion being minimized takes the form (m−m0)′W (m−m0). Here, m is a 52× 1 vector, the dth

element of which is the fraction of workers reporting an unemployment duration of d weeks in their rst CPS survey and whoreport being employed in their second CPS survey. When computing these fractions, workers are weighted using the CPS nalweights. The vector m0 is the probability of a worker with an unemployment duration of d weeks in the model nding workin the subsequent four weeks. The weighting matrix W is diagonal, the dth element of which is proportional to the numberof unemployed workers in the CPS who report an unemployment duration of d weeks. Details on the data are available inAppendix D.

29Numerically, it appears that the steady-state unemployment rate is monotonic in ζ, so nding a value that satised thiscondition was accomplished using a bisection method.

24

increasing hazards. Negative duration dependence is not hardwired into the present model. Likewise, this

approach allows either true duration dependence or dynamic selection eects to drive the negative correlation

between search durations and job-nding rates. The results in the following section demonstrate that all

of these forces are at work. The large majority of job-seekers have declining hazards, but a minority has

increasing hazards. True duration dependence and dynamic selection eects are both important, but much

of heterogeneity in job-nding rates is a consequence of informational concerns.

5 Results

The market tightness function θ (r) summarizes the demand for workers, given their expected skill level,

and determines job-nding rates for the unemployed. Figure A.3 shows market tightness as a function

of a worker's résumé. Strikingly, market tightness is not monotonically related to r. In the absence of

informational concerns, one would expect the highest-skill workers to be in the highest demand. Indeed, the

markets r = 0 and r = 1 have full information, and θ (1) is 17% greater than θ (0). However, the dierence

in market tightness between the two markets with full information is dwarfed by the the range of tightnesses

for markets with incomplete information. Market tightness is highest for workers with résumé .80, and

this market is 5.4 times tighter than the market for workers who are highly skilled with certainty. Because

Bu (r) < r, a worker's résumé unambiguously declines during unemployment, but this does not imply that

a worker's job-nding probability declines as well. If a worker begins her search in a market r close to one,

then she will actually move into tighter markets as she spends time in unemployment. Eventually, though,

her résumé will deteriorate until she is to the left of the peak in Figure A.3; subsequently, her job-nding

rate drops with each additional period of joblessness.

Figure A.4 shows the résumé distribution for all workers, employed workers, unemployed workers, and

new entrants into the labor force. The employed tend to have better résumés than the unemployed, reecting

the fact that high-skill workers are more likely to form and maintain matches. Figure A.4 also shows the

eects of learning: Relative to the résumé distribution of new entrants, the population distribution of résumés

has a lot of mass close to r = 0 and r = 1. Experienced workers have less uncertainty about their types,

because their career outcomes send signals to the market about their underlying abilities.

In Section 5.1, I will show that the non-monotonicity of the market-tightness function is related to the

value of information: Workers of uncertain quality are willing to take pay cuts to improve their résumés,

whereas workers who are known to have high skills need to be paid high wages. Recruiting workers with

mid-level résumés may result in lower expected productivity, but higher expected prots. Then, in Section

5.2, I will explore the model's quantitative predictions for the job-nding hazard in more detail.

25

5.1 Wages and the Value of Information

To understand the economic forces behind job creation, it is essential to understand how the value of

information is priced into wages. Holding down a job and staying out of unemployment improve a worker's

résumé. Therefore, the equilibrium wage should reect the amount that a rm pays for a worker's services,

minus the amount that the worker pays for the résumé value of being employed. The following decomposition

makes this point clear.

Proposition 1. In a recursive search equilibrium, wages are given by the following function of the worker's

résumé and productivity:

ws (r, x) =

Static Nash︷︸︸︷ηzx+ (1− η)λ+

Search Wedge︷︸︸︷ηκθ (rc) −

Information Wedge︷︸︸︷(1− η)βE [U (r′e)− U (r′u) | r, x], (5.1)

where the expectation is taken over x′, and where I have dened:

rc ≡

Bu(bh (r | x)

)if s = h

r if s = e

r′e ≡ Be (r | x, x′)

r′u ≡ Bu (rc) .

Proof. See Appendix C.

We see that wages depend on three terms. The rst is ηzx + (1− η)λ, a convex combination of the

worker's output while employed and her ow value of leisure while unemployed. This expression is equal to

the wage that would arise in a static model of Nash bargaining. The second term is ηκθ (rc), which I will call

the search wedge. This expression represents the worker's option value of search; when markets are tight,

the worker can leverage the relative ease with which she can nd a new job. Recall from Section 2.7 that rc

is the counterfactual résumé a worker would have if she were not employed. In a textbook model of search

and bargaining, such as Pissarides [2000], wages are equal to the static Nash outcome, plus a search wedge.

In this environment, though, the search wedge depends on θ (rc), the tightness of the market in which the

worker would be searching if she were not employed in her current match.

The nal term in equation (5.1) is − (1− η)βE [U (r′e)− U (r′u) | r, x]. I will call this term the information

wedge, because it represents the résumé value of one additional period of employment to the worker. To

make this interpretation clear, consider a worker with résumé r who has the opportunity to work today, but

will enter unemployment tomorrow. If she is employed today, then her résumé is updated to r′e tomorrow,

26

and her value of searching will be U (r′e). If she is unemployed today, then she will have counterfactual

résumé rc, and tomorrow she will be be unemployed with résumé r′u = Bu (rc). In that case, her value of

searching will be U (r′u). By reaching an agreement for one period of employment, the worker improves her

future search prospects by U (r′e)− U (r′u). The information wedge equals the expected discounted value of

U (r′e) − U (r′u), scaled by the rm's surplus share (1− η). Hiring someone makes that person look good

in the eyes of future employers, so a rm extracts some of this value from the worker in the form of lower

wages. In other words, wages include a compensating dierential for the résumé value of holding a job and

staying out of unemployment.

Figure A.5 shows the wages of newly hired workers, and Figure A.6 shows the decomposition of these

wages. Even though all workers have the same output in their rst period on the job, wages can vary

substantially, depending on a worker's résumé. Many people will receive negative wages, with the lowest

pay going to workers who are probably highly skilled, yet who have some uncertainty surrounding their

true types. The reason for this result is that these workers have the information wedges with the largest

magnitudes. For workers whose types are known, i.e. those with résumés r ∈ 0, 1, a job is just a source of

consumption; there's no information value from employment. Notice that the information wedges for these

workers is zero: U (r′e) = U (r′u) for people with r ∈ 0, 1. Unsurprisingly, high-skill workers (r = 1) get

paid more than low-skill workers (r = 0) because they have better outside options, reected by larger search

wedges. However, the dierence in wages between the two types with full information is negligible compared

to the range of wages received by newly hired workers with r ∈ (0, 1). Figure A.7 shows U (r), the value of

unemployment, and Figure A.8 shows rc = Bu(bh (r | x)

), the counterfactual résumé a newly hired worker

would have if she were not employed, as a function of the worker's current résumé. The workers with the

largest information wedges are those for whom being hired sends the greatest signal (r− rc is large), and for

whom the signal is of the greatest value (the slope of U (r) is large).

Figures A.9 and A.10 show the wages and wedges of workers in continuing employment relationships.30

Wages are only slightly increasing in the worker's résumé, but earnings scale up almost linearly with produc-

tivity. If it were possible to regress w on x and r, one might conclude productivity is the main determinant

of wages, and informational concerns play only a minor role. Decomposing wages according to equation (5.1)

suggests otherwise. Figure A.10 demonstrates that the search and information wedges are reasonably large,

but for continuing employees, these terms have comparable magnitudes and opposing signs. In other words,

30Figure A.10 show only a single line for the information wedge, even though equation (5.1) suggests that the informationwedge is a function of current-period productivity x. In fact, under the chosen specication for Ω (x, x′ | i), the informationwedge is identical for all workers whose productivity is in the interior of X : Everyone has the same probability of a one-incrementproductivity increase and a one-increment productivity decrease, so the distribution over r′e does not depend on x. There arerelatively few workers with the maximal productivity level xnx , and their information wedges are quantitatively very similar.In equilibrium, there are no workers with the minimal positive productivity x1 because there is no match surplus.

27

there are two strong eects that come close to canceling out: The workers who get the most résumé value

from employment are also those who would have the best job-nding prospects if they were to leave their

current employers. The information wedge is smaller for workers in continuing employment relationships,

relative to new matches. All employees enjoy the résumé value of holding down a job, but new employees

also avoid the résumé damage from unemployment.

The behavior of wages sheds some light on the shape of the market-tightness function, which is the key

to determining the job-nding hazard. For workers in continuing matches, the search wedge is proportional

to market tightness, and it is nearly the mirror image of the information wedge. The tightest markets are

the therefore the ones that provide the greatest résumé value to workers, not the ones that result in the

highest expected output. This fact explains why high-résumé workers have upward-sloping hazard curves:

As these people spend more time in unemployment, they are eectively willing to take a larger pay cut in

order to halt the résumé damage from joblessness. Workers with good, but not pristine, résumés are in the

highest demand because these matches are likely to be fruitful, and rms can hire labor at a relative bargain.

In a full-information environment, the only thing an employer can oer a worker is some fraction of the

goods she is producing. When there is uncertainty about workers' skills, rms can use the value of résumé

improvement to compensate workers without giving up output.

Some of the model's predictions for wages are quantitatively drastic, but the qualitative predictions are

clear and logical. The most extreme outcome is that many workers literally pay for the opportunity to work

in their rst week on the job, and in subsequent periods, wages come close to tracking productivity. On a

technical level, this lopsided stream of earnings over the life of a job is a symptom of period-by-period surplus

splitting with linear utility, which implies that workers are not bothered by sudden swings in consumption.31

Nevertheless, real-world wage-tenure proles are often increasing and concave. Theorists have proposed a

number of mechanisms to explain this fact, such as workers' ability to acquire skills or rms' attempts to

retain workers in the face of outside oers. The present model provides an alternative explanation, based on

the informational rents that rms extract from new employees. Workers want to show future employers that

they were worth hiring in their current job. The résumé value of being hired, for which workers are willing

to take a pay cut, is greater than the résumé value of renewing an existing match. Entry-level internships

t comfortably into this theory: They often have short durations and low pay, yet workers accept them to

bolster their résumés. Finally, for the purposes of explaining market tightness, what matters is the present

value of expected prots over the full lifetime of the match, not the timing of prots within the match. When

wages are determined period-by-period, the résumé-improvement value that rms extract from workers is

31See Burdett and Coles [2003] and Stevens [2004] for illustrations of how the curvature of the utility function aects wage-tenure proles.

28

concentrated in the rst period of the match. Even if payments were smoothed out, workers with uncertain

skill levels would be willing to accept a contract with a lower expected present value of wages; the markets

for such workers would still be tight relative to the markets for workers with known skill levels.

5.2 Unemployment Durations and Job-Finding Rates

Individual Hazards

Let's investigate the shapes of individuals' true job-nding hazards. Consider a worker who enters unem-

ployment with résumé r0. After t periods of search, she will have résumé (Bu)t(r0), where (Bu)

t(·) denotes

the function Bu (·) composed with itself t times. Consequently, her probability of being seen and screened

by a rm, as a function of unemployment duration, is p(

(Bu)t(r0)

). Thus, conditional on her true skill

level, her job-nding hazard is ωip(

(Bu)t(r0)

). This representation resembles a proportional hazard model:

The probability of nding a job equals a function of duration p(

(Bu)t(r0)

), times a scaling factor ωi that

depends on an individual worker's underlying type.32 Two things determine how the worker's job-nding

rate evolves over the course of an unemployment spell: (1) how quickly her résumé deteriorates and (2) how

her probability of being seen by a rm changes as a function of her résumé.

To show how information is revealed during unemployment, Figure A.11 displays (Bu)t(r0) for several

initial initial résumés r0. Everyone's résumé declines in unemployment, but the drop is slower for people

with r0 close to zero or one. This pattern comes from two sources. First, workers with résumés close to 12

have the most prior uncertainty about their skill levels, so an additional observation carries more weight.33

Second, the markets for upper-middle-résumé workers are the tightest, and failing to be hired in a tight

market does more damage to a worker's résumé, as seen from equation (2.21). To show how information

aects contact rates, the left panel of Figure A.12 shows p (r), the probability of being seen by a rm as a

function of résumé; the right panel shows ωrp (r), the average job-nding probability of workers searching in

market r. Both functions inherit the hump shape of the market-tightness function θ (r), shown in Figure A.3.

Figure A.13 displays p(

(Bu)t(r0)

)for various initial résumés r0. An individual's job-nding probability

is proportional (by a factor of ωi) to p(

(Bu)t(r0)

), so this gure illustrates the nature of true duration

dependence.

Evidently, dierent workers can have hazard curves with totally dierent shapes. Microeconomic studies

often try to establish whether true duration dependence is positive or negative, but in the present model, the

worker's initial résumé inuences both the level and the slope of the hazard. Of newly unemployed workers,

32In Section 7, I'll discuss the theory's implications for the econometric implementation of proportional hazard models.33Conditional on having résumé r, a worker's skill level follows a Bernoulli distribution with parameter r; the variance of this

distribution is (1− r) r, which is maximized at r = 12.

29

13% will exhibit positive duration dependence, in the sense that their job-nding probability will increase if

they spend a second period in unemployment. Nevertheless, because the overwhelming majority of workers

exhibit negative duration dependence, the model is broadly consistent with the experimental studies cited

in the Introduction. Amongst the workers with downward-sloping hazards, those with higher résumés will

experience a faster drop in their job-nding probabilities. The average worker entering unemployment has

a résumé of .37, and such a worker will see her job-nding probability drop by 29% after 13 weeks, by 40%

after 26 weeks, and by 46% after 52 weeks.

Aggregate Hazards

A long-standing problem in the literature is how to account for the negative correlation between job-nding

rates and unemployment durations that appears in aggregate data: Quantitatively, how much of this cor-

relation is due to true duration dependence, and how much to unobserved heterogeneity? In the calibrated

economy, both mechanisms are important. As one measure of true duration dependence, Jarosch and Pi-

lossoph [2016] propose looking at the average change in job-nding rates experienced by workers who have

been unemployed for t periods.34 Figure A.14 shows that this metric of true duration dependence exhibits

a 20% drop o over the course of a year of search. By contrast, Jarosch and Pilossoph [2016] nd that true

duration dependence in their model accounts for virtually no change in the job-nding rate.

We can perform other experiments to isolate the eects heterogeneity and true duration dependence.

Figure A.15 contrasts three curves. The rst is the average job-nding rate in the unemployed population

as a function of duration. The second is the population job-nding rate that we would observe if workers

experienced duration dependence, but the skill composition of workers remained xed.35 The drop in the

job-nding rate coming from true duration dependence is appreciable: Holding the composition of workers

xed, the average job-nding rate drops by 20% after 13 weeks, by 34% after 26 weeks, and by 43% after 52

weeks. The third curve in Figure A.15 is the population job-nding rate that we would observe if workers

had heterogeneous but time-invariant job-nding rates; the distribution over job-nding rates is taken to

be the distribution implied by the model for workers entering unemployment.36 Despite the importance of

34That is, for each duration t, compute:

100×ˆ

p (r)

p((bu)t (r)

)dFut (r) ,

where bu (·) is the inverse of Bu (·), (bu)t (·) is bu (·) composed with itself t times, and Fut (·) is the résumé distribution amongst

workers who have been unemployed for t consecutive periods. Hence, a worker who has a job-nding rate of ωip (r) in her tth

period of search had a job-nding rate of ωip((bu)t (r)

)at the outset of her unemployment spell.

35This is the average job-nding rate as a function of duration, where the average is taken over skills and initial résumés,with respect to the distribution of workers entering unemployment:

Hazard Due to True Duration Dependence =

ˆ[rω1 + (1− r)ω0] p

((Bu)t (r)

)dFu

0 (r) ,

where Fu0 (·) is the distribution of résumés amongst workers entering unemployment.

36To be precise, let Fu0 (·) denote the distribution over résumés amongst workers entering unemployment. A measure rdFu

0 (r)

30

true duration dependence, the amount of heterogeneity in the model is also capable of generating a strong

negative correlation between unemployment duration and the probability of nding a job.

More importantly, the heterogeneity amongst job seekers is not purely mechanical; dierences in job-

nding rates across workers are endogenously determined economic outcomes. As evidenced by Figure A.12,

the dierence in job-nding rates between workers with known skills (i.e. r ∈ 0, 1) is small relative to the

dierence in job-nding rates between workers with unknown skills (i.e. r ∈ (0, 1)). Incomplete information

provides an economic mechanism for the relationship between search durations and job-nding rates, and

this mechanism operates through the channels of both heterogeneity and true duration dependence. This

point is further illustrated in the next section, which examines an economy with full information.

5.3 A Full-Information Benchmark

To appreciate the role of incomplete information, it's instructive to compare the economy of Section 2 to

a benchmark environment where workers' skill levels are known, but the expected distribution over labor

productivity is unchanged. Suppose that a worker born with résumé r draws her productivity from the

distributions Ω (x | r) and Ω (x, x′ | r) for her entire lifetime, with certainty. Instead of representing a worker's

expected skill, the résumé in this context represents the worker's actual, observable skill level. As before,

there is a continuum of segmented search markets, indexed by r ∈ [0, 1], and a rm posting a vacancy chooses

to solicit applications from workers with a particular résumé r. Agents' Bellman equations are the same as

before, except that the belief-updating functions are replaced by identity functions. Such an environment

is like having a continuum of Mortensen-Pissarides economies, each with its own idiosyncratic productivity

process, indexed by r.

This full-information benchmark diers from the model of Section 2 in two important respects. First,

job-nding rates are not subject to duration dependence. A worker born with résumé r will have résumé r

forever; consequently, a worker who begins searching in market r will continue searching in market r for the

duration of her spell. Second, there is no résumé value from employment. In equation (5.1), the information

wedge disappears because r′e = r′u = r. Employers can no longer compensate their workers with the résumé

value of holding a job, so the rm's cost of labor goes up. As a result, market tightness declines. Figure A.16

compares the job-nding rate ωrp (r) in the full- and incomplete-information economies. By construction,

these functions align at r = 0 and r = 1, but for all r ∈ (0, 1), markets are tighter when information is

of newly unemployed workers has job-nding rate ω1p (r), and a measure (1− r) dFu0 (r) has job-nding rate ω0p (r). If everyone

had a constant probability of nding a job, then the fraction of workers with job-nding rate ωip (r) remaining in unemploymentafter t periods is [1− ωip (r)]t. The average hazard curve, holding all individual job-nding rates constant, is therefore:

Hazard Due to Heterogeneity =

´ [[1− ω1p (r)]t ω1p (r) r + [1− ω0p (r)]t ω0p (r) (1− r)

]dFu

0 (r)´ [[1− ω1p (r)]t r + [1− ω0p (r)]t (1− r)

]dFu

0 (r).

31

incomplete. Furthermore, when types are known, market tightness is monotonically related to skill level;

in contrast, the market-tightness function has a pronounced hump shape when types are unknown. Besides

causing genuine duration dependence, information frictions also widen the range of job-nding rates amongst

the unemployed. Figure A.17 compares the average aggregate job-nding rates, as a function of duration,

between the baseline model and the full-information benchmark. The full-information economy falls far short

of replicating the negative relationship observed in the data. In part, this is because individual workers have

constant hazards, but this result also comes from the compression of job-nding rates across workers with

dierent résumés. Although the results in Section 5.2 suggest that dynamic selection eects are important,

heterogeneity should not be interpreted as an alternative to informational dynamics when accounting for

the aggregate relationship between job-nding rates and search durations. Instead, the model suggests that

informational dynamics contribute to heterogeneity in job-nding rates.

5.4 Skill Decay and Alternative Sources of Duration Dependence

Thus far, informational dynamics have been the only source of duration dependence that I have considered,

but other explanations are worth exploring. Amongst the competing hypotheses, skill decay during unem-

ployment is the leading alternative. The basic idea is that low-skill workers are in lower demand, so workers

will have declining job-nding rates as they lose human capital during unemployment. Naïvely, one might

think that résumé damage and skill decay can both generate negative duration dependence, so a model with

both forces would have a steeper hazard curve than a model with only one. In the environment I have

constructed, however, skill loss during unemployment can actually attenuate duration dependence. If skills

evolve over time, then a signal about a worker's current type becomes less valuable. Consequently, the value

of creating a match may deteriorate less over the course of an unemployment spell, even though someone's

résumé may deteriorate more.

The model allows us to explore these forces in more detail. The framework of Section 2 can be extended

to allow workers' skills to change, depending on their employment status. Suppose that, conditional on

being unemployed in period t, high-skill workers have probability τ of becoming low-skill workers in period

t+ 1. Likewise, conditional on being employed in period t, low-skill workers have probability γ of becoming

high-skill workers in period t+ 1. The worker's actual skill level remains unobservable. Mathematically, the

32

Table 1: Parametrizing Skill ChangeParametrization Prob. of Skill Gain On the Job (γ) Prob. of Skill Loss O the Job (τ)

Baseline 0 0SC1 0 1

26

SC2 12×52

126

SC3 126

126

only things that change are the belief-updating functions, which become:

Be (r | x, x′) = γ + (1− γ)Ω (x, x′ | 1)

Ω (x, x′ | r)r (5.2)

Bh (r | x) = (1− τ)Ω (x | 1)

Ω (x | r)r (5.3)

Bu (r) = (1− τ)1− p (r)

∑x∈Xh(r) Ω (x | 1)

1− p (r)∑x∈Xh(r) Ω (x | r)

r. (5.4)

Table 1 shows the parametrizations for γ and τ that I will consider. All other parameters are held constant.

As seen in Figure A.1, the average job-nding rate, as a function of duration, appears to level o after about

six months. With this in mind, I set τ = 126 , implying that it takes an average time of six months for someone

to experience a drop in skills. Parametrization SC1 species that skills can be lost during unemployment,

but not cannot be regained on the job. Parametrizations SC2 and SC3 allow for skill upgrading during

employment; on average, an improvement takes two years under SC2 and six months under SC3. As a

point of reference, Ljungqvist and Sargent [1998] specify that the average rate of human-capital depreciation

during unemployment is twice as large as the rate of human-capital accumulation during employment.37

For each parametrization, Figure A.18 shows the average job-nding rate as a function of résumé. The

probability of being hired is less sensitive to changes in a worker's résumé. Also, for most markets, job-nding

rates are lower in the economies with skill change. The drops in most workers' hazard curves are more acute

in the economy that has résumé damage as the only source of duration dependence. On the aggregate level,

the average job-nding rate declines more rapidly as a function of duration under the baseline calibration,

relative to SC1-SC3.

The results suggest that, when types are permanent and unobservable, dierences in workers' job-nding

rates have more to do with dierences in how workers value information, as opposed to dierences in expected

productivity. In particular, under the baseline specication, rms are most eager to hire the workers who

are most willing to give up wages in order to improve their résumés. Adding stochastic skill change to the

model diminishes the marginal value of résumé improvement. Recall that the information wedge in workers'

37In Ljungqvist and Sargent's model, human capital is synonymous with productivity. In my model, skill is a distributionover productivities, so the parameters of their model cannot be compared directly to mine.

33

wages is proportional to the expected value of U (r′e)−U (r′u), which captures how the résumé improvement

from being employed today increases the value of search tomorrow. The slope of U (r) therefore provides

an indication of how much a worker values the marginal résumé improvement she gets from holding a job.

Figure A.19 shows U (r), the value of being unemployed with résumé r, under each parametrization. Notice

that U (r) is steeper under the baseline calibration than under SC1-SC3. If workers can lose their skills in

unemployment, there's less benet to being a high-skill worker; if workers can gain skills on the job, there's

less harm in being a low-skill worker. In either case, the possibility of skill change causes U (r) to atten

out, implying that information about a worker's quality has less eect on her search prospects.

There are other possible explanations for duration dependence, besides informational dynamics and skill

decay. It's possible that workers become discouraged and expend less eort looking for jobs. One could

extend the environment of Section 2 to allow for variable search intensity, modeled as in Chapter 5 of

Pissarides [2000]. In general, though, the optimal search intensity selected by a worker will be constant

within an unemployment spell unless some other feature of the environment is time-varying. Consequently,

this notion of search intensity can amplify, but not cause, duration dependence. Yet another possibility is

that employment transitions are determined by stock-ow matching. Under this conception of the matching

process, a newly unemployed worker surveys the stock of all existing vacancies to see if there is a suitable

match. If such a job is available, the worker is hired; if not, the worker surveys the ow of new vacancies

as they are posted by rms. In that case, duration dependence occurs by assumption, as a result of the

matching technology. Also, this explanation only applies to the drop in job-nding rates between the rst

and second periods of search. A promising approach may be to merge stock-ow matching with one of the

mechanisms discussed above. Doing so, however, is not a straightforward extension, so I defer the problem

to future research.

6 Sensitivity Analysis

To assess the sensitivity of the results, I will experiment with some alternative parameter values. In the

model, information about workers is revealed both on and o the job. Although the model's main applica-

tion is analyzing learning during unemployment, the quantitative results depend in part on the on-the-job

productivity dierences between employed workers, captured by Ω (x, x′ | 0) and Ω (x, x′ | 1). Although the

dierence between Ω (x, x′ | 0) and Ω (x, x′ | 1) plays a non-trivial role, the results are most sensitive to ω0

and ω1, which determine the fraction of jobs each type of worker is capable of doing and, by extension, the

probability of being hired conditional on being seen by a rm. To illustrate this point, I solve the model with

Ω (x, x′ | 0) = Ω (x, x′ | 1). I set the conditional mean of log(x′

x

), given by equation (4.3), to .15

52×10 , and I set

34

δ0 = δ1 = 152×2.50 . All other parameters remain the same as before. Under this alternative parametrization,

high- and low-skill workers are equally productive on a job, conditional on being able to do that job at all.

The market-tightness function (not shown) remains hump-shaped, though the maximal value of θ (r) is lower

and occurs when r = .69. Consequently, a larger fraction of newly unemployed workers, 32%, experience

positive duration dependence. A worker entering unemployment with the average résumé will see her job-

nding probability drop by 15% after 13 weeks, by 23% after 26 weeks, and by 29% after 52 weeks. This

decline is not as pronounced as the one described in Section 5.2, but it is nevertheless substantial, especially

given that the two skill levels are assumed to be equally productive once a match has formed.

The main parameters that control the informational content of unemployment are ω1 and ω0, which

are the respective probabilities of high- and low-skill workers being able to do a given job. If ω1 and ω0

were the same, then the probability of making a match, conditional on being seen by a rm, would not

be correlated with on-the-job productivity; in that case, no learning about worker quality would take place

during unemployment. Conversely, if ω1 − ω0 increases, the job-nding probability for high-skill workers

goes up relative to low-skill workers with the same résumé, so an additional period of unemployment makes

it seem even more likely that a worker is unskilled. To demonstrate the importance of these parameters, I

re-solve the model with dierent values of ω1 − ω0, keepingω1+ω0

2 xed. Figure A.20 plots the average job-

nding rate ωrp (r), as a function of a worker's résumé. As ω1−ω0 goes up, the dierence in job-nding rates

between high- and low-skill workers (i.e. the dierence between ω1p (1) and ω0p (0)) becomes larger. More

importantly, though, the hump in ωrp (r) gets more pronounced. For each of these parameter congurations,

Figure A.21 shows the average job-nding rate as a function of duration. The aggregate relationship between

the duration of search and the probability of being hired becomes stronger as ω1 − ω0 increases. The values

(ω0, ω1) = (.40, .60) used in the baseline calibration are the ones that come closest to replicating the data.

7 Implications for Empirical Research

7.1 The Precision of Beliefs over Time

Because résumés are updated every period, the variance of posterior beliefs about a worker's skill level will

decline stochastically. To see this, let ri,t denote the résumé of worker i at time t. Beliefs about a worker's

skill level follow a martingale: Et [ri,t+1] = ri,t. The variance of posterior beliefs is given by (1− ri,t) ri,t,

which is a supermartingale:

Et [(1− ri,t+1) ri,t+1] ≤ (1− Et [ri,t+1])Et [ri,t+1] = (1− ri,t) ri,t,

35

where the inequality comes from Jensen's inequality. Consequently, workers with a lot of labor-market

experience will have little uncertainty about their types. If duration dependence is solely a consequence of

learning, then a worker can only experience true duration dependence if there is uncertainty about her skill.

Thus, the model implies that workers who have been in the market the longest are subject to the least amount

of duration dependence in unemployment. This suggests comparing the relationship between unemployment

durations and job-nding rates for workers with dierent amounts of labor-market experience.

To summarize these correlations, consider the following semiparametric regression, applied to workers

who are surveyed by the CPS in consecutive months and who report being unemployed in the rst month:

job foundi = h (durationi) + x′iβ + εi. (7.1)

In the above, durationi is the number of weeks that worker i claims to have been looking for a job in the

rst survey month; h (·) is a smooth function; job foundi is an indicator variable that is equal to one if the

worker reports being employed in the second survey month; xi is a vector of covariates; and εi is a residual.

Data details are in Appendix D. I divide the sample by potential labor-market experience, dened as age

minus years of education, and I t equation (7.1) separately for each quartile of the potential experience

distribution.38

Figure A.22 shows the estimated values of 100× h (d) /h (0) for each quartile of the potential-experience

distribution. Indeed, it appears that the drop in average job-nding rates, as a function of duration, is

strongest for those in the bottom quartile of the potential experience distribution and weakest for those

in the top quartile.39 The decline is essentially the same for the second and third quartiles, but taken

together, those in the middle two quartiles show a steeper decline than the most-experienced workers, and a

more moderate decline than the least-experienced workers. Figure A.23 shows the results when the sample

excludes workers who are seeking part-time employment. In that case, it looks clearer that the average

job-nding rates fall less with duration for the most experienced workers. As a point of comparison, Figure

A.24 shows the how the average job-nding rate changes with duration for dierent experience groups,

using articial data generated by the theoretical model.40 Quantitatively, it's unsurprising that the model-

generated lifecycle data do not align perfectly with the lifecycle data in the CPS; that's because the model

38More specically, I t a local-polynomial regression, with additional linear covariates, following Robinson [1988]. How-ever, when performing the local-polynomial regressions, I employ the estimator described by Breidt and Opsomer [2000] forincorporating survey weights (in this case, the CPS nal weights).

39As an aside, younger workers tend to have higher job-nding rates than older workers. This would just represent avertical shift of each of the population hazards depicted in Figure A.22, which are normalized by the job-nding rates of newlyunemployed workers.

40In the theoretical model, age is geometrically distributed, so the quartiles in Figure A.24 correspond to the quartilesof the potential experience distribution from the CPS. Also, the model-generated job-nding rates shown in Figure A.24 arecomputed directly using p (r), rather than from estimating the semiparametric regression in equation (7.1).

36

has a very primitive treatment of age, with workers facing a constant probability of dying. Qualitatively,

though, the model-generated data do show that the correlation between duration and job-nding rates

declines somewhat with experience, although the magnitude diers from what we observe in the CPS. More

importantly, the prediction that highly experienced workers have at hazards is a general consequence of

Bayesian learning, not a byproduct of the way the theoretical model is calibrated. Of course, the regression

equation (7.1) can only capture correlations, and ideally, one would want to distinguish between true duration

dependence and dynamic selection eects. Nevertheless, these results are at least suggestive of informational

dynamics as a source of duration dependence.

7.2 Specication and Identication of Proportional Hazards Models

The theoretical results place restrictions on the proper use of the mixed proportional hazard model, which

is the most popular tool in the econometric literature. That specication asserts that worker i's probability

of nding a job after t periods of search is vihX (xi)hT (t), where vi is a mean-one idiosyncratic disturbance

and xi is a vector of person-specic predictors. In applications, the main object of interest is hT (t), which is

called the baseline hazard. In light of the economic theory, a proportional hazard model raises two concerns:

specication and identication. For a typical cross-sectional dataset, the proportional hazard model is

misspecied. More subtlety, if a prospective employer and an econometrician have the same information

about workers, then the proportional hazard model is correctly specied but not identied. To obtain

identication of a correctly specied model, it's necessary for an econometrician to have an indicator of

workers' skills that falls outside the information set of rms.

First, consider a random sample of unemployment spells from the model economy. Recall from Section

5.2 that an individual's job-nding hazard is ωip(

(Bu)t(r0)

), where r0 denotes the worker's résumé at the

outset of the spell.41 There is heterogeneity amongst job seekers in both ωi and r0, so idiosyncratic eects

cannot be multiplicatively separated from the time-varying portion of the hazard. Graphically, it's clear

that a proportional hazard model will not be correctly specied for a sample of workers with dierent initial

résumés: Figure A.13 shows the shapes of some individuals' true hazard curves, and they are obviously not

in constant proportion to one another. In fact, this may be one of the reasons for the diversity of results

obtained using proportional hazard models.

Now, suppose that an econometrician could observe unemployment spells for a collection of workers

who entered unemployment with identical résumés. In that case, the job-nding rate can be written as

vihX (xi)hT (t), where vi ≡ ωi/ωr0 , hX (xi) ≡ 1, and hT (t) ≡ ωr0p(

(Bu)t(r0)

). So, for a xed r0, the

41If there were more than two skill levels, the hazard would still assume this form, but r would be vector-valued. The followingarguments will still apply.

37

true data-generating process is consistent with a mixed proportional hazard model. Unfortunately, the

proportional hazard model is not identied with a dataset that has no variation in r0. In the case where the

data contain only one unemployment spell per worker, Elbers and Ridder [1982] prove that identiability

of the proportional hazard model hinges on hX (xi) being a non-constant function and xi having non-zero

variance.42 Even if an econometrician could observe aspects of a worker's publicly observed history to include

in xi, they would be relevant only insofar as they determine r0. In other words, Elbers and Ridder [1982]

show that identiability of the proportional hazard model requires observable dierences in workers with the

same baseline hazard, but the theory of informational dynamics suggests that the only workers who share

the same baseline hazard are observationally identical, in the sense of having the same résumé.

Identication requires the econometrician to have access to some indicator of workers' skills that is

not included in the publicly observable history that is available to prospective employers. For example, a

longitudinal dataset may allow an econometrician to see a worker's wage on her fth job, which would not be

visible during her third unemployment spell; nevertheless, these wages will be correlated with the worker's

skills. Given covariates xi that are outside rms' information set, one could specify hX (xi) ≡ E [ωi | xi, r0];

it would also be necessary to assume that the expectational error in the econometrician's beliefs, given by

vi ≡ ωi/hX (xi), is independent of xi, conditional on r0. In that case, the proportional hazards model

would be both identied and correctly specied, in the sense that it comports with the theoretical general-

equilibrium framework presented here.

8 Conclusion

Summing up, this model generates several novel theoretical results. First, there is a non-monotonic rela-

tionship between expected skill and job-nding rates. The total match surplus depends not only on the

worker's productivity when employed, but also on the résumé damage the worker would have experienced

when unemployed. Consequently, the tightest markets are generally not those for the most productive work-

ers, but for those who most value the résumé improvement from being hired. Second, the model illustrates

how information is priced into wages, which is essential to understanding the patterns in job-nding rates.

The model provides a decomposition of wages that includes a compensating dierential for the résumé value

of employment. Third, human capital decay, which is an alternative explanation for negative duration de-

pendence, can actually attenuate the drop in job-nding rates. When skills change over time, information

about a worker's current skill level is less valuable, thus blunting the above mechanism.

42Observing multiple spells per worker oers little help in this context. Honoré [1993] proves that variation in xi is unnecessaryfor identication when we can observe multiple spells per person, but this result assumes a person has the same vi and hT (t)across spells. Even if a person's skill level remains constant between spells, her résumé almost certainly will not. Consequently,one person's multiple spells are not informative of a single baseline hazard function.

38

Besides the theoretical results, the quantitative results from the calibrated model suggest that informa-

tional concerns can play a large role in shaping job-nding rates. The model can replicate the aggregate

relationship between search durations and job-nding rates reasonably well. For individual workers, hazards

can change substantially over an unemployment spell. Although heterogeneity in job-nding rates can gen-

erate a strong aggregate correlation between search durations and job-nding rates, incomplete information

is responsible for much of the heterogeneity in job-nding rates, in addition to causing true duration depen-

dence for individuals. The overwhelming majority of job-seekers experience negative duration dependence,

which is qualitatively consistent with the micro evidence.

In addition to the implications for empirical research discussed in Section 7, the model also suggests some

directions for future theoretical research. There have been relatively few macroeconomic models of learning

from the duration of unemployment. And, as I have discussed, most of those models do not consider career

concerns, or how the résumé value of employment inuences future unemployment experiences. Whereas

I have embellished the structure of workers' skills and information, other authors have taken a closer look

at the characteristics and behaviors of rms. In particular, Jarosch and Pilossoph [2016] argue that rm

heterogeneity is important, because the rms that do not bother interviewing the long-term unemployed

would not have been compatible with those workers anyway. An environment that incorporates the mech-

anisms in each of our models would provide a fruitful avenue for future research. Another extension would

be an examination of business cycles. Although I have focused on the steady state, the environment I have

developed can accommodate aggregate shocks. This aspect makes the model a fairly general framework for

analyzing information dynamics in frictional labor markets.

References

Daron Acemo§lu. Public policy in a model of long-term unemployment. Economica, 62(246):16178, 1995.

Joseph G. Altonji and Charles R. Pierret. Employer learning and statistical discrimination. The Quarterly

Journal of Economics, 116(1):313350, 2001.

Joseph G. Altonji and Robert A. Shakotko. Do wages rise with job seniority? The Review of Economic

Studies, 54(3):437459, 1987.

Joseph G. Altonji and Nicolas Williams. Do wages rise with job seniority? A reassessment. Industrial and

Labor Relations Review, 58(3):370397, 2005.

Shelly Banjo. Measures aim to end bias against long-term jobless. The Wall Street Journal, February 24, 2012.

URL http://online.wsj.com/news/articles/SB10001424052970204778604577241693309654990.

39

http://online.wsj.com/news/articles/SB10001424052970204778604577241693309654990

Martin Biewen and Susanne Stees. Unemployment persistence: Is there evidence for stigma eects? Eco-

nomics Letters, 106(3):188190, 2010.

F. Jay Breidt and Jean D. Opsomer. Local polynomial regression estimators in survey sampling. Annals of

Statistics, 28(4):10261053, 2000.

Ken Burdett and Melvyn Coles. Equilibrium wage-tenure contracts. Econometrica, 71(5):13771404, 2003.

Chris Elbers and Geert Ridder. True and spurious duration dependence: The identiability of the propor-

tional hazard model. The Review of Economic Studies, 49(3):403409, 1982.

Stefan Eriksson and Dan-Olof Rooth. Do employers use unemployment as a sorting criterion when hiring?

Evidence from a eld experiment. American Economic Review, 104(3):101439, 2014.

Javier Fernandez-Blanco and Edgar Preugschat. On the eects of ranking by unemployment duration. 2015.

Shigeru Fujita and Giuseppe Moscarini. Recall and unemployment. 2015.

Robert Gibbons and Lawrence F. Katz. Layos and lemons. Journal of Labor Economics, 9(4):35180,

October 1991.

Robert Gibbons and Kevin J. Murphy. Optimal incentive contracts in the presence of career concerns:

Theory and evidence. The Journal of Political Economy, 100(3):468505, 1992.

Francisco M. Gonzalez and Shouyong Shi. An equilibrium theory of learning, search, and wages. Economet-

rica, 78(2):509537, 2010.

Robert E. Hall. Employment uctuations with equilibrium wage stickiness. American Economic Review, 95

(1):5065, 2005.

Robert E. Hall and Paul R. Milgrom. The limited inuence of unemployment on the wage bargain. American

Economic Review, 98(4):165374, 2008.

Bengt Holmström. Managerial incentive problems: A dynamic perspective. The Review of Economic Studies,

66(1):169182, 1999.

Bo E. Honoré. Identication results for duration models with multiple spells. The Review of Economic

Studies, 60(1):241246, 1993.

Guido W. Imbens and Lisa M. Lynch. Re-employment probabilities over the business cycle. Portuguese

Economic Journal, 5(2):111134, 2006.

40

Gregor Jarosch and Laura Pilossoph. Statistical discrimination and duration dependence in the job nding

rate. 2016.

Lisa B. Kahn. Asymmetric information between employers. American Economic Journal: Applied Eco-

nomics, 5(4):165205, 2013.

Lisa B. Kahn and Fabian Lange. Employer learning, productivity, and the earnings distribution: Evidence

from performance measures. The Review of Economic Studies, 81(4):15751613, 2014.

Seik Kim and Emiko Usui. Employer learning, job mobility, and wage dynamics. 2013.

Kory Kroft, Fabian Lange, and Matthew J. Notowidigdo. Duration dependence and labor market conditions:

Evidence from a eld experiment. The Quarterly Journal of Economics, 128(3):11231167, 2013.

Marianna Kudlyak, Damba Lkhagvasuren, and Roman Sysuyev. Systematic job search: New evidence from

individual job application data. 2014.

Fabian Lange. The speed of employer learning. Journal of Labor Economics, 25(1):135, 2007.

Lars Ljungqvist and Thomas J. Sargent. The European unemployment dilemma. Journal of Political

Economy, 106(3):514550, 1998.

Ben Lockwood. Information externalities in the labour market and the duration of unemployment. The

Review of Economic Studies, 58(4):733753, 1991.

Stephen Machin and Alan Manning. The causes and consequences of longterm unemployment in Europe.

Handbook of Labor Economics, 3:30853139, 1999.

Costas Meghir and Luigi Pistaferri. Income variance dynamics and heterogeneity. Econometrica, 72(1):132,

2004.

Guido Menzio and Shouyong Shi. Block recursive equilibria for stochastic models of search on the job.

Journal of Economic Theory, 145(4):14531494, 2010.

Guido Menzio and Shouyong Shi. Ecient search on the job and the business cycle. Journal of Political

Economy, 119(3), 2011.

Iacopo Morchio. Work histories and lifetime unemployment. 2016.

Dale T. Mortensen and Christopher A. Pissarides. Job creation and job destruction in the theory of unem-

ployment. The Review of Economic Studies, 61(3):397415, 1994.

41

John F. Nash, Jr. The bargaining problem. Econometrica, 18(2):155162, 1950.

Felix Oberholzer-Gee. Nonemployment stigma as rational herding: A eld experiment. Journal of Economic

Behavior & Organization, 65(1):3040, 2008.

Christopher A. Pissarides. Search unemployment with on-the-job search. The Review of Economic Studies,

61(3):457475, 1994.

Christopher A. Pissarides. Equilibrium Unemployment Theory. The MIT Press, 2000.

Catherine Rampell. The help-wanted sign comes with a frustrating asterisk. The New

York Times, July 26, 2011. URL http://www.nytimes.com/2011/07/26/business/

help-wanted-ads-exclude-the-long-term-jobless.html.

Peter M. Robinson. Root-n-consistent semiparametric regression. Econometrica, 56(4):931954, 1988.

Ariel Rubinstein. Perfect equilibrium in a bargaining model. Econometrica, 50(1):97109, 1982.

Uta Schönberg. Testing for asymmetric employer learning. Journal of Labor Economics, 25(4):651691,

2007.

Shouyong Shi. Directed search for equilibrium wage-tenure contracts. Econometrica, 77(2):561584, 2009.

Robert Shimer. The cyclical behavior of equilibrium unemployment and vacancies. American Economic

Review, 95(1):2549, 2005.

Robert Shimer. On-the-job search and strategic bargaining. European Economic Review, 50(4):811830,

2006.

Robert Shimer. The probability of nding a job. American Economic Review, 98(2):26873, 2008.

Margaret Stevens. Wage-tenure contracts in a frictional labour market: Firms' strategies for recruitment

and retention. The Review of Economic Studies, 71(2):535551, 2004.

Robert Topel. Specic capital, mobility, and wages: Wages rise with job seniority. Journal of Political

Economy, 99(1):14576, 1991.

Gerard J. Van den Berg. Duration models: Specication, identication and multiple durations. Handbook

of Econometrics, 5:33813460, 2001.

42

http://www.nytimes.com/2011/07/26/business/help-wanted-ads-exclude-the-long-term-jobless.html

http://www.nytimes.com/2011/07/26/business/help-wanted-ads-exclude-the-long-term-jobless.html

A Figures

Figure A.1: Job-Finding Rates as a Function of Duration

0 10 20 30 40 500.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

0.6

0.65

0.7

Duration (Weeks)

P[ J

ob F

ound

Nex

t Mon

th ]

Each x in the scatter plot represents the fraction of unemployed workers of a given duration who found a jobby the time they were surveyed in the subsequent sample month. The number of workers who found a joband the number of workers with a given unemployment duration are weighted with CPS nal weights. Thesize of each x is proportional to the weighted number of workers who report that unemployment duration.The solid line is a local-polynomial kernel regression estimate. Data details are in Appendix D.

43

Figure A.2: Fitting the Population Hazard

0 10 20 30 40 500.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

0.6

0.65

0.7

Duration

One

−M

onth

Job

−F

indi

ng P

rob.

The scatter plot is the same as the one in Figure A.1, except that the above excludes workers on layo. Thesolid line is the relationship implied by the model.

Figure A.3: Market Tightness

0 0.2 0.4 0.6 0.8 11

2

3

4

5

6

7

8

r

θ(r)

44

Figure A.4: The Résumé Distribution

Résumé0 0.2 0.4 0.6 0.8 1

Mea

sure

0

2

4

6

8

10

12

EntrantsPopulationEmployedUnemployed

Figure A.5: Wages of New Hires

0 0.2 0.4 0.6 0.8 1−3

−2.5

−2

−1.5

−1

−0.5

0

0.5

1

1.5

r

wh(r

|1)

45

Figure A.6: Wedges in the Wage Equation for New Hires

0 0.2 0.4 0.6 0.8 1−6

−5

−4

−3

−2

−1

0

1

2

r

Wed

ge

Search WedgeInformation Wedge

Figure A.7: Value of Unemployment: U (r)

0 0.2 0.4 0.6 0.8 11

1.02

1.04

1.06

1.08

1.1

1.12

r

( 1

− β

) U

(r)

46

Figure A.8: Signal from Hiring: Counterfactual Résumés rc

0 0.2 0.4 0.6 0.8 10

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

r

rc

Bu(bh(r))

45o

Figure A.9: Wages of Continuing Employees

00.5

1

01230.8

1

1.2

1.4

1.6

1.8

2

rx

we(r

,x)

47

Figure A.10: Wedges in the Wage Equation for Continuing Employees

0 0.2 0.4 0.6 0.8 1−1.5

−1

−0.5

0

0.5

1

1.5

2

r

Wed

ge

Search WedgeInformation Wedge

Figure A.11: Résumé Updating During Unemployment

0 20 40 60 00.5

10

0.2

0.4

0.6

0.8

1

Initial RésuméUnemp. Duration (Wks)

Upd

ated

Rés

umé

48

Figure A.12: Meeting and Matching Probabilities as a Function of Résumé

0 0.5 10.25

0.3

0.35

0.4

0.45

0.5

0.55

0.6

0.65

0.7

0.75

r

p(r)

Meeting Rate

0 0.5 10.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

r

ωrp(

r)

Matching Rate

Figure A.13: Contact Rates as a Function of Unemployment Duration

020

4060 0

0.510.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Initial RésuméUnemp. Duration (Wks)

Pro

babi

lity:

p(r

)

49

Figure A.14: Average Drop in Individual Hazards, by Duration

0 10 20 30 40 50

Unemp. Duration (Wks)

75

80

85

90

95

100R

elat

ive

Job-

Fin

ding

Rat

e

Figure A.15: The Role of Heterogeneity in the Population Hazard

10 20 30 40 50 600.1

0.15

0.2

0.25

Unemp. Duration (Wks)

Job−

Fin

ding

Rat

e (W

kly)

Average HazardHazard due to HeterogeneityHazard due to Stigma

50

Figure A.16: Job-Finding Rates in the Full-Information Economy

0 0.2 0.4 0.6 0.8 10.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

r

ωr p

(r)

Incomplete Info. EconomyFull Info. Economy

Figure A.17: Aggregate Average Job-Finding Rates in the Full-Information Economy

0 10 20 30 40 50 600.1

0.15

0.2

0.25

Duration (Weeks)

Job−

Fin

ding

Pro

b. (

Wkl

y.)

Incomplete Info. EconomyFull Info. Economy

51

Figure A.18: Skill Change and the Job-Finding Rate ωrp (r)

0 0.2 0.4 0.6 0.8 10.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

Résumé

Avg

. Job

−F

indi

ng P

rob.

ωr p

(r)

BaselineSC1SC2SC3

Figure A.19: Skill Change and the Value of Unemployment U (r)

0 0.2 0.4 0.6 0.8 11

1.02

1.04

1.06

1.08

1.1

1.12

Résumé

( 1

− β

) U

(r)

BaselineSC1SC2SC3

52

Figure A.20: Job-Finding Rates by Résumé, Alternative ω1 − ω0

00.5

1

00.1

0.20.3

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

Résuméω1 − ω

0

ωr p

(r)

Figure A.21: Average Aggregate Job-Finding Rates by Duration, Alternative ω1 − ω0

020

4060 0

0.10.2

0.30.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

ω1 − ω

0Duration

Pro

babi

lity

53

Figure A.22: Job-Finding Rates by Duration and Potential Experience

Duration (Weeks)0 10 20 30 40 50

Nor

mal

ized

Job

-Fin

ding

Rat

e

40

50

60

70

80

90

100

1st Quartile of Potential Experience2nd Quartile of Potential Experience3rd Quartile of Potential Experience4th Quartile of Potential Experience

Figure A.23: Job-Finding Rates by Duration and Potential Experience, Excluding Part Time

Duration (Weeks)0 10 20 30 40 50

Nor

mal

ized

Job

-Fin

ding

Rat

e

50

60

70

80

90

100


54

Figure A.24: Job-Finding Rates by Duration and Potential Experience, Model-Generated Data

Unemp. Duration (Weeks)0 5 10 15 20 25 30 35 40 45 50

Nor

mal

ized

Job

-Fin

ding

Pro

b.

65

70

75

80

85

90

95

100


B Computational Outline

I will begin by combining expressions to arrive at a concise system of functional equations that must be

satised in equilibrium. These equilibrium functions must be the xed point of an operator, which I will

dene below, so the computational strategy will be to iterate on this operator. I then provide some details

about how this iterative procedure is executed in practice.

Dene:

Cs (r, x) ≡ Gs (r, x) +Hs (r, x) . (B.1)

It will be convenient to evaluate the surplus-splitting condition (2.18) at (Be (r | x, x′) , x′, e) and(Bh (r | x) , x′, h

).

Doing so, and using the denition of Cs (r, x), yields:

Ge (Be (r | x, x′) , x′) = (1− η) [Ce (Be (r | x, x′) , x′)− U (Be (r | x, x′))] (B.2)

Gh(Bh (r | x) , x′

)= (1− η)

[Ch(Bh (r | x) , x′

)− U (Bu (r))

]. (B.3)

55

Combining the above with (2.10) and (2.14) gives us:

Cs (r, x) = zx+ β∑x′

Ω (x, x′ | r) max Ce (Be (r | x, x′) , x′) , U (Be (r | x, x′)) . (B.4)

Notice that the right-hand side does not depend on s, so neither does the left-hand side. Hence, I will simply

write C (r, x). We can write the value of unemployment (2.15) as:

U (r) = λ+ β

(U (Bu (r)) + ηζθ (r)

1−ε∑x

Ω (x | r) maxC(Bh (r | x) , x′

)− U (Bu (r)) , 0

). (B.5)

Using the surplus-splitting condition for new hires allows us to write the free-entry condition as:

θ (r) = β1− ηκ

ζθ (r)1−ε∑

x


)− U (Bu (r)) , 0

. (B.6)

The above allows us to reduce further the value of unemployment to:

U (r) = λ+ηκ

1− ηθ (r) + βU (Bu (r)) . (B.7)

Given C (r, x), U (r), and θ (r), the policy functions can be written as:

Xe (r | x) = x′ | C (r′e, x′) ≥ U (r′e) , r

′e = Be (r | x, x′) (B.8)

Xh (r) =x | C (r′e, x) ≥ U (r′u) , r′h = Bh (r | x) , r′u = Bu (r)

. (B.9)

Similarly, once we know θ (r), we know the belief functions. Thus, solving for an equilibrium amounts to

nding a solution to the following system of functional equations:

C (x, r) = zx+ β∑x′

Ω (x, x′ | r) max C (Be (r | x, x′) , x′) , U (Be (r | x, x′)) (B.10)

U (r) = λ+ηκ

1− ηθ (r) + βU (Bu (r)) (B.11)

θ (r) = β1− ηκ

ζθ (r)1−ε∑

x


)− U (Bu (r)) , 0

. (B.12)

Dene T (C,U, θ) as the operator mapping (C,U, θ) to the right-hand side of (B.10)-(B.12). Given initial

guesses for (C,U, θ), I iterate on the T operator until I attain practical convergence.

Practically, I perform these iterations as follows. I will solve for the functions of interest on a grid of nr

56

values 0 = r1 ≤ · · · ≤ rnr = 1. Dene:

r ≡[r1 · · · rnr

]′(B.13)

x ≡[x0 x1 · · · xnx

]′, (B.14)

where x0 is understood to be zero. Dene the nr × (nx + 1) matrix C and the vectors U and Θ by:

C ≡

C (r1, x0) · · · C (r1, xnx

)

.... . .

...

C (rnr, x0) · · · C (rnr

, xnx)

(B.15)

U ≡[U (r1) · · · U (rnr )

]′(B.16)

Θ ≡[θ (r1) · · · θ (rnr

)

]′. (B.17)

Dene the nr × (nx + 1)2matrix:

Be ≡

Be (r1 | x0, x0) · · · Be (r1 | x0, xnx

) · · · Be (r1 | xnx, x0) · · · Be (r1 | xnx

, xnx)

.... . .

......

. . ....

Be (rnr| x0, x0) · · · Be (rnr

| x0, xnx) · · · Be (rnr

| xnx, x0) · · · Be (rnr

| xnx, xnx

)

,(B.18)

where elements of Be are imputed to be zero for infeasible (x, x′) transitions. Let Ce be the column-wise

interpolation of 11×(nx+1) ⊗C along the grid given by Be. That is:

Ce =

[Ce

0 Ce1 · · · Ce

nx

], (B.19)

where each Cei is nr × (nx + 1), and the (j, k + 1) element of Ce

i is C (Be (rj | xi, xk) , xk). Dene:

Ξei ≡(1(nx+1)×1 ⊗ Ω′i

)(Inx+1 ⊗ 1(nx+1)×1

)

=

Ω (x0, · | i)′ 0(nx+1)×1 · · · 0(nx+1)×1

0(nx+1)×1 Ω (x1, · | i)′. . .

...

.... . .

. . . 0(nx+1)×1

0(nx+1)×1 · · · 0(nx+1)×1 Ω (xnx, · | i)′

, (B.20)

where Ω (xh, · | i) represents the (h+ 1)st

row of Ωi, and represents element-by-element multiplication.

57

Dene Ue as the column-wise interpolation of U⊗ 11×(nx+1)2 along the grid given by Be. That is:

Ue =

[Ue

0 Ue1 · · · Ue

nx

], (B.21)

where the (j, k + 1) element of Uei is U (Be (rj | xi, xk)). Observe that column (j + 1) of max Ce,UeΞei is

maxCej ,U

ej

Ω (xj , · | i)′, which gives the expected value (over x′) of max C (Be (r | xj , x′) , x′) , U (Be (r | xj , x′))

under the distribution Ωi, conditional on starting with x = xj . The updated value of C is given by:

TC (C,U,Θ) = z1nr×1x′ + β (max Ce,UeΞe1)

(r11×(nx+1)

)+β (max Ce,UeΞe0)

(1nr×(nx+1) − r11×(nx+1)

). (B.22)

Dene the nr × 1 vector:

Bu ≡[Bu (r1) · · · Bu (rnr )

]′. (B.23)

Dene Uu as the interpolation of U along the grid given by Bu. The updated value of U is given by:

TU (C,U,Θ) = λ1nr×1 +ηκ

1− ηΘ + βUu. (B.24)

Dene the nr × 1 vector:

Bh ≡[Bh (r1) · · · Bh (rnr

)

]′. (B.25)

The above incorporates the specication assumed in the calibration exercise; namely, all newly hired workers

have the same productivity x = 1. Dene Ch as the column-wise interpolation of C ( : , I [x′ = 1] ) along

the grid given by Bh. Dene:

P ≡ minζΘ1−ε, 1

(B.26)

~ωr ≡ rω1 + (1nr×1 − r)ω0, (B.27)

where the min · operator and the power (1− ε) are understood to apply element-by-element. The updated

value of Θ is given by:

TΘ (C,U,Θ) = β

(1− ηκ

)P ~ωr

[max

Ch −Uu, 0

]. (B.28)

58

C Proof of Proposition 1

Evaluating equation (B.11) at rc and subtracting it from Hs (r, x), as dened in equation (2.14), yields:

Hs (r, x)− U (rc) = ws (r, x) + β∑x′

Ω (x, x′ | r) max He (Be (r | x, x′) , x′)− U (Be (r | x, x′)) , 0

−λ+∑x′

Ω (x, x′ | r) [U (Be (r | x, x′))− U (Bu (rc))]− βκη

1− ηθ (rc)

= ws (r, x) +η

1− ηβ∑x′

Ω (x, x′ | r) max Ge (Be (r | x, x′) , x′) , 0

−λ+ β∑x′

Ω (x, x′ | r) [U (Be (r | x, x′))− U (Bu (rc))]−κη

1− ηθ (rc) , (C.1)

where the second line uses the surplus-splitting condition (2.18) evaluated at (Be (r | x, x′) , x′, e) instead of

(r, x, s). Multiplying the above by 1− η, we obtain:

(1− η) [Hs (r, x)− U (rc)] = (1− η)ws (r, x)− (1− η)λ− κηθ (rc)

+ (1− η)β∑x′

Ω (x, x′ | r) [U (Be (r | x, x′))− U (Bu (rc))]

+ηβ∑x′

Ω (x, x′ | r) max Ge (Be (r | x, x′) , x′) , 0 . (C.2)

Multiplying equation (2.10) by η, we obtain:

ηGs (r, x) = ηzx− ηws (r, x) + ηβ∑x′

Ω (x, x′ | r) max Ge (Be (r | x, x′) , x′) , 0 . (C.3)

The surplus-splitting condition (2.18) allows us to equate the above two expressions for ηGs (r, x) and

(1− η) [Hs (r, x)− U (rc)]. Rearranging terms completes the proof.

D Data

I use the basic monthly CPS data from 1994 to 2013, available from the NBER's website, maintained by

Jean Roth.43 Although earlier data are available, the CPS underwent a redesign in 1994, and it's known

that this change altered the measured distribution of unemployment durations. The CPS is a rotating panel,

where the same household is interviewed for 4 months, not interviewed for 8 months, and then interviewed

for 4 months. Questions include labor force status and, if unemployed, the duration of search. The goal is

to identify unemployed people who have been unemployed for a given duration, and we want to know what

fraction of these people found jobs by the following interview date. However, to exploit the longitudinal

43See: http://www.nber.org/data/cps_basic.html.

59

structure of the CPS, it's necessary to match people across months. To do this, I use a method similar

to the one used by Shimer [2008]. I merge the monthly les on the basis of household identier (hrhhid),

sample identier (hrsample), serial sux (hrsersuf), household number (hrhhnum), and person line number

(pulineno). When I nd observations that match across consecutive months, I drop observations in which

the person's race, sex, or indication of Hispanic origin changes. I also drop observations in which age

changes by two or more years between months. The unemployment duration data are heavily imputed,

except for workers in their rst or fth months in sample. To minimize the eects of imputation, I conne

my sample to workers who were unemployed in their rst or fth interview month. Ultimately, I obtain

194,715 observations where a worker was unemployed in one month, and I can observe the same worker in

the following month.

In Section 7.1, I explore the data by tting a semiparametric regression, as specied by equation (7.1).

The non-linear component h (·) is assumed to be locally cubic. The covariates in xi include race, sex, Hispanic

origin, reason for unemployment, an indicator for being a high-school dropout, an indicator for being a high-

school graduate, an indicator for whether the worker was seeking part-time employment, the state-level

unemployment rate, and a quadratic function of age. The sample excludes people on temporary layo

because many of them are recalled by their previous employers (Fujita and Moscarini [2015]). I also conne

the sample to workers ages 18 to 64. With these restrictions, there are 155,638 observations in the sample;

of these, 13% indicate that they are seeking part-time employment. Potential labor-market experience is

dened as age minus years of education. Because the CPS asks people about their level of schooling, not

their years of schooling, this requires imputing a certain number of years for each level of schooling. I impute

that those who did not reach 9th grade received 13 years of education; those who reached a maximum of

9th grade (10th grade, 11th grade) received 14 years (15 years, 16 years) of education; those who reached

12th grade but did not graduate from high school received 17 years of education; those who attended some

college but did not graduate received 19 years of education; and those whose highest degree is a high-school

diploma (associate's degree, bachelor's degree, master's degree, professional degree, doctorate) received 18

years (20 years, 22 years, 23 years, 24 years, 26 years) of education. The sample is divided into quartiles

of the potential experience distribution; the 25th, 50th, and 75th percentiles of the potential experience

distribution are 6 years, 16 years, and 28 years.

60

The Hazards of Unemployment: A Macroeconomic Model of Job ...

Documents

Transcript of The Hazards of Unemployment: A Macroeconomic Model of Job ...