Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice,...

20
CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal Choice 173 Behavioral and Theoretical Evidence 173 Functional Imaging Evidence 177 Evidence from Single Unit Neurophysiology 179 Self-Control 182 Individual Differences in Discounting 182 Stationarity 184 Persistence and Delay of Gratification 187 Conclusion 189 References 189 INTRODUCTION Most decisions have future consequences, effects that are delayed in time relative to when the choice is made. Deciding to make a large purchase, or deciding to make a small adjustment to your retirement contri- bution, can affect your consumption patterns for months or years into the future. Choices about diet, exercise and smoking all have consequential long- term effects on health. A one-time decision to trust, or not to trust, a social partner can subsequently rever- berate through interactions with that person for a long time. The ability to evaluate future consequences is there- fore a fundamental aspect of decision making. Like risk (covered in Chapter 9) or social concerns (covered in Chapter 11), the timing of outcomes poses unique problems that decision makers need to solve. Accordingly, much work in neuroeconomics has inves- tigated how decision-making mechanisms are affected by delayed or future outcomes. This chapter reviews the theoretical, behavioral and neurobiological findings within this domain. VALUATION IN INTERTEMPORAL CHOICE Behavioral and Theoretical Evidence Decisions that involve tradeoffs between outcomes that occur at different points in time are called inter- temporal choices. A consistent finding regarding inter- temporal choices, in all species that have been tested, is that delayed outcomes are discounted relative to immediate ones (Frederick et al., 2002; Green and Myerson, 2004; Soman et al., 2005). That is, outcomes are weighted less the more remotely in time they occur; the subjective value of a reward is smaller when it is delayed than when the same reward is available immediately. This process can be characterized for an individual decision maker by measuring a discount function, which shows how the subjective value of an outcome changes as a function of the delay until it is received (Figure 10.1). Typically, discount functions are measured by giving an individual a series of binary choices between less attractive outcomes that are available sooner and more attractive outcomes that 173 Neuroeconomics. DOI: http://dx.doi.org/10.1016/B978-0-12-416008-8.00010-3 © 2014 Elsevier Inc. All rights reserved.

Transcript of Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice,...

Page 1: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

C H A P T E R

10

Valuation, Intertemporal Choice, andSelf-ControlJoseph W. Kable

O U T L I N E

Introduction 173

Valuation in Intertemporal Choice 173Behavioral and Theoretical Evidence 173Functional Imaging Evidence 177Evidence from Single Unit Neurophysiology 179

Self-Control 182Individual Differences in Discounting 182Stationarity 184Persistence and Delay of Gratification 187

Conclusion 189

References 189

INTRODUCTION

Most decisions have future consequences, effectsthat are delayed in time relative to when the choice ismade. Deciding to make a large purchase, or decidingto make a small adjustment to your retirement contri-bution, can affect your consumption patterns formonths or years into the future. Choices about diet,exercise and smoking all have consequential long-term effects on health. A one-time decision to trust, ornot to trust, a social partner can subsequently rever-berate through interactions with that person for along time.

The ability to evaluate future consequences is there-fore a fundamental aspect of decision making. Likerisk (covered in Chapter 9) or social concerns (coveredin Chapter 11), the timing of outcomes poses uniqueproblems that decision makers need to solve.Accordingly, much work in neuroeconomics has inves-tigated how decision-making mechanisms are affectedby delayed or future outcomes. This chapter reviewsthe theoretical, behavioral and neurobiological findingswithin this domain.

VALUATION IN INTERTEMPORALCHOICE

Behavioral and Theoretical Evidence

Decisions that involve tradeoffs between outcomesthat occur at different points in time are called inter-temporal choices. A consistent finding regarding inter-temporal choices, in all species that have been tested,is that delayed outcomes are discounted relative toimmediate ones (Frederick et al., 2002; Green andMyerson, 2004; Soman et al., 2005). That is, outcomesare weighted less the more remotely in time theyoccur; the subjective value of a reward is smaller whenit is delayed than when the same reward is availableimmediately. This process can be characterized for anindividual decision maker by measuring a discountfunction, which shows how the subjective value of anoutcome changes as a function of the delay until it isreceived (Figure 10.1). Typically, discount functionsare measured by giving an individual a series ofbinary choices between less attractive outcomes thatare available sooner and more attractive outcomes that

173Neuroeconomics. DOI: http://dx.doi.org/10.1016/B978-0-12-416008-8.00010-3 © 2014 Elsevier Inc. All rights reserved.

Page 2: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

are available later. Individual differences in the extentof discounting are often referred to as differences inpatience, with steeper discounting being equated withgreater impatience.

Much theoretical work has focused on how decisionmakers should discount delayed rewards, and muchbehavioral and theoretical work has characterized howchoosers actually do discount delayed rewards.Samuelson (1937) first suggested that decision makersshould employ exponential discounting, decrementingan outcome’s value by a fixed percentage for each timestep that it is delayed into the future (Figure 10.1):

DUðx; tÞ5 δtUðxÞ ð10:1ÞHere DU(x,t) is the discounted utility of outcome x

to be received at time t, U(x) is the utility of outcome xreceived immediately, and δ is a discount factor thatranges from zero to one, with smaller values of δresulting in greater discounting. Thus, the subjectivevalue of a delayed reward is the value that rewardwould have when received immediately decrementedby a fixed percentage for every time step the reward isdelayed into the future. Exponential discounting there-fore treats each time period equivalently: adding aday’s delay to an immediate reward or to a rewarddue one year from now affects both by the same fixedpercentage. Subsequently, Strotz (1956) showed thatonly exponential discounters maintain a consistentconsumption plan as they move through time. Allother forms of discounting lead to plans that changesimply because time has passed. In other words, non-exponential discounting leads to the kinds of inconsis-tent choices that violate the central tenet of modelsthat assume choice behavior maximizes a consistentutility function, a point discussed in Chapter 1.

These observations were further formalized byFishburn and Rubinstein (1982). They showed that adecision maker choosing between discrete outcomesthat occur at specific times makes choices consistentwith an exponential discount function (Eq. 10.1) if and

only if those choices obey five axioms (see Box 10.1).Two axioms to note specifically are ordering and statio-narity. Ordering is a form of the transitivity requirementseen in the general axiom of revealed preference andthe von Neumann�Morgenstern axioms of expectedutility (see Chapter 1). For example, ordering requiresthat if you would choose $10 today over $11 in a week,and you would choose $11 in a week over $12 in twoweeks, that you would also choose $10 today over $12in two weeks. Stationarity states that choosers are con-sistent as they move through time � that is, choosershave the same preference between two outcomes nomatter where in time they are relative to those out-comes. For example, stationarity requires that if todayyou would choose $11 in two weeks over $10 in oneweek, that you would also choose $11 in one week over$10 today, since in a week’s time, the former choice willturn into the latter one. (Note that this is the discretechoice analog of Strotz’s consistent planning condition.)

There is a wealth of evidence, however, that calls intoquestion whether humans and other animals are in factexponential discounters (Frederick et al., 2002; Green andMyerson, 2004; Soman et al., 2005). One important exam-ple is a line of research developed in the quantitativeanalysis of animal behavior. Mazur (1987) investigatedpigeons choosing between rewards that arrived after dif-ferent delays. He concluded that their empiricallyobserved choices could be best explained if the subjec-tive value of the delayed reward followed a hyperbolicfunction of the form (Figure 10.1):

DUðx; tÞ5 UðxÞ11 kt

ð10:2Þ

Here DU(x,t) is again the discounted utility of out-come x to be received at time t, U(x) is again the utility ofoutcome x received immediately, and k is a discount ratethat is typically greater than zero, with larger values of kresulting in greater discounting. Thus, the subjectivevalue of a delayed reward is the value that rewardwould have when received immediately, divided by afactor that depends on the length of the delay. This equa-tion was a natural extension of the observation that ani-mals pay attention to rates of return from different choiceoptions, which had been demonstrated in several experi-ments on optimal foraging including those investigatingthe phenomenon of the matching law (Gibbon et al., 1988;Herrnstein, 1961; Stephens, 2002). The hyperbolic equa-tion maintains that animals are concerned with ratios ofreward magnitudes and delays, but allows the weight ofthese variables to vary away from a strict rate of returncalculation. Though Mazur’s work used pigeons, previ-ous and subsequent experiments found support forhyperbolic discounting in a variety of other non-humananimals (Kim et al., 2008; Louie and Glimcher, 2010;Mazur, 1987; Richards et al., 1997).

Delay

Sub

ject

ive

valu

e

Exponential: δtU(x)

Hyperbolic: U(x)/(1+kt )

FIGURE 10.1 Comparative illustration of the hyperbolic andexponential discount functions.

174 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 3: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Ainslie was one of the first to make similar observa-tions in humans (Ainslie and Haendel, 1983). He showedthat people’s choices between monetary rewards as afunction of delay-to-reward were better described byhyperbolic as opposed to exponential discounting. Healso proposed that hyperbolic discounting provided aframework for understanding impulsive behavior

(Ainslie, 1975). He hypothesized, for example, that anaddict’s repeated cycles of deciding to quit, only to laterrevert to drug use, might be understood as temporalinconsistencies (violations of stationarity) induced byhyperbolic discounting (Figure 10.2).

There have been many subsequent studies of discount-ing in humans, and the vast majority of these have found

BOX 10.1

THE AX IOMS OF D I SCOUNTED UT I L I TY

Fishburn and Rubinstein (1982) provide an axiomati-

zation of exponential discounting, a kind of traditional

model testing widely used in economics and developed

in Chapter 1. They start with the following axioms

(quoted directly from Fishburn and Rubinstein (1982)):

Axiom 0: Outcome�Time Space

These axioms deal with choices between outcome�time pairs, each pair being an outcome x at a time t,

where time can be treated in a discrete or continuous

manner.

“X is a nondegenerate real interval; T is either a set of

successive non-negative integers or an interval of non-

negative numbers, and 0, 1AT.”

Axiom 1: Ordering

Choices are consistent with a utility ordering.

“g is a weak order on X3T”

Axiom 2: Monotonicity

If one outcome is preferred to another when both are

immediate, it is also preferred when both are delayed by

the same amount.

“If x. y then (x, t)g(y, t)”

Axiom 3: Impatience

Positive outcomes are preferred sooner, negative out-

comes are preferred later.

“If s, t then x. 0-(x, s)g(x, t), x5 0-(x, s)B(x, t), andx, 0-(x, t)g(x, s)”

Axiom 4: Continuity

Preferences do not contain any discontinuities.

“{(x, t):(x, t)g(y, s)} and {(x, t): (y, s)g(x, t)} are closed inthe product topology on X3T”

Axiom 5: Stationarity

The indifference between two time�outcome pairs

depends only on the difference between the times, and

not on the time of the first outcome.

“If (x, t)B(y, t1 τ) then (x, s)B(y, s1 τ)”

They then show that these axioms imply exponential

discounting:

Result: Exponential Discounting

If the above axioms hold, choices can be represented

as following an exponential discount function.

“THEOREM 2. If A0�A5 hold, then given any 0,α, 1,there is a continuous, increasing real valued functionf on X such that:

(i) for all (x, t), (y, s)AX3T, (x, t)g(y, s) iff αtf(x)$αsf(y);(ii) f(0) must be 0 if 0AX, and xf(x) must be positive for

all xAX\{0};(iii) if T is an interval then f is unique (given α) up to

multiplication by positive constants on {xAX: x. 0}and on {xAX: x, 0}”

Thus, any chooser who obeys all of these axioms can

be represented as maximizing an exponentially dis-

counted utility function, and, conversely, any chooser

that maximizes an exponentially discounted utility func-

tion will obey all of these axioms. As discussed in the

text, most of the theoretical and behavioral work exam-

ining departures from exponential discounting has

focused on potential violations of the fifth axiom, statio-

narity. Hyperbolic and quasi-hyperbolic discounting, for

example, both fail to satisfy the stationarity axiom.

Despite its centrality to questions about intertemporal

choice, however, there have been surprisingly few direct

tests of the stationarity axiom.

175VALUATION IN INTERTEMPORAL CHOICE

NEUROECONOMICS

Page 4: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

that people’s choices depart from exponential discounting(Frederick et al., 2002; Green and Myerson, 2004; Somanet al., 2005). Typically, though not always, these studieshad individuals make choices between immediate anddelayed rewards, estimated a discount function on thebasis of these choices, and compared the fit of the hyper-bolic and exponential equations to this function. Therehave been many fewer direct tests of the axioms of expo-nential discounting (see the section on Stationarity below),though several early papers reported violations of the sta-tionarity axiom (Ainslie and Haendel, 1983; Green et al.,1994a; Kirby and Herrnstein, 1995).

Given these findings, departures from exponentialdiscounting have been a topic of intense research inbehavioral economics. In this literature, departuresfrom exponential discounting have typically beenmodeled using what is often called a quasi-hyperbolicequation(Laibson, 1997) (Figure 10.3):

for t5 0; DUðx; tÞ5UðxÞfor all other t. 0; DUðx; tÞ5βδtUðxÞ ð10:3Þ

Here DU(x,t) is again the discounted utility of out-come x to be received at time t, U(x) is again the utilityof outcome x received immediately, and β and δ areboth discount factors that range from zero to one, withsmaller values of either resulting in greater discount-ing. Thus, the subjective value of a delayed reward isthe value that reward would have when receivedimmediately, decremented by two factors: one factor(δ) is identical to the exponential model, decrementingthe delayed reward by a fixed percentage for everytime step the reward is delayed into the future; the sec-ond factor (β) adds to this a bias towards immediaterewards, by decrementing all delayed rewards by thesame constant percentage. This formulation is more

tractable for economic modeling applications than thehyperbolic form, and the β parameter provides an eas-ily interpretable estimate of how far someone departsfrom the normative exponential equation (since atβ5 1 the quasi-hyperbolic reduces to the exponentialequation). The quasi-hyperbolic model also lends itselfto an interpretation in terms of dual process models ofhyperbolic discounting (McClure et al., 2004, 2007),with the β and δ parameters paralleling the hot/cold(Metcalfe and Mischel, 1999), affective/deliberative(Loewenstein and O’Donoghue, 2004), and doer/plan-ner dichotomies (Thaler and Shefrin, 1981) of existingtheories. Smaller β parameters accentuate the effect ofimmediacy, in the same way that the hot/affective/doer systems are predicted to bias people to chooseimmediate rewards. In fact, further work in behavioraleconomics has explicitly modeled hyperbolic-like pre-ferences as arising from a competition between twoprocesses, where the current and future “self” are trea-ted as competing agents using the tools of game theory(Fudenberg and Levine, 2004).

Time

Sub

ject

ive

valu

e

Time

Sub

ject

ive

valu

e

t1 t2 t1 t2

Exponential Hyperbolic

FIGURE 10.2 Hyperbolic discounting predicts preference reversals as a chooser moves through time. The schematic illustrates how thesubjective value of a larger, later (blue) and smaller, sooner (red) reward change as time passes, under exponential (left) and hyperbolic (right)discounting. Under hyperbolic discounting, the subjective value of the larger, later reward is greater initially (at t1), but this reverses as thearrival time of the smaller reward grows near (at t2). This reversal does not occur under exponential discounting. Note that the convention inFigure 10.2 is reversed from that of Figure 10.1; here the rewards are fixed in time and the decision maker moves.

Delay

Sub

ject

ive

valu

e

Quasi-hyperbolic: βδtU(x)Exponential: δtU(x)

FIGURE 10.3 Comparative illustration of the quasi-hyperbolicand exponential discount functions.

176 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 5: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Functional Imaging Evidence

The hyperbolic and quasi-hyperbolic formulations ofthe discount function parallel different possible neuralmechanisms that might underlie hyperbolic-like prefer-ences. One possible neural algorithm, inspired by thequasi-hyperbolic model, explains discounting throughthe interaction of two systems, one which exclusively orpredominantly values immediate rewards, and anotherthat values both immediate and delayed rewards(McClure et al., 2004, 2007). The former system, akin tothe β in quasi-hyperbolic discounting, drives behaviorthat is more impatient. The latter system, akin to the δin quasi-hyperbolic discounting, drives behavior that ismore patient. An alternative neural algorithm, closer inspirit to the hyperbolic model, posits a unitary systemthat evaluates all rewards as a hyperbolic-like functionof time (Kable and Glimcher, 2007, 2010). An earlydebate in neuroeconomics centered on which of thesealternatives better describes the neural algorithm forintertemporal valuation.

An initial set of functional imaging experimentsargued for the dual systems hypothesis (McClure et al.,2004, 2007). Those experiments found that one set ofregions � including ventromedial prefrontal cortex,posterior cingulate cortex, and ventral striatum � wasmore active for choices that involved an immediatereward compared to choices that only involved delayedrewards. A second set of regions � including lateralprefrontal cortex and posterior parietal cortex � wasmore active for all choices compared to rest and for dif-ficult choices compared to easy ones. In line with thequasi-hyperbolic model, the first set of regions wasreferred to as β regions and the second set as δ regions.These studies also found that when subjects chose theimmediate reward there was more activity in β regionsthan in δ regions, and when subjects chose the delayedreward there was more activity in δ regions that in βregions. These findings were interpreted as evidencefor a neural algorithm that mirrors dual-processaccounts of hyperbolic-like discounting.

A subsequent set of experiments, however, arguedagainst this interpretation (Kable and Glimcher, 2007,2010). For most people, immediate rewards are morevaluable than equivalently sized delayed rewards.Regions that are more active for immediate rewards,therefore, may be responding to the larger subjectivevalue of immediate rewards, rather than their immedi-acy per se. To isolate brain regions where BOLD activitytracks with subjective value, Kable and Glimcher (2007)had individuals with stable, well-characterized discountfunctions choose between immediate and delayed mone-tary rewards. The immediate reward was fixed on alltrials, while the delayed reward varied, so that the sub-jective value of only one reward was changing from trial

to trial. Participants’ discount functions provided an esti-mate of the subjective value of the delayed reward tothat specific individual, and how this changed from trialto trial. BOLD activity in ventromedial prefrontal cortex,posterior cingulate cortex, and ventral striatum was cor-related with this estimate, consistent with activity inthese regions tracking subjective value, rather thanimmediacy per se (Figure 10.4). BOLD activity in theseregions was also more strongly correlated with subjec-tive value than with the objective reward parameters(delay, monetary amount), a binary variable indicatingthe subject’s choice, or a value estimate that did not takeinto account individual differences in discounting.Furthermore, the discount rate that maximized the corre-lation between BOLD activity and subjective valuematched, on average, the subject’s behavioral discountrate, indicating that BOLD activity in each of these brainregions did not reflect a more impatient valuation thanthe subject’s behavior.

This conclusion was bolstered in a follow-up study(Kable and Glimcher, 2010), which compared choicesbetween an immediate and a delayed reward to choicesbetween two delayed rewards. In that study, BOLDactivity in medial prefrontal cortex, posterior cingulatecortex and striatum could be completely accounted forby a response to subjective value (Figure 10.4). Activityin these regions when an immediate reward was avail-able was no greater than that which would be expectedon the basis of these rewards being more valuable,demonstrating that these regions show no immediacybias beyond that which is present in people’s choices.

Many subsequent studies have supported the gen-eral conclusion that these regions � medial prefron-tal cortex, posterior cingulate cortex, and ventralstriatum � are not activated exclusively or dispro-portionately for immediate rewards. Several havedirectly replicated the finding that BOLD activity inthese regions scales with the subjective value ofimmediate and delayed rewards, as estimated fromsubject’s choices (Peters and Buchel, 2009, 2010).Others, using slightly different analyses, have con-firmed that activity in these regions is sensitive toboth the magnitude and delay of rewards (Ballardand Knutson, 2009; Pine et al., 2009; Wittmann et al.,2007), as would be required for any region thatencodes subjective value.

As discussed in next section in more detail, one openquestion is whether BOLD activity that scales with sub-jective value when averaged across a large anatomicalregion could result from the aggregation of the activityof smaller neural units that do not themselves track sub-jective value. For example, one class of computationalmodels produces hyperbolic discounting in aggregate byaveraging over individual units that exponentially dis-count at a range of different rates (Kurth-Nelson and

177VALUATION IN INTERTEMPORAL CHOICE

NEUROECONOMICS

Page 6: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Redish, 2010; Tanaka et al., 2004); see Chapter 17 formore detail). In this context, Tanaka and colleagues(Tanaka et al., 2004; reported the intriguing result thatdelay sensitivity varied across the striatum, with BOLDactivity in ventromedial areas of the striatum reflecting agreater degree of discounting and BOLD activity in dor-solateral areas of the striatum reflecting a lesser degreeof discounting. Such a finding is consistent with BOLDactivity averaged across the entire structure reflectingthe degree of discounting observed in behavior, but itwill be important to further investigate this possibility infuture research, especially since the kind of intertempor-al choices studied by Tanaka and colleagues (Tanakaet al., 2004) � delays on the order of seconds in the

context of a reinforcement learning task � differed dra-matically from those used in the other studies discussedin this section.

The view that several regions encode the subjectivevalue of immediate and delayed rewards is also con-sistent with neuroeconomic findings in other choicedomains. In the past five years a growing number ofstudies have shown that activity in two of these areasin particular � ventromedial prefrontal cortex andventral striatum � scales with the subjective value ofthe available options during choice (Kable andGlimcher, 2009; Levy and Glimcher, 2012; Rangel andHare, 2010). This finding is consistent across a varietyof choice domains, including risky and ambiguous

(A)

x = –3 y = 5–0.04 –0.02 0 0.02 0.04

0

5

10

15

Neural k – Behavioral k

Cou

nt

Brain more impulsive

Behavior more impulsive

0

5

10

15

Mea

n A

ctiv

ity, I

mm

edia

te P

rese

nt

– M

ean

Act

ivity

, Im

med

iate

Abs

ent

Ven

tral

S

tria

tum

Med

ial

Pre

fron

tal

Pos

terio

r C

ingu

late

x = 3 y = 6

Range of effects predicted if a region encodes subjective value

(B)

(C) (D)

Striatum Medial prefrontal Posterior cingulate

FIGURE 10.4 BOLD activity in medial prefrontal cortex, posterior cingulate cortex and ventral striatum tracks the subjective value ofimmediate and delayed rewards. (A) Regions where BOLD activity was correlated with the subjective value (as estimated from behavior) dur-ing intertemporal choices. (B) The discount rate that maximized the subjective value correlation in these regions (neural k) was not more orless impulsive than the behavioral discount rate on average. (C) Replication of panel (A) under conditions where an immediate reward couldbe present or absent. (D) Under these conditions, BOLD activity in medial prefrontal cortex, posterior cingulate cortex and ventral striatumwas only greater when an immediate reward was present to the extent predicted if these regions encode subjective value. (A, B) Adapted fromKable and Glimcher (2007), and (C, D) adapted from Kable and Glimcher (2010).

178 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 7: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

gambles (Levy et al., 2010; Tom et al., 2007) (see alsoChapter 9), food (Hare et al., 2009, 2011a; Plassmannet al., 2007) (see also Chapter 8), consumer goods (Chibet al., 2009), and social exchange (Harbaugh et al., 2007;Hare et al., 2010) (see also Chapter 11). Further, thesame areas encode subjective value signals consistentlybetween different domains (e.g., food and money), asrequired by any region encoding a “common cur-rency” for making choices (Levy and Glimcher, 2011,2012) (see also Chapter 13). In light of this evidence, itis not surprising that these same regions would encodethe subjective value of immediate and delayed rewardsduring intertemporal choices.

But what about the role of the putative δ regions �dorsolateral prefrontal and posterior parietal cortex?The somewhat conflicting evidence regarding the roleof these regions is discussed in more detail in the fol-lowing sections. There is no doubt that these regionsare involved in the choice process, consistent withtheir reliable activation for harder compared to easierchoices. However, the finding that these regions showgreater activity when people choose delayed rewardsthan when they choose immediate rewards, which wasthe original basis for arguing that these regions pro-mote patient behavior, was not replicated in Kable andGlimcher (2007). In fact, in that study, increased activ-ity in medial prefrontal cortex was the strongest predic-tor of choosing the delayed reward.

Even if lateral prefrontal regions act to promotepatient choices, though, it does not necessarily followthat these regions act in opposition to impatient pro-cesses in other regions. An alternative view is that theaction of lateral prefrontal regions modulates valuerepresentations elsewhere in the brain (Kable, 2010).Support for this view comes from a recent study of diet-ing decisions (Hare et al., 2009). That study foundincreased inferior prefrontal activity when subjectsavoided foods that they thought tasted good but wereunhealthy, and increased functional connectivitybetween this region and ventromedial prefrontal cortexduring such choices. Activity in ventromedial prefron-tal cortex, in turn, reflected both taste and health con-cerns. That is, this region tracked the overall subjectivevalue of the food item, as estimated from the subject’sratings. These findings support the view that lateralprefrontal regions may interact with, rather than strictlyoppose in push�pull fashion, ventromedial prefrontalregions to bias behavior towards long-term outcomes.

Evidence from Single Unit Neurophysiology

The balance of functional imaging evidence sup-ports the view that hyperbolic discounting arises outof a single, integrated neural algorithm that evaluates

both immediate and delayed rewards. The BOLD sig-nal measured with fMRI, however, only reflectspopulation-level neural activity within a volume ofbrain tissue. BOLD activity that scales with the subjec-tive value of rewards could in principle arise from apopulation of neurons in which no individual neuronprecisely tracks subjective value. In the limit, twopopulations of neurons co-localized in the same voxel,one that reflected an impatient β signal and anotherthat reflected a patient δ signal, could generate theBOLD activity observed in ventromedial prefrontalcortex, ventral striatum, and posterior cingulate cortex.Similarly, if individual neurons only tracked the mag-nitude or delay of an outcome, the activity of the pop-ulation as a whole could scale with subjective value.

In fact, as introduced in the previous section, thereare several computational models that explain howaggregate hyperbolic discounting could arise from apopulation of neural units, none of which individuallyperform hyperbolic discounting (Kurth-Nelson andRedish, 2010; Tanaka et al., 2004) (see Chapter 17 formore detail). If the activity of each unit reflects anexponentially discounted value estimate, and there is adistribution of exponential discount rates across units,then the sum of activity across the units will reflecthyperbolic discounting. This follows from the fact thata mixture of exponential functions can approximate ahyperbolic function (Azfar, 1999; Sozou, 1998).

It is therefore of great interest to know what individ-ual neurons encode in brain regions where the BOLDsignal scales with subjective value during intertemporalchoice. Such data would provide important informationabout links between the representation of subjectivevalue in neuronal spiking and the BOLD signal. Currentwork in rodents may soon provide these data for muchof the basal ganglia (see Chapter 17). Unfortunately, theinvestigations that have most directly compared neuro-nal sensitivities to intertemporal choice behavior to date(Louie and Glimcher, 2010) have been performed inother brain regions, and very few investigations at allhave recorded from single neurons in areas where theBOLD signal reflects subjective value (with the exceptionof Cai et al., 2011, discussed below). The available datado not suggest that any region contains distinct catego-ries of “patient” and “impatient” neurons � rather, neu-ronal sensitivity to delay within a region appears to benormally (or at least uniformly) distributed. However,firm conclusions on this question await more detailedinvestigations, especially in neural regions where theBOLD signal tracks subjective value.

The studies performed to date do demonstrate thatthe subjective value of immediate and delayed rewardsaffects neuronal firing rates in both putative β and puta-tive δ regions, and also characterize the representationalspace in which subjective value is encoded in these

179VALUATION IN INTERTEMPORAL CHOICE

NEUROECONOMICS

Page 8: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

regions. Specifically, these data discriminate betweentwo possible representational spaces in which subjectivevalues might be encoded. One possibility is that individ-ual neurons might encode the subjective value of an out-come independent of the specific motor action requiredto obtain that outcome, that is, a representation in whatis sometimes called “goods-based” or “stimulus value”space. Alternatively, individual neurons might encodethe subjective value of an outcome that results from spe-cific motor actions, that is, a representation in what issometimes called “action value” space (see Chapter 13for more detailed discussion of this issue).

These studies all examined monkey subjects makingchoices between different-sized juice rewards availableafter different delays. These subjects would indicatetheir choice by looking at one of two symbols, and theywere given initial experience so that they learned thesize and delay of the reward associated with each sym-bol. Cai and colleagues (2011) recorded from striatalneurons while monkeys performed such a task. Basedon the monkey’s choices, they could fit a discount func-tion and estimate the discounted value of each option.In ventral striatum, they found that single neuron activ-ity tended to correlate best with the sum of thediscounted values of the two options. For example, neu-rons that responded more strongly as the discountedvalue available from the left option (i.e., obtained bylooking to the symbol on the left) increased alsoresponded more strongly as the discounted value avail-able from the right option (i.e., obtained by looking to

the symbol on the right) increased (Figure 10.5). Theactivity of ventral striatal neurons therefore did notdepend on the specific motor response necessary toobtain one reward versus the other. Thus ventral stria-tal neurons seemed to encode discounted value in“goods-based” or “stimulus-value” space.

The same group has also studied neuronal activity indorsolateral prefrontal cortex and dorsal striatum in thesame task (Cai et al., 2011; Kim et al., 2008). The patternof responses in these two regions differed from that inventral striatum. In dorsal striatum, single neuronactivity tended to correlate best with the difference indiscounted values between the two options. For exam-ple, neurons that responded more strongly as the dis-counted value available from the left option increasedalso responded more strongly as the discounted valueavailable from the right option decreased (Figure 10.5). Aweaker trend towards the same kind of responses waspresent in dorsolateral prefrontal neurons (note thisresult was significant when the analysis was restrictedto those neurons that were identified as selective for thedirection of the movement). The signal the dorsolateralprefrontal cortex and dorsal striatum therefore seem toencode is discounted value in “action-value” space,since it depends on the specific motor response neces-sary to obtain one reward versus the other.

The response in dorsolateral prefrontal cortex seemsmore compatible with the idea that this region isinvolved in action selection than the idea that thisregion promotes patient behavior. In addition, Kim

FIGURE 10.5 Single neuron activity in dorsolateral prefrontal cortex and striatum during intertemporal choice. Each scatter plot showsthe standardized regression coefficients (SRC) associated with the discounted value of the left target (DVL) and the discounted value of theright target (DVR) for single neurons in dorsolateral prefrontal cortex dlPFC, caudate nucleus/dorsal striatum, and ventral striatum. Circlescorrespond to neurons for which the effect of at least one of the variables was significant (P, 0.05), whereas squares correspond to neuronsfor which neither of the variables was significant. Gray and black circles indicate neurons for which both variables were significant at P, 0.10and P, 0.05, respectively. Across neurons, the relationship between sensitivities is significantly positive in ventral striatum (r5 0.39,P, 0.001), indicating that neuronal activity in this region best follows the sum of the discounted values of the two options. The relationship issignificantly negative in caudate (r520.22, P5 0.03) and exhibits a negative trend in dorsolateral prefrontal cortex (r520.09, P5 0.11), indi-cating that neuronal activity in these regions best follows the difference between the discounted value of the two actions. Adapted from Kimet al. (2008) and Cai et al. (2011)

180 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 9: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

and colleagues (2008) do not report that dorsolateralprefrontal neurons respond preferentially to delayedrewards or to a greater extent on trials where thedelayed reward is chosen. A similar conclusion hasbeen reached about single neurons in posterior parietalcortex, another putative δ region. Louie and Glimcher(Louie and Glimcher, 2010) recorded from single neu-rons in area LIP in the parietal lobe. Single neurons inLIP show increased firing rates before saccades to aparticular region of space, called the response field, orto the onset of potential saccadic targets in theresponse field. Previous results establish that the firingof LIP neurons in response to potential saccadic targetsencodes the subjective value of the outcome obtainedby an eye movement into its response field, scaled bythe value available from other possible eye movements(Dorris and Glimcher, 2004; Louie and Glimcher, 2010;Platt and Glimcher, 1999). Louie and Glimcher (2010)demonstrated that neuronal response to potential tar-gets in LIP reflects the discounted value of the rewardobtained by saccades to those targets, when thisreward is only obtained after an intervening delay.

Louie and Glimcher’s (2010) measurements alsodemonstrate how the activity of the neuronal popula-tion in LIP relates to behavior (Figure 10.6). Assumingthat the population of LIP neurons encodes discountedvalue, they were able to estimate a discount functionbased purely on the neural data. They also estimated adiscount rate from each subject’s behavioral data. Eventhough the behavioral discount rate varied acrossmonkeys, the discount rate estimated from the neuro-nal population in each subject matched the behavioraldiscount rate from that same subject. This matchbetween neuronal and behavioral discount ratesdemonstrates that LIP neurons do not preferentiallyvalue delayed rewards as would be expected of a puta-tive δ region. Furthermore, there was no evidence thatdelay sensitivity in LIP neurons followed a bimodal, asopposed to normal, distribution, so this population-level match in LIP was not achieved by averagingtogether two distinct sub-populations.

In summary, the major difference between ventralstriatal neurons and neurons in dorsal caudate, dorso-lateral prefrontal cortex, and LIP seems to be in theirsensitivity to the specific motor action required toobtain a reward. This difference suggests that the tran-sition from medial prefrontal/medial parietal/ventralstriatal regions to lateral prefrontal/lateral parietal/dorsal striatal regions has more to do with a progres-sion from valuation to choice, or from valuation in“goods-space” to valuation in “action-space,” than indifferentially contributing to patient versus impatientbehavior. At this time, there seems little compellingevidence for the existence of distinct β and δ systemsas originally hypothesized.

The characterization of neuronal activity linked todiscounted value in dorsolateral prefrontal and poste-rior parietal cortex raises the question of why theseregions do not always exhibit BOLD activity linked tosubjective value in fMRI experiments (McClure et al.,2004, 2007; Kable and Glimcher, 2007, 2010; Peters andBuchel, 2009). One possibility is that the effects of sub-jective value on single neurons in these regions cancelsout in the average neuronal activity across the popula-tion. Consistent with this notion, multivariate analysesthat examine the information available across a popu-lation of fMRI voxels have found more widespreadencoding of value than corresponding studies thathave examined only mean BOLD activity (Vickeryet al., 2011). Another possibility is that BOLD activityin these regions tracks not the overall subjective valueof the choice options but rather the absolute differencein value between the choice options. Several recentstudies in other choice domains have identified BOLDactivity in premotor and posterior parietal regions thatis correlated with the absolute difference in subjectivevalues, and argued that such a signal is consistent

2

0

1

0.5

Dis

coun

ted

valu

e

0

1

0.5

00 4 8 12 0 4 8 12

Pop

ulat

ion

activ

ity(n

orm

)

0 500 1000 0 500 1000

2

0

Time (ms)

Delay (s)

2

0

2

0

v s v sMonkey W Monkey D

FIGURE 10. 6 Neural discount functions from LIP match behav-ioral discount functions. The top shows the averaged neuronal activ-ity on free choice trials where the monkey looked into the neuron’sresponse field, color-coded by delay and aligned to the visual targetonset (v) and initiation of a saccade (s). A zero delay is shown inpink, a delay of 12 seconds is shown in light blue, and intermediatedelays are shown in intermediate colors. Firing rates are shown sepa-rately for two monkeys, Monkey W (n5 23 neurons) and Monkey D(n5 48 neurons). The bottom panel shows the neural and behavioraldiscount functions for both animals. Filled circles show the normal-ized LIP activity as a function of delay, relative to activity at zerodelay. The neural discount function is shown in black, and repre-sents the best-fit hyperbolic curve to the neural activity as a functionof delay (95% confidence intervals based on bootstrap are illustratedin gray). The behavioral discount functions are shown in red.Adapted from Louie and Glimcher (2010).

181VALUATION IN INTERTEMPORAL CHOICE

NEUROECONOMICS

Page 10: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

with a region that encodes action value comparisons atthe single neuron level (Hare et al., 2011b; Wunderlichet al., 2009).

SELF-CONTROL

Intertemporal choices are intricately tied up with popu-lar notions of self-control. The dieter resisting the urge toindulge or the athlete who trains long hours in hopes offuture victory are both prioritizing long-term goals overshort-term comforts. It is natural, therefore, that work inneuroeconomics on intertemporal choice has begun totouch on questions in the broader domain of self-control.

A potential pitfall in these efforts is that the term“self-control” already has a rich, varied and complexlay meaning. A danger, then, is that scientific work onself-control is falling prey to the so-called “jingle fal-lacy” (Thorndike, 1904) � using the same term to referto more than one process. Indeed, the number of papersthat reference self-control in psychology, economics,and neuroscience is vast and growing, yet investigatorsacross these different fields seem to employ the term“self-control” in quite different ways (Kable, 2011). It istherefore important to try and be careful and preciseabout its usage here.

In particular, it is important to note that it may notbe merited to start with the assumption that there is asingle cognitive or neural process that corresponds to“self-control.” A more agnostic starting point wouldrecognize that there is a family of behaviors that typi-cally get labeled as “self-controlled” and that thesebehaviors share at least some superficial resemblanceto each other. A scientific program in this domainwould then seek to develop experimental paradigmsthat captured specific aspects of self-controlled behav-ior, and then would attempt to identify and character-ize the cognitive and neural processes that are criticalfor behavior in these paradigms. Such work mightthen identify a single process, or alternatively a multi-tude of processes, that are important in self-controlledbehaviors. Taking this approach, in this section we willdiscuss work on three specific topics relevant to thebroad domain of self-control: individual differences indiscount rates; intertemporal preference reversals andviolations of stationarity; and persistence in the pursuitof delayed gratification.

Individual Differences in Discounting

One topic within the domain of self-control involvesindividual differences in the steepness of discounting.Behaviors that are labeled as self-controlled ofteninvolve choosing delayed rewards over immediateones. What makes some people more likely to choose

delayed rewards over immediate ones? Are there inter-ventions that reduce an individual’s discount rate?

Note that the appropriate level of discounting for aperson to adopt in any given environment can bedebated in most cases (Baron, 2000; Harvey, 1994, 1986),so there are few situations where it is clear-cut that indi-viduals discount “too much.” A notable exception is thesituation that is most widely studied in the laboratory �choices between immediate and delayed monetaryrewards. If an individual has access to credit and invest-ment opportunities, they should discount monetaryrewards at the market interest rate (Fisher, 1930), yet themedian discount rate for monetary rewards observedbehaviorally is many times the market rate (Fredericket al., 2002; Green and Myerson, 2004; Soman et al., 2005).This same argument, however, does not necessarilyapply to other kinds of intertemporal choices.Nevertheless, many behaviors that are labeled as self-controlled involve choosing delayed rewards overimmediate ones, and many behaviors that policymakersseek to change (like the consumption of illicit drugs)involve choosing immediate rewards over delayed ones.

There are several behavioral correlates of discountrates. Discounting of delayed rewards decreases withincreasing cognitive ability (Burks et al., 2009; Shamoshet al., 2008) and decreases from childhood into middleage (Green et al., 1994b). Consistent with shallowdiscounting being an important contributor to self-controlled behaviors, steeper discount rates are associ-ated with tobacco use (Baker et al., 2003; Bickel andMarsch, 2001), alcohol use (Boettiger et al., 2007;Boettiger et al., 2009; Mazas et al., 2000; Mitchell et al.,2007), use of other drugs (Kirby and Petry, 2004; Kirbyet al., 1999; Monterosso et al., 2007; Petry et al., 1998),pathological gambling (Petry, 2001), and obesity(Weller et al., 2008).

At the neural level, one hypothesis is that the activ-ity and integrity of the lateral frontal cortex contributesto reduced discounting. This hypothesis can accountfor the behavioral correlates of discount rates outlinedin the previous paragraph. Cognitive ability is associ-ated with neural activity in the lateral frontal cortexand the integrity of the lateral frontal lobes (Gray andThompson, 2004). During development, the lateralfrontal cortex is one of the latest regions to reachmature levels of myelination (Gogtay et al., 2004).Additionally, neural changes in prefrontal cortex areprominent in substance abusers (Volkow and Fowler,2000; Volkow and Li, 2004; Volkow et al., 2003).

Several lines of evidence also directly support thehypothesis that lateral frontal cortex contributes toreduced discounting. Several studies have found high-er BOLD activity in lateral frontal regions when indivi-duals choose delayed rewards rather than immediaterewards (McClure et al., 2004, 2007; Weber and

182 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 11: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Huettel, 2008), or in shallower discounters comparedto steeper discounters (Ballard and Knutson, 2009;Boettiger et al., 2007; Monterosso et al., 2007; Shamoshet al., 2008). Shallower discounting is also associatedwith increases in gray matter volume in lateral frontalregions (Bjork et al., 2009). When transcranial magneticstimulation (see Chapter 6 for more details on thismethodology) is used to disrupt activity in left lateralfrontal cortex, people are more likely choose immedi-ate over delayed rewards (Figner et al., 2010).

There are questions about the consistency and repli-cability of these results, however. Different studiesimplicate diverse regions of lateral prefrontal cortex,including superior frontal (Boettiger et al., 2007; Weberand Huettel, 2008), middle frontal (Figner et al., 2010;Shamosh et al., 2008), anterior and posterior inferiorfrontal (Hare et al., 2009), and lateral orbital regions(Boettiger et al., 2007; Monterosso et al., 2007).Functional imaging results are also conflicting, withsome studies finding greater BOLD activity in lateralfrontal regions in impatient subjects (Boettiger et al.,2007), and others finding no lateral frontal differencesbetween immediate and delayed choices (Kable andGlimcher, 2007, 2010; Wittmann et al., 2007). In fact, theone study that has examined discounting in patientswith damage to lateral prefrontal cortex did not findany effect on discount rates (Fellows and Farah, 2005).

Holding aside concerns about empirical consistency,another open question regards what the precise func-tional role of lateral frontal cortex in intertemporalchoices might be (Kable, 2010). Why might the activityand integrity of lateral frontal cortex contribute toreduced discounting? One possibility is that lateralfrontal cortex is directly involved in action selection,and more specifically that it inhibits the selection ofimmediate rewards (Figner et al., 2010). Another possi-bility is that lateral frontal cortex is involved in direct-ing attention towards certain goals or choice attributes,and that this, on balance, leads to a more positiveevaluation of delayed rewards (Hare et al., 2009). Theformer proposal posits that lateral prefrontal cortexacts on the output of an evaluation process, while thelatter suggests that lateral prefrontal cortex modulatesthe inputs to an evaluation process.

In resolving these questions, it will help to considerfindings from the broader cognitive neuroscience litera-ture on prefrontal cortex. Direct inhibition of motorresponses has been studied using paradigms like stop-signal reaction time, a task that measures how quicklyand effectively someone can cancel a movement theyhad previously been instructed to make. Performanceon such tasks has been linked to the right inferior pre-frontal cortex (Aron et al., 2004). More dorsal regions(inferior frontal sulcus, middle frontal gyrus) have beenlinked to other processes such as the maintenance and

flexible manipulation of information in working mem-ory (Nee et al., 2007; Wager and Smith, 2003). The latterregion has been identified in more studies of intertem-poral decision making. One possible explanation forinconsistency across studies is that lateral prefrontalcortex plays no specific role in promoting delayedchoices, but is only involved to a greater degree in moredifficult choices, and in some studies choices of thedelayed reward are confounded with difficulty. Anintriguing alternative, however, is that the functionaleffect of lateral prefrontal activity in principle dependson what information is currently being maintained orattended. Behavioral evidence shows that directingattention towards the magnitude of reward promotesdelayed choices, while directing attention to the delaypromotes immediate choices (Weber et al., 2007). If lat-eral prefrontal cortex is primarily involved in imple-menting this kind of attentional modulation, then itmay not be surprising that results are sometimes incon-sistent, because of differences across studies or subjectsin what information is attended.

Although there has been more emphasis on testingwhether lateral prefrontal cortex contributes to patientchoices, there is growing evidence that medial prefron-tal regions may be at least as critical a neural locus fordetermining individual differences in discounting(Figure 10.7). A corollary of the finding that medial pre-frontal cortex tracks the subjective value of delayedrewards is that this region is less active for delayedrewards in individuals who are more impatient (Kableand Glimcher, 2007). Interestingly, activity in medialprefrontal cortex when individuals are simply asked tothink about the future is also correlated with discountrates. When asked to make judgments about themselvesin the future, more impatient individuals exhibit agreater decrease in medial prefrontal regions comparedto judgments about the present (Ersner-Hershfield et al.,2009; Mitchell et al., 2010). Furthermore, interventionsthat promote patient behavior enhance activity inmedial prefrontal regions. Peters and Buchel (2010)found that cuing individuals with specific future eventsreduced discount rates, an effect that was correlatedwith changes in both activity and functional connectiv-ity in anterior cingulate cortex. Chua and colleagues(2011) found that higher activity in medial prefrontalregions when smokers viewed anti-smoking messagespredicted quitting success, while Falk and colleagues(2010, 2011) found that higher activity in ventromedialprefrontal cortex predicted the success of both pro-sunscreen and anti-smoking messages. Finally, the onehuman lesion study that observed increases in dis-counting, the effect that should be observed when aregion contributing to patient behavior is damaged,involved lesions of medial orbitofrontal cortex (Sellittoet al., 2010). These studies suggest that the medial

183SELF-CONTROL

NEUROECONOMICS

Page 12: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

regions of prefrontal cortex may be as or even moreimportant than the lateral regions in promoting patientbehavior. This possibility merits closer attention infuture work, as does the growing evidence for func-tional interactions between lateral and medial prefron-tal cortex being critical in promoting patient behavior(Baumgartner et al., 2011; Hare et al., 2009) (see alsoChapters 8 and 11).

Stationarity

Behaviors that are labeled self-controlled usuallyinvolve not just choosing delayed rewards, but also per-sisting in this choice until the delayed reward arrives. Aperson seeking to lose weight must not only choose tostart a diet but also stick to it in the face of opportunitiesto cheat. A drug user seeking to quit must not only starttreatment but also stay with it over time. Yet, dieters,drug users, and others often “fall off the wagon.”People often make the initial commitment to a delayedgoal, only to fail to persist in this choice, abandoningthis commitment before achieving their goal.

Remember that a central argument for the rational-ity of exponential discounting is that it avoids oneform of intertemporal preference reversal, as expressedin the stationarity axiom (Fishburn and Rubinstein,1982). The stationarity axiom states that if a decisionmaker prefers one delayed outcome to another, thispreference should not change simply because time haspassed and both outcomes are now sooner in time. Animportant feature of both hyperbolic and quasi-hyperbolic discounting models is that they lead tochoices that violate this stationarity requirement.

Ainslie proposed that this feature of hyperbolic dis-counting could explain impulsive behavior (Ainslie, 1975;Ainslie and Haendel, 1983). If people are hyperbolic dis-counters, there will be some situations where they initiallyprefer the larger-later of two delayed rewards, but as timepasses and both rewards grows nearer, this preferencereverses to the smaller-sooner choice (Figure 10.2). Thiscould explain, for example, how a smoker could go to bedresolving to choose the long-term health benefits of quit-ting over the short-term pleasures of one more cigarette,only to arrive at the next day, face a cigarette in front ofthem and do exactly the opposite.

It is important to realize, however, that the non-stationarity of hyperbolic and quasi-hyperbolic dis-counting arises because of the shape of these discountfunctions. Both very patient and very impatient hyper-bolic choosers will make preference reversals. Thesteepness of discounting only determines for whatpairs of rewards such reversals are predicted to occur.

However, even though it is often taken as given inbehavioral economics that choosers violate stationarity,the empirical evidence regarding stationarity violationsis surprisingly equivocal. There are many studiesshowing that hyperbolic models fit choice data betterthan exponential models do; however, it is critical tonote that such fit comparisons are not direct tests ofstationarity. To test whether decision makers obey sta-tionarity, one must compare choices of the followingform (where y is preferred to x at no delay and s is far-ther away in time than t):

1. Would you prefer x at time t or y at time s? Forexample, would you prefer $10 today or $11 inone week?

(A) (B) (C)

FIGURE 10.7 Activity in medial prefrontal cortex is associated with less impatient decisions. (A) There is greater BOLD activity in medialprefrontal (MPFC) and medial parietal cortex for judgments about yourself now compared to judgments about yourself in the future.Impatient people showed a bigger drop in medial prefrontal activity when judgments concerned the future as opposed to the present (adaptedfrom Mitchell et al. (2010), see also Ersner-Hershfield et al. (2009)). (B) Providing event tags for future dates increases activity in ventromedialprefrontal cortex (vmPFC) and medial parietal areas (RSC, retrosplenial cortex; PCC, posterior cingulate cortex). This condition is also associ-ated with reduced impatience. (Adapted from Peters and Buchel (2010)). (C) Activity in medial prefrontal cortex (MPFC) and medial parietal cor-tex (precuneus) when viewing persuasive messages is correlated with the degree of future behavior change. This result is from messagespromoting sunscreen use, and similar results have been found in the same medial prefrontal cortex region for messages promoting smokingcessation (adapted from Falk et al. (2010)).

184 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 13: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

2. Would you prefer x at time t1 τ or y at time s1 τ?For example, would you prefer $10 in one week or$11 in two weeks?

A decision maker choosing the later option in ques-tion 2 ($11 in two weeks in the example) and thesooner option in question 1 ($10 today in the example)would violate stationarity in the manner predicted byhyperbolic discounting.

Such a test of stationarity can be performed in twoways. An experimenter could ask both questions onthe same occasion. Alternatively, question 2 could beasked on one day and question 1 could be asked laterafter the amount of time τ has passed. In the lattercase, for example, an experiment might ask you onone day whether you prefer $10 in one week or $11 intwo weeks, and then ask you one week later whetheryou prefer $10 today or $11 in one week. For practicalreasons, most tests of stationarity have been carriedout in the first manner. It is important to note, how-ever, that only in the second case is the experimentasking about the exact same tradeoff in consumptionfrom two different points in time. Indeed, some mod-els that incorporate uncertainty into exponential dis-counting (Azfar, 1999; Sozou, 1998) predict (apparent)violations of stationarity in the first case (when bothquestions are asked on the same day) but do not pre-dict violations in the second case (when the sametradeoff is asked about on two different days). Theydo this by postulating, in essence, that subjects havegreater doubts about whether or not they will receivelong-delayed rewards from the experimenter thanthey have about immediate rewards; a kind of mixingof risk and delay.

Studies testing stationarity in the first manner, byasking both questions on the same occasion, haveyielded mixed results. Some find evidence for statio-narity violations (Ainslie and Haendel, 1983; Greenet al., 1994a; Kirby and Herrnstein, 1995) and others donot (Ahlbrecht and Weber, 1997; Baron, 2000; Holcomband Nelson, 1992; Kable and Glimcher, 2010; Read,2001; Read and Roelofsma, 2003). Interestingly, all ofthe studies reporting evidence of stationarity violationshave used a specific experimental procedure, in whichparticipants were first verified to prefer a smaller-sooner option ((x,t) in question 1), and subsequentquestions added fixed delays (τ) to both options untilthe participant’s preferences reversed. This procedureis explicitly searching for the violations of stationaritypredicted by hyperbolic discounting. Since only pairsof options for which the person prefers the sooneroption when both delays are shorter are examined, thisprocedure can only discover inconsistencies in the pre-dicted direction. (An inconsistency in the unpredicteddirection would involve selecting the sooner option

when both delays are longer � (y,s) in question 1 �and the later option when both delays are shorter � (x,t1 t) in question 2.) An arguably fairer test wouldpresent choices in a random order, and compare pairsof questions that differ by the addition of a fixed delay.Studies that have used this approach have often failedto find evidence of stationarity violations (Ahlbrechtand Weber, 1997; Baron, 2000; Holcomb and Nelson,1992; Kable and Glimcher, 2010; Read, 2001; Read andRoelofsma, 2003). Kable and Glimcher (2010) per-formed a fairly extensive test of this type, whichinvolved 200�400 pairs of questions in 25 participants.They found no evidence that stationarity violationswere more frequent than inconsistencies in the unpre-dicted direction (Figure 10.8).

Studies that have tested stationarity in the secondmanner, by compared decisions about the same tradeoffat two different points in time, have similarly yieldedmixed results (Figure 10.8). One study found evidencefor violations in the predicted direction, representing ashift towards the smaller-sooner reward as time passed(Ainslie and Haendel, 1983). A series of two studiesfound no inconsistencies on average (Read et al., 2012).A third study found a significant shift in the oppositeof the predicted direction, toward the larger-laterreward, as time passed (Sayman and Onculer, 2009).

The experimental evidence then for the central irra-tionality predicted by traditional hyperbolic and quasi-hyperbolic models � that decision makers will shiftfrom favoring larger-later rewards to smaller-soonerrewards as both draw closer in time � is surprisinglyweak. Several models have been proposed to try andreconcile these findings with the evidence againstexponential discounting in choices between immediateand delayed rewards (Glimcher et al., 2007; Greenet al., 2005; Kable and Glimcher, 2010; Read, 2001;Read and Roelofsma, 2003; Scholten and Read, 2006).These models all share the notion that choosers takeinto account differences in delays and magnitudes,instead of or in addition to absolute delays and magni-tudes. Neural data weighs against models that onlytake into account differences in delay and magnitude,since stimuli with different absolute delays but equiva-lent delay differences do not lead to equivalent neuralsignals (Kable and Glimcher, 2010). Models that canaccount for both the behavioral and neural findingsare just beginning to be explored, and a priority forfuture research will be to develop systematic explana-tions for departures from not just exponential but alsohyperbolic discounting. Thus, studies looking forpotential violations of stationarity have so far yieldedfurther empirical puzzles, which have meant that thisparadigm has not yet provided the insights one mighthave expected regarding processes important for self-controlled behavior.

185SELF-CONTROL

NEUROECONOMICS

Page 14: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

0

50

100

Per

cent

of c

hoic

es

Same60 DAY More ImpatientNOW More Impatient

Shift60 days

60 DAY conditionNOW condition

(A)

(B)

8

7

6

5

4

3

2

1

00 0.2

Population switch proportion (SWP)

0.4 0.6 0.8 1

Sayman & Onculer Experiment 1 and 2 Ainslie & Haendel

FIGURE 10.8 Direct tests do not find strong evidence that stationarity is violated. (A) Kable and Glimcher (2010) asked subjects pairs ofquestions that provided a direct test of stationarity, since the difference in delay was constant across the pairs, but the delay to sooner rewardvaried (specifically, it was either immediate, i.e., “now”, or in 60 days). Questions were presented in random order. For vast majority of ques-tions, people chose consistently across the pair (i.e., they chose both of the sooner or both of the later rewards). When they were inconsistent,they were actually slightly more likely to reverse in the direction not predicted by hyperbolic discounting (i.e., choosing the smaller rewardwhen both are delayed and the larger when one is immediate). Similar results obtained when restricting the analysis to only those choiceswhere hyperbolic discounting predicts reversals or to only the very first choices the subjects faced. Adapted from Kable and Glimcher (2010).(B) Posterior probability that people will switch in three longitudinal tests of stationarity. Larger switch proportions signify a greater likeli-hood of switching in the direction predicted by hyperbolic discounting (from later rewards when both options are farther in the future tosooner rewards when both options are nearer in time), and smaller proportions signify a greater likelihood of switching in the unpredicteddirection. A switch proportion of 0.5 means people are equally likely to switch in either direction (predicted or unpredicted). Read and collea-gues (2012) argue that the heterogeneity across studies likely reflects social effects, given that subjects in Sayman and Onculer (2009) andAinslie and Haendel (1983) could confer between the time of the first and second choice. Across the three studies there is not strong evidencefor stationarity violations as predicted by hyperbolic discounting. Adapted from Read et al. (2012).

186 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 15: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Persistence and Delay of Gratification

Other studies have examined persistence in situationswhere a decision maker is free to give up anytime whilewaiting through a delay. Perhaps the most well-knownexample is the delay-of-gratification paradigm (popu-larly known as the “marshmallow test”) developed byWalter Mischel and colleagues (Mischel et al., 1989). Inthese experiments, young children are told they canhave a preferred reward (for example two marshmal-lows) if they wait until the experimenter returns, or atany time they can choose instead to forgo the preferredreward in favor of a less preferred reward (one marsh-mallow) that does not require waiting. Typically, almostall children choose to begin waiting for the largerreward, but most quit before the experimenter returns,opting to take the smaller reward rather than to continuewaiting. This paradigm has received much attentionbecause it seems to capture the essential difficulty of per-sistence, and because how long children wait in this par-adigm predicts several beneficial life outcomes, such asacademic success (Mischel et al., 1989).

Behavior in the delay of gratification paradigmsuperficially resembles violations of stationarity, inthat a decision maker initially opts for a delayedreward only to abandon this choice later in favor of animmediately available alternative. However, it isimportant to note that behavior in the delay of gratifi-cation paradigm cannot be explained by hyperbolicdiscounting. First, the smaller reward in the delay ofgratification paradigm is always immediately available.Thus, the choice to quit waiting and take the smallerreward cannot be explained by an increase in the

subjective value of the smaller reward over time,unlike in the scenarios described by Ainslie(Figure 10.9). The delay of gratification paradigm isalso different in that, from the perspective of the sub-ject, the arrival time of the larger reward is completelyunknown. This uncertainty is a standard part of theparadigm, with the participant only informed in vagueterms about how long they will have to wait (e.g., theexperimenter will be gone “for a while”).

Several psychological models have been proposedto explain the failure to persist in the pursuit ofdelayed rewards as observed in the delay of gratifica-tion paradigm. Most of these models posit competingprocesses whose strengths vary over time. For exam-ple, a control process acts to promote delaying behav-ior, but this process weakens with prolonged use or incertain physiological states (Muraven and Baumeister,2000) and/or this process is less effective in the pres-ence of certain cues (Metcalfe and Mischel, 1999).Consistent with such models, adults who waited lon-ger in the delay of gratification paradigm as childrendo show evidence of stronger cognitive control pro-cesses, including increased inferior frontal activity in ago�no-go paradigm (Casey et al., 2011).

An alternative view, initially suggested by Rachlin(2000) and further developed by McGuire and Kable(2013), is that failure to persist in delaying gratificationis a potentially rational response in the face of temporaluncertainty about the delayed reward’s arrival. If thearrival time of the delayed reward is uncertain, adecision-maker’s current belief about the remainingdelay should depend on their initial expectations aboutthe delay (Figure 10.10). Under some initial

Time

Sub

ject

ive

valu

e

Time

Sub

ject

ive

valu

e

arrival time of large reward

arrival time of large reward

arrival time of small reward

small rewardavailable

Future small reward (Ainslie) Immediate small reward (Mischel)

FIGURE 10.9 Hyperbolic discounting cannot explain reversals in the delay of gratification paradigm. The preference reversals emphasizedby Ainslie (left) involve choices between two future rewards. As the arrival time of both rewards draws nearer, hyperbolic discounting causespreferences to shift towards the small reward. In Mischel’s delay of gratification paradigm (right), the small reward is always available imme-diately, so reversals cannot be explained by its value changing as time passes. If the chooser believed the large reward was coming at a fixedpoint in the future, preference for the larger reward would only grow as time passes. See Figure 10.10 for analysis of the situation where thechooser is uncertain about the larger reward’s arrival time.

187SELF-CONTROL

NEUROECONOMICS

Page 16: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

expectations, a decision-maker’s prediction about thelength of the remaining delay decreases as time passes.In this case, after starting to wait, one should never giveup, since the (predicted) discounted value of thedelayed reward is increasing as time passes. Underother initial expectations, though, a decision-maker’sprediction about the length of the remaining delay canincrease as time passes. In this situation, it can be ratio-nal to start waiting and then give up after a certainamount of time, since the (predicted) discounted valueof the delayed reward decreases as time passes.

A couple of examples help to illustrate different kindsof initial expectations and how these expectations affectthe predicted remaining delay. In one case, imagine wait-ing for a lecture that was initially scheduled for one hourto end. A reasonable initial expectation in this case canbe characterized by a Gaussian distribution (seeFigure 10.10). In other words, you might think it is mostlikely that the lecture will last an hour, but there is somepossibility that it will be longer or shorter, and times areless likely the farther away they are from an hour. Insituations where initial delay expectations are Gaussian,the predicted remaining delay decreases as time passes.In this example, your prediction about the time left in thelecture would decrease the longer the lecture has lasted.

As another case, imagine calling a company’s cus-tomer service line and being placed on hold. A

reasonable initial expectation in this case can becharacterized by a heavy-tailed distribution (seeFigure 10.10). In other words, you might think it ispossible and most likely that the wait time will be veryshort, but it is also possible that the wait time will bevery long. Heavy-tailed expectations could arise ifbeing released from hold is a random process but youare unsure about the rate of that process � for exam-ple, if there are some companies where people arereleased from hold quickly and some where people arereleased from hold slowly and you are not sure whichkind of company you have called. In situations whereinitial delay expectations are heavy-tailed, the pre-dicted remaining delay increases as time passes. In thisexample, your prediction about the remaining waittime would increase the longer you have been on hold� the longer you have been on hold, the more likely itis that you have called a company where wait timesare very long.

McGuire and Kable (2013) present several lines ofevidence that heavy-tailed expectations and lengthen-ing predicted delays could explain people’s limitedpersistence in delaying gratification. First, several the-orists have argued that people should begin withheavy-tailed expectations in situations where the haveno good information about the length of an event(Caves, 2000; Gott, 1994), which is arguably the case insituations structure liked the delay of gratification par-adigm. Furthermore, McGuire and Kable (2013) foundthat, consistent with heavy-tailed expectations, peo-ple’s predictions about the remaining delay increasedwith time already waited for both the delay of gratifi-cation paradigm and several other scenarios typicallycharacterized as involving self-control. Finally, a sim-ple temporal prediction model with reasonable para-meters was able to fully account for the empiricallyobserved distribution of wait times in the delay ofgratification task.

This view suggests a critical determinant of persis-tence may be a decision-maker’s initial temporal expec-tations. McGuire and Kable (2012b) tested whetherthese expectations and persistence behavior can beshaped by prior experience. In a simple waiting task,people experienced either sets of delays where themedian remaining delay decreased as time passed (likethe Gaussian distribution in Figure 10.10) or sets ofdelays where the median remaining delay increased astime passed (like the heavy-tailed distribution inFigure 10.10). As predicted, the former condition pro-moted persistence while the latter condition under-mined persistence. In a follow-up study, they foundthat BOLD activity in medial prefrontal cortex evolvedover a delay in a manner that paralleled the changingvalue of the delayed reward in that environment,increasing over time only in an environment where the

0

0.5

0

0.5 Current timeExpected arrival

0

0.5

0

0 5 100

0.5

0 5 100

0.5

Pro

babi

lity

dens

ity

Time (min) Time (min)

Gaussian Heavy-tailed

FIGURE 10.10 Illustration of how the expected remaining delaychanges as time passes under different initial beliefs about the uncer-tain arrival time of an outcome. The left and the right column illus-trate Gaussian and heavy-tailed beliefs about the distribution ofpossible delays. The solid line represents the current time (shown at0, 2, and 4 min). The dashed line represents the outcome’s expectedarrival time, defined as the median of the area to the right of the cur-rent time. For Gaussian beliefs, the expected remaining delay startsat 3 min and grows shorter with time. For heavy-tailed beliefs, theexpected remaining delay starts at 3 min and rises with time. FromMcGuire and Kable (2012b).

188 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 17: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

predicted remaining delay decreased with time waited(McGuire and Kable, 2012a).

These results suggest that in many situations it maynot be necessary to posit a time-varying control processto adequately explain failures to persist in delaying grati-fication. Different theories also make competing pre-dictions about a chooser’s preferences regardingprecommitment. Precommitment is when a decisionmaker eliminates the possibility of choosing an option atsome point in the future, often at some cost. The paradig-matic literary example of precommitment is Odysseustying himself to the mast of his ship so that he cannotapproach the Sirens. An everyday example is a Christmasclub, in which people put a set amount of their salaryeach month into an account that can only be used to pur-chase Christmas presents in December. Such a personprecommits to saving money they have now and onlyspending it in December, at the cost of the interest theycould accrue if they deposited the same amount into amore flexible savings account that would permit them towithdraw the money before December.

A decision maker facing temporal uncertainty, in anenvironment where the predicted remaining delay canincrease, should not want to find a way to precommitto the delayed reward, closing off all possibility ofabandoning this choice later in favor of an immedi-ately available alternative. In contrast, a decisionmaker who knows that the strength of their controlprocesses varies over time, or who knows that theyhave hyperbolic-like time preferences, may in manysituations prefer to precommit to the delayed reward ifpossible, even when such precommitment is costly.

Of course, the central importance of precommitmentas a diagnostic marker has been recognized for sometime. Gul and Pesendorfer (Gul and Pesendorfer, 2001)make a preference for precommitment the centerpiece oftheir axiomatization of temptation and self-control pre-ferences. Despite its importance, however, there havebeen very few published laboratory studies of precom-mitment (though see Ariely and Wertenbroch, 2002;Houser et al., 2010). Data from the field both confirm thata preference for precommitment exists, but also thatsuch a preference is far from universal (DellaVigna,2009). It remains to be determined whether this signifiesthat most people are unsophisticated in their ability topredict their own future behavior (O’Donoghue andRabin, 1999)� that is that they should precommit but donot � or whether instead people are reasonably keepingtheir options open in the face of uncertainty.

As this section makes clear, neuroeconomic researchin the broad domain of self-control is just beginning.Aside from paradigms already discussed, future workshould also explore the importance of other factors inpromoting patient or impatient behavior. For example,the sense of cognitive effort (Kurzban, 2010) or learned

cues (Bernheim and Rangel, 2004) can promote impul-sive behavior, while the presence of social norms(Cialdini and Goldstein, 2004) can promote self-controlled behavior. These topics have just begun to beexplored in neuroeconomics (Botvinick, 2007; McGuireand Botvinick, 2010; Montague and Lohrenz, 2007).

CONCLUSION

Intertemporal choice has been an area of intense inter-est over the last decade of neuroeconomic research.Much work in this area has focused on the basic questionof how the neural mechanisms of valuation and choiceincorporate discounting of delayed rewards. There isnow a wealth of functional imaging and single-unitrecording evidence that brain regions implicated in valu-ation across domains respond to both immediate anddelayed rewards, and that the neural response in theseregions reflects the degree of discounting present in theindividual’s behavior. Given the intensity of interest inthe area, work on intertemporal choice provides a testbed for whether neuroeconomics will yield insights notjust for neuroscience but also for psychology and eco-nomics. In this respect, it is striking that the neurosciencefindings regarding intertemporal choice question whathas often been the default assumption in psychologyand economics, that explaining hyperbolic discountingrequires some kind of dual-process model, and point theway towards alternative explanations and modelingapproaches. As work intensifies in the broader domainof self-control, a domain in which there has been muchintense debate about the basic identity and nature of therelevant psychological processes, the neuroeconomicapproach may similarly yield important insights.

References

Ahlbrecht, M., Weber, M., 1997. An empirical study of intertemporaldecision making under risk. Manag. Sci. 43 (6), 813�826.

Ainslie, G., 1975. Specious reward: a behavioral theory of impulsive-ness and impulse control. Psychol. Bull. 82 (4), 463�496.

Ainslie, G., Haendel, V., 1983. The motives of the will. In: Gottheil,E., Druley, K.A., Skoloda, T.E., Waxman, H.M. (Eds.), EtiologicAspects of Alcohol and Drug Abuse. Charles C. Thomas,Springfield, IL, pp. 119�140.

Ariely, D., Wertenbroch, K., 2002. Procrastination, deadlines, andperformance: self-control by precommitment. Psychol. Sci. 13,219�224.

Aron, A.R., Robbins, T.W., Poldrack, R.A., 2004. Inhibition and theright inferior frontal cortex. Trends Cogn. Sci. 8 (4), 170�177.

Azfar, O., 1999. Rationalizing hyperbolic discounting. J. Econ. Behav.Organ. 38 (2), 245�252.

Baker, F., Johnson, M.W., Bickel, W.K., 2003. Delay discounting incurrent and never-before cigarette smokers: similarities and dif-ferences across commodity, sign, and magnitude. J. Abnorm.Psychol. 112 (3), 382�392.

189REFERENCES

NEUROECONOMICS

Page 18: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Ballard, K., Knutson, B., 2009. Dissociable neural representations offuture reward magnitude and delay during temporal discounting.Neuroimage. 45 (1), 143�150.

Baron, J., 2000. Can we use human judgments to determine the dis-count rate? Risk Anal. 20 (6), 861�868.

Baumgartner, T., Knoch, D., Hotz, P., Eisenegger, C., Fehr, E., 2011.Dorsolateral and ventromedial prefrontal cortex orchestrate nor-mative choice. Nat. Neurosci. 14 (11), 1468�1474.

Bernheim, B.D., Rangel, A., 2004. Addiction and cue-triggered deci-sion processes. Am. Econ. Rev. 94 (5), 1558�1590.

Bickel, W.K., Marsch, L.A., 2001. Toward a behavioral economicunderstanding of drug dependence: delay discounting processes.Addiction. 96 (1), 73�86.

Bjork, J.M., Momenan, R., Hommer, D.W., 2009. Delay discountingcorrelates with proportional lateral frontal cortex volumes. Biol.Psychiatry. 65 (8), 710�713.

Boettiger, C.A., Kelley, E.A., Mitchell, J.M., D’Esposito, M., Fields,H.L., 2009. Now or later? An fMRI study of the effects of endoge-nous opioid blockade on a decision-making network. Pharmacol.Biochem. Behav. 93(3), 291–299.

Boettiger, C.A., Mitchell, J.M., Tavares, V.C., et al., 2007. Immediatereward bias in humans: fronto-parietal networks and a rolefor the catechol-O-methyltransferase 158(Val/Val) genotype.J. Neurosci. 27 (52), 14383�14391.

Botvinick, M.M., 2007. Conflict monitoring and decision making: rec-onciling two perspectives on anterior cingulate function. Cogn.Affect. Behav. Neurosci. 7 (4), 356�366.

Burks, S.V., Carpenter, J.P., Goette, L., Rustichini, A., 2009. Cognitiveskills affect economic preferences, strategic behavior, and jobattachment. Proc. Natl. Acad. Sci. U S A. 106 (19), 7745�7750.

Cai, X., Kim, S., Lee, D., 2011. Heterogeneous coding of temporallydiscounted values in the dorsal and ventral striatum during inter-temporal choice. Neuron. 69 (1), 170�182.

Casey, B.J., Somerville, L.H., Gotlib, I.H., et al., 2011. Behavioral andneural correlates of delay of gratification 40 years later. Proc.Natl. Acad. Sci. U.S.A. 108 (36), 14998�15003.

Caves, C., 2000. Predicting future duration from present age: a criti-cal assessment. Contemp. Phys. 41, 143�153.

Chib, V.S., Rangel, A., Shimojo, S., O’Doherty, J.P., 2009. Evidencefor a common representation of decision values for dissimilargoods in human ventromedial prefrontal cortex. J. Neurosci. 29(39), 12315�12320.

Chua, H.F., Ho, S.S., Jasinska, A.J., et al., 2011. Self-related neuralresponse to tailored smoking-cessation messages predicts quit-ting. Nat. Neurosci. 14 (4), 426�427.

Cialdini, R.B., Goldstein, N.J., 2004. Social influence: compliance andconformity. Annu. Rev. Psychol. 55 (1), 591�621.

DellaVigna, S., 2009. Psychology and economics: evidence from thefield. J. Econ. Lit. 47 (2), 315�372.

Dorris, M.C., Glimcher, P.W., 2004. Activity in posterior parietal cor-tex is correlated with the relative subjective desirability of action.Neuron. 44 (2), 365�378.

Ersner-Hershfield, H., Wimmer, G.E., Knutson, B., 2009. Saving forthe future self: neural measures of future self-continuity predicttemporal discounting. Soc. Cogn. Affect. Neurosci. 4 (1), 85�92.

Falk, E.B., Berkman, E.T., Mann, T., Harrison, B., Lieberman, M.D.,2010. Predicting persuasion-induced behavior change from thebrain. J. Neurosci. 30 (25), 8421�8424.

Falk, E.B., Berkman, E.T., Whalen, D., Lieberman, M.D., 2011. Neuralactivity during health messaging predicts reductions in smokingabove and beyond self-report. Health Psychol. 30 (2), 177�185.

Fellows, L.K., Farah, M.J., 2005. Dissociable elements of human fore-sight: a role for the ventromedial frontal lobes in framing thefuture, but not in discounting future rewards. Neuropsychologia.43 (8), 1214�1221.

Figner, B., Knoch, D., Johnson, E.J., et al., 2010. Lateral prefrontal cor-tex and self-control in intertemporal choice. Nat. Neurosci. 13, 4.

Fishburn, P.C., Rubinstein, A., 1982. Time preference. Int. Econ. Rev.23 (3), 677�694.

Fisher, I., 1930. The Theory of Interest. The Macmillan Company,New York.

Frederick, S., Loewenstein, G., O’Donoghue, T., 2002. Time discount-ing and time preference: a critical review. J. Econ. Lit. 40, 351�401.

Fudenberg, D., Levine, D.K., 2004. The Long-Run Versus the Short-Run Self. Working paper, Harvard University Department ofEconomics.

Gibbon, J., Church, R.M., Fairhurst, S., Kacelnik, A., 1988. Scalarexpectancy theory and choice between delayed rewards. Psychol.Rev. 95 (1), 102�114.

Glimcher, P.W., Kable, J.W., Louie, K., 2007. Neuroeconomic studiesof impulsivity: now or just as soon as possible? Am. Econ. Rev.97 (2), 142�147.

Gogtay, N., Giedd, J.N., Lusk, L., et al., 2004. Dynamic mapping ofhuman cortical development during childhood through earlyadulthood. Proc. Natl. Acad. Sci. U.S.A. 101 (21), 8174�8179.

Gott, J.I., 1994. Future prospects discussed. Nature. 308, 108.Gray, J.R., Thompson, P.M., 2004. Neurobiology of intelligence: sci-

ence and ethics. Nat. Rev. Neurosci. 5 (6), 471�482.Green, L., Fristoe, N., Myerson, J., 1994a. Temporal discounting and

preference reversals in choice between delayed outcomes.Psychon. Bull. Rev. 1 (3), 383�389.

Green, L., Fry, A.F., Myerson, J., 1994b. Discounting of delayedrewards: a life-span comparison. Psychol. Sci. 5 (1), 33�36.

Green, L., Myerson, J., 2004. A discounting framework for choicewith delayed and probabilistic rewards. Psychol. Bull. 130 (5),769�792.

Green, L., Myerson, J., Macaux, E.W., 2005. Temporal discountingwhen the choice is between two delayed rewards. J. Exp. Psychol.Learn. Mem. Cogn. 31 (5), 1121�1133.

Gul, F., Pesendorfer, W., 2001. Temptation and self-control.Econometrica. 69 (6), 1403�1435.

Harbaugh, W.T., Mayr, U., Burghart, D.R., 2007. Neural responses totaxation and voluntary giving reveal motives forcharitable donations. Science. 316 (5831), 1622�1625.

Hare, T.A., Camerer, C.F., Knoepfle, D.T., Rangel, A., 2010. Valuecomputations in ventral medial prefrontal cortex duringcharitable decision making incorporate input from regionsinvolved in social cognition. J. Neurosci. 30 (2), 583�590.

Hare, T.A., Camerer, C.F., Rangel, A., 2009. Self-control in decision-making involves modulation of the vmPFC valuation system.Science. 324 (5927), 646�648.

Hare, T.A., Malmaud, J., Rangel, A., 2011a. Focusing attention on thehealth aspects of foods changes value signals in vmPFC andimproves dietary choice. J. Neurosci. 31 (30), 11077�11087.

Hare, T.A., Schultz, W., Camerer, C.F., O’Doherty, J.P., Rangel, A.,2011b. Transformation of stimulus value signals into motor com-mands during simple choice. Proc. Natl. Acad. Sci. U.S.A. 108(44), 18120�18125.

Harvey, C.M., 1986. Value functions for infinite-period planning.Manage. Sci. 32 (9), 1123�1139.

Harvey, C.M., 1994. The reasonableness of non-constant discounting.J. Public Econ. 53 (1), 31�51.

Herrnstein, R.J., 1961. Relative and absolute strength of responses asa function of frequency of reinforcement. J. Exp. Anal. Behav. 4,267�272.

Holcomb, J.H., Nelson, P.S., 1992. Another experimental look at indi-vidual time preference. Rationality Soc. 4 (2), 199�220.

Houser D., Schunk D., Winter J., Xiao E. Temptation and commit-ment in the laboratory. Inst. Empir. Res. Econ., Univ. ZurichWork. Pap. Ser.. 2010;488:1�34.

190 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS

Page 19: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Kable, J.W., 2010. Just a little (lateral prefrontal) patience. Nat.Neurosci. 13 (5), 523�524.

Kable, J.W., 2011. Social influences on “self”-control. Decision Makingfor a Social World. Web conference, www.cognitionandculture.net.

Kable, J.W., Glimcher, P.W., 2007. The neural correlates of subjectivevalue during intertemporal choice. Nat. Neurosci. 10 (12),1625�1633.

Kable, J.W., Glimcher, P.W., 2009. The neurobiology of decision: con-sensus and controversy. Neuron. 63 (6), 733�745.

Kable, J.W., Glimcher, P.W., 2010. An “as soon as possible” effect inhuman inter-temporal decision making: behavioral evidence andneural mechanisms. J. Neurophysiol. 103, 2513�2531.

Kim, S., Hwang, J., Lee, D., 2008. Prefrontal coding of temporally dis-counted values during intertemporal choice. Neuron. 59 (1),161�172.

Kirby, K.N., Herrnstein, R.J., 1995. Preference reversals due to myo-pic discounting of delayed reward. Psychol. Sci. 6 (2), 83�89.

Kirby, K.N., Petry, N.M., 2004. Heroin and cocaine abusers havehigher discount rates for delayed rewards than alcoholics or non-drug-using controls. Addiction. 99 (4), 461�471.

Kirby, K.N., Petry, N.M., Bickel, W.K., 1999. Heroin addicts havehigher discount rates for delayed rewards than non-drug-usingcontrols. J. Exp. Psychol. Gen. 128 (1), 78�87.

Kurth-Nelson, Z., Redish, A.D., 2010. A reinforcement learningmodel of precommitment in decision making. Front. Behav.Neurosci. 4, 184.

Kurzban, R., 2010. Why Everyone (else) is a Hypocrite: Evolutionand the Modular Mind. Princeton University Press, Princeton.

Laibson, D., 1997. Golden eggs and hyperbolic discounting. Q. J.Econ. 443�477.

Levy, D.J., Glimcher, P.W., 2012. The root of all value: a neural com-mon currency for choice. Curr. Opin. Neurobiol.

Levy, D.J., Glimcher, P.W., 2011. Comparing apples and oranges:using reward-specific and reward-general subjective value repre-sentation in the brain. J. Neurosci. 31 (41), 14693�14707.

Levy, I., Snell, J., Nelson, A.J., Rustichini, A., Glimcher, P.W., 2010.Neural representation of subjective value under risk and ambigu-ity. J. Neurophysiol. 103 (2), 1036�1047.

Loewenstein G., O’Donoghue T., 2004. Animal Spirits: Affective andDeliberative Processes in Economic Behavior. Working paper,Carnegie Mellon University Department of Social and DecisionSciences.

Louie, K., Glimcher, P.W., 2010. Separating value from choice: delaydiscounting activity in the lateral intraparietal area. J. Neurosci.30 (16), 5498�5507.

Mazas, C.A., Finn, P.R., Steinmetz, J.E., 2000. Decision-making biases,antisocial personality, and early-onset alcoholism. Alcohol. Clin.Exp. Res. 24 (7), 1036�1040.

Mazur, J.E., 1987. An adjusting procedure for studying delayed rein-forcement. In: Commons, M.L., Mazur, J.E., Nevin, J.A., Rachlin,H. (Eds.), Quantitative Analysis of Behavior: The Effects of Delayand Intervening Events on Reinforcement Value, vol. 5. LawrenceErlbaum Associates, Publishers, Hillsdale, Nj, pp. 55�73.

McClure, S.M., Ericson, K.M., Laibson, D.I., Loewenstein, G., Cohen,J.D., 2007. Time discounting for primary rewards. J. Neurosci. 27(21), 5796�5804.

McClure, S.M., Laibson, D.I., Loewenstein, G., Cohen, J.D., 2004.Separate neural systems value immediate and delayed monetaryrewards. Science. 306 (5695), 503�507.

McGuire, J.T., Botvinick, M.M., 2010. Prefrontal cortex, cognitive con-trol, and the registration of decision costs. Proc. Natl. Acad. Sci.U.S.A. 107 (17), 7922�7926.

McGuire J.T., Kable J.W., 2012a. Ventromedial prefrontal cortex anddecisions to sustain delay of gratification. Paper presented at:Society for Neuroeconomics, Evanston, IL.

McGuire, J.T., Kable, J.W., 2012b. Decision makers calibrate behav-ioral persistence on the basis of time-interval experience.Cognition. 124 (2), 216�226.

McGuire J.T., Kable J.W., 2013. Rational temporal predictions canunderlie apparent failures to delay gratification. Psychol. Rev.120, 395–410.

Metcalfe, J., Mischel, W., 1999. A hot/cool-system analysis of delayof gratification: dynamics of willpower. Psychol. Rev. 106 (1),3�19.

Mischel, W., Shoda, Y., Rodriguez, M.I., 1989. Delay of gratificationin children. Science. 244 (4907), 933�938.

Mitchell, J.M., Tavares, V.C., Fields, H.L., D’Esposito, M., Boettiger, C.A., 2007. Endogenous opioid blockade and impulsive respondingin alcoholics and healthy controls. Neuropsychopharmacology. 32(2), 439�449.

Mitchell, J.P., Schirmer, J., Ames, D.L., Gilbert, D.T., 2010. Medialprefrontal cortex predicts intertemporal choice. J. Cogn. Neurosci.23 (4), 857�866.

Montague, P.R., Lohrenz, T., 2007. To detect and correct: norm viola-tions and their enforcement. Neuron. 56 (1), 14�18.

Monterosso, J.R., Ainslie, G., Xu, J., Cordova, X., Domier, C.P.,London, E.D., 2007. Frontoparietal cortical activity ofmethamphetamine-dependent and comparison subjectsperforming a delay discounting task. Hum. Brain Mapp. 28 (5),383�393.

Muraven, M., Baumeister, R.F., 2000. Self-regulation and depletion oflimited resources: does self-control resemble a muscle? Psychol.Bull. 126 (2), 247�259.

Nee, D.E., Wager, T.D., Jonides, J., 2007. Interference resolution:insights from a meta-analysis of neuroimaging tasks. Cogn.Affect. Behav. Neurosci. 7 (1), 1�17.

O’Donoghue, T., Rabin, M., 1999. Doing it now or later. Am. Econ.Rev. 89 (1), 103�124.

Peters, J., Buchel, C., 2009. Overlapping and distinct neural systemscode for subjective value during intertemporal and risky decisionmaking. J. Neurosci. 29 (50), 15727�15734.

Peters, J., Buchel, C., 2010. Episodic future thinking reduces rewarddelay discounting through an enhancement of prefron-tal�mediotemporal interactions. Neuron. 66 (1), 138�148.

Petry, N.M., 2001. Pathological gamblers, with and without substanceuse disorders, discount delayed rewards at high rates. J. Abnorm.Psychol. 110 (3), 482�487.

Petry, N.M., Bickel, W.K., Arnett, M., 1998. Shortened time horizonsand insensitivity to future consequences in heroin addicts.Addiction. 93 (5), 729�738.

Pine, A., Seymour, B., Roiser, J.P., et al., 2009. Encoding of marginalutility across time in the human brain. J. Neurosci. 29 (30),9575�9581.

Plassmann, H., O’Doherty, J., Rangel, A., 2007. Orbitofrontal cortexencodes willingness to pay in everyday economic transactions.J. Neurosci. 27 (37), 9984�9988.

Platt, M.L., Glimcher, P.W., 1999. Neural correlates of decision vari-ables in parietal cortex. Nature. 400 (6741), 233�238.

Rachlin, H., 2000. The Science of Self-Control. Harvard UniversityPress, Cambridge, Mass.

Rangel, A., Hare, T., 2010. Neural computations associated withgoal-directed choice. Curr. Opin. Neurobiol. 20 (2), 262�270.

Read, D., 2001. Is time-discounting hyperbolic or subadditive? J. RiskUncertain. 23 (1), 5�32.

Read, D., Frederick, S., Airoldi, M., 2012. Four days later inCincinnati: longitudinal tests of hyperbolic discounting. ActaPsychol. (Amst.). 140 (2), 177�185.

Read, D., Roelofsma, P.H.M.P., 2003. Subadditive versus hyperbolicdiscounting: a comparison of choice and matching. Organ. Behav.Hum. Decis. Process. 91 (2), 140�153.

191REFERENCES

NEUROECONOMICS

Page 20: Chapter 10. Valuation, Intertemporal Choice, and …...CHAPTER 10 Valuation, Intertemporal Choice, and Self-Control Joseph W. Kable OUTLINE Introduction 173 Valuation in Intertemporal

Richards, J.B., Mitchell, S.H., de Wit, H., Seiden, L.S., 1997.Determination of discount functions in rats with an adjusting-amount procedure. J. Exp. Anal. Behav. 67 (3), 353�366.

Samuelson, P., 1937. A note on the measurement of utility. Rev.Econ.Stud. 4, 155�161.

Sayman, S., Onculer, A., 2009. An investigation of time inconsistency.Manage. Sci. 55 (3), 470�482.

Scholten, M., Read, D., 2006. Discounting by intervals: a generalizedmodel of intertemporal choice. Manage. Sci. 52 (9), 1424�1436.

Sellitto, M., Ciaramelli, E., di Pellegrino, G., 2010. Myopic discount-ing of future rewards after medial orbitofrontal damage inhumans. J. Neurosci. 30 (49), 16429�16436.

Shamosh, N.A., DeYoung, C.G., Green, A.E., et al., 2008. Individual dif-ferences in delay discounting: relation to intelligence, working mem-ory, and anterior prefrontal cortex. Psychol. Sci. 19 (9), 904�911.

Soman, D., Ainslie, G., Frederick, S., et al., 2005. The psychology ofintertemporal discounting: why are distant events valued differ-ently from proximal ones? Mark. Lett. 16 (3�4), 347�360.

Sozou, P.D., 1998. On hyperbolic discounting and uncertain hazardrates. Proc. R. Soc. Lond. B Biol. Sci. 265, 2015�2020.

Stephens, D.W., 2002. Discrimination, discounting and impulsivity: arole for an informational constraint. Philos. Trans. R. Soc. Lond. BBiol. Sci. 357 (1427), 1527�1537.

Strotz, R.H., 1956. Myopia and inconsistency in dynamic utility maxi-mization. Rev. Econ. Stud. 23 (3), 165�180.

Tanaka, S.C., Doya, K., Okada, G., Ueda, K., Okamoto, Y.,Yamawaki, S., 2004. Prediction of immediate and future rewardsdifferentially recruits cortico-basal ganglia loops. Nat. Neurosci. 7(8), 887�893.

Thaler, R.H., Shefrin, H.M., 1981. An economic theory of self-control.J. Pol. Econ. 89 (2), 392�406.

Thorndike, E.L., 1904. An Introduction to the Theory of Mental andSocial Measurements. Science Press, Oxford.

Tom, S.M., Fox, C.R., Trepel, C., Poldrack, R.A., 2007. The neuralbasis of loss aversion in decision-making under risk. Science. 315(5811), 515�518.

Vickery, T.J., Chun, M.M., Lee, D., 2011. Ubiquity and specificity ofreinforcement signals throughout the human brain. Neuron. 72(1), 166�177.

Volkow, N.D., Fowler, J.S., 2000. Addiction, a disease of compulsionand drive: involvement of the orbitofrontal cortex. Cereb. Cortex.10 (3), 318�325.

Volkow, N.D., Fowler, J.S., Wang, G.J., 2003. The addicted humanbrain: insights from imaging studies. J. Clin. Invest. 111 (10),1444�1451.

Volkow, N.D., Li, T.K., 2004. Drug addiction: the neurobiologyof behaviour gone awry. Nat. Rev. Neurosci. 5 (12),963�970.

Wager, T.D., Smith, E.E., 2003. Neuroimaging studies of workingmemory: a meta-analysis. Cogn. Affect. Behav. Neurosci. 3 (4),255�274.

Weber, B.J., Huettel, S.A., 2008. The neural substrates of probabilisticand intertemporal decision making. Brain Res. 1234, 104�115.

Weber, E.U., Johnson, E.J., Milch, K.F., Chang, H., Brodscholl, J.C.,Goldstein, D.G., 2007. Asymmetric discounting in intertemporalchoice: a query-theory account. Psychol. Sci. 18 (6), 516�523.

Weller, R.E., Cook 3rd, E.W., Avsar, K.B., Cox, J.E., 2008. Obesewomen show greater delay discounting than healthy-weightwomen. Appetite. 51 (3), 563�569.

Wittmann, M., Leland, D.S., Paulus, M.P., 2007. Time and decisionmaking: differential contribution of the posterior insular cortexand the striatum during a delay discounting task. Exp. Brain Res.179 (4), 643�653.

Wunderlich, K., Rangel, A., O’Doherty, J.P., 2009. Neural computa-tions underlying action-based decision making in the humanbrain. Proc. Natl. Acad. Sci. 106 (40), 17199�17204.

192 10. VALUATION, INTERTEMPORAL CHOICE, AND SELF-CONTROL

NEUROECONOMICS