A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic...

118
A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer and Information Science July 7, 2009 ACM EC

Transcript of A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic...

Page 1: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A tutorial on: Dynamic Mechanism

DesignRuggiero Cavallo

University of PennsylvaniaDepartment of Computer and

Information Science

July 7, 2009ACM EC

Page 2: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

The setting

• Sequence of decisions to be made impacting utility experienced by a group of agents

• Social planner wants to make optimal choices

agents hold private valuation information

knowledge of private information required to determine optimal decision at every point in time

new private information potentially arrives after each decision

2

Page 3: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Example: a resource – say a government-owned super-computer – is to be allocated repeatedly for 1 week intervals.

3

Problem: Agents’ goals differ from center’s goals, but agent cooperation is essential.

Page 4: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

The solution framework

• Dynamic mechanism design: specification of payment schemes such that optimal outcomes are achieved in equilibrium.

An extension/generalization of “static” mechanism design.

4

Page 5: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Does static MD really fall short?

• Yes. Rare is the decision scenario that is completely independent of future decisions.

• E.g., allocating a resource. What future opportunities will there be for procuring the resource? What opportunities for reselling the resource?

5

Page 6: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Tutorial plan

1. Rudimentary review of static mechanism design.

2. Dynamic MD basics: modeling of “types” in dynamic settings, dynamic equilibrium notions.

3. Key solutions so far.

4. Extensions.

6

Page 7: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Static mechanism design

Just social-welfare maximizing, here

7

Page 8: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• Many of the marquee static MD results have (more complicated) analogs in the dynamic setting.

• So start with quick review of static case...

8

Page 9: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

9

Page 10: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

10

private informationdecision,payments

Mechanism Design

Page 11: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

11

Mechanism design

• Specify decision rule (outcome selection), plus a monetary charge/payment imposed on each agent.

• Outcome/payments enforced by a center.

• Criteria for success:

social-welfare maximizing (a.k.a., efficient)

individual rational (no agent worse off)

budget properties

Page 12: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Solution concepts• Strategyproof: reporting true type is always a

utility-maximizing strategy, regardless of what other agents do.

• Ex post incentive compatible: reporting true type is utility-maximizing, whatever the types of other agents, assuming they’re truthful. [same as strategyproof in private values setting]

• Bayes-Nash incentive compatible: reporting true type is utility-maximizing, in expectation given distribution over others’ types, assuming other agents are truthful.

12

Page 13: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Efficient static mechanisms: the Groves class

• Choose outcome that is social welfare maximizing according to agent reports.

• Pay each agent the (combined) reported value of all other agents for the chosen outcome... minus some quantity independent of the agent’s report.

[Vickrey, 1961; Clarke, 1971; Groves 1973]

13

Page 14: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

14

$10

$10

$10

Page 15: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Groves (and nothing else) works• The set of Groves mechanisms exactly

corresponds to those that are efficient in dominant strategies.*

[Green & Laffont, 77], strengthened by [Holmstrom, 79]

Our freedom is limited to defining the agent-independent “charge” term

*For sufficiently rich domains (“for all practical purposes”).

15

Page 16: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Efficient mechanism design boiled down

1. Align incentives – make each agent’s payoff equal to social welfare.

2. Recover funds – have each agent make a payment independent of his behavior.

16

Page 17: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Efficient mechanism design boiled down

1. Align incentives – make each agent’s payoff equal to social welfare.

2. Recover funds – have each agent make a payment independent of his behavior.

16

A little more subtle in dynamic setting...

Page 18: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

The VCG mechanism• A Groves mechanism.

• Defines “charge” term for each agent i equal to the value other agents could have obtained if i’s interests were ignored.

Each agent’s utility equals contribution to social welfare.

Ex post individual rational (if agents have non-negative values for all outcomes)

No-deficit... in fact often yields high revenue.17

Page 19: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

The expected externality (AGV) mechanism[Arrow, 79; d’Aspremont & Gerard-Varet, 79]

• Each agent’s payment is expected social welfare others will get given his report, minus some uninfluencable quantity.

• Efficient in Bayes-Nash equilibrium.

• Ex ante individual rational.

• Strongly budget-balanced.

18

Page 20: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Redistribution mechanisms

• AGV maintains all value with agents, but is only weakly efficient and IR, unlike VCG.

• Idea: try to return revenue under VCG back to agents – thus improving social welfare – without weakening equilibrium or running deficit.

• First reference: [Bailey, 97] for certain allocation settings.

19

Page 21: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Redistribution mechanism[Cavallo, 06]Idea: leverage domain information to obtain “revenue-guarantees”.

For each agent i, compute minimum revenue that i could cause to result, given reports of other agents (Gi).

Run VCG.

Give each i payment of Gi/n.

Applicable to any setting (e.g., combinatorial allocation). In single-item allocation, coincides with [Bailey, 97] mechanism.

20

Page 22: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

21

0

10

20

30

40

50

60

70

80

90

100

3 4 5 6 7 8 9 10

% value retained by agents

number of agents

dynamic-VCGdynamic-RM

Page 23: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Lots of interesting recent work for case of multi-unit auctions

• [Guo & Conitzer, 07; Moulin, 2007] – worst-case optimality.

• [Guo & Conitzer, 08] – optimal-in-expectation mechanism.

• [Hartline & Roughgarden, 2008] – money burning when payments not possible.

• [de Clippel, Naroditskiy, Greenwald, here].

22

Page 24: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Much not discussed here, e.g.,

• Interdependent (“common”) values settings

• Inefficient mechanism design, concerned with, e.g., revenue maximization, maximizing the minimum utility, etc.

23

Page 25: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Basics of the dynamic setting

24

Page 26: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Aspects of the problem• At each time period each agent holds some private

information (“local state”).

• At each time period, the center selects an action to execute, which generates value (of varying degree) for agents and yields new local states.

• The (predicted) effects of taking any given action depend on state.

• Agents perceive utility of value x obtained k steps in future to be γk x, for some 0 < γ ≤ 1.

Key variable: local state.

25

Page 27: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Local state

• Encapsulates all information required to determine:

a conditional distribution over value the agent would obtain for every possible action

a conditional distribution over future local states for every possible action

26

Page 28: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Assumption

• Given the action executed by the center, value obtained and subsequent local state for each agent are independent of other agents’ local states.

Dynamic version of private values.

27

Page 29: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Markov decision processes (MDPs)• State space

• Action space

• Reward function

When a given action is taken in a given state, what value results?

• Non-deterministic transition function

When a given action is taken in a given state, what new state results?

28

Page 30: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

29

10

1/43/4

1/2

20

10020

01/2 1/2

600

01/2

20

• Two possible actions (red and blue).• Two time periods.

Page 31: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

So what’s a dynamic type?

• It’s an MDP.

Transition dynamics between local states.

Value function for state-action pairs.

Indicator of “current” state.

30

Page 32: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• In static setting, type is “complete” and reportable. In dynamic setting, type is gradually revealed to the agent by nature over time.

• It’s not the multiple time steps alone, it’s the uncertainty.

• If types are MDPs with no stochastic state transitions, we’re in a static MD setting – just decide policy at time 0.

31

Page 33: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A simple (and important) special case: MABs

• Multi-armed bandit problems: a special case of the general sequential decision-making framework.

• Captures, e.g., single-item repeated allocation scenarios.

32

Page 34: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A simple (and important) special case: MABs

• Each agent’s dynamics can be represented by a Markov chain: no multiplicity of actions.

• A single action associated with each agent. When an agent’s action is chosen, his state changes; otherwise, it doesn’t.

33

Page 35: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Captures, e.g., repeated allocation of a resource

34

0

3/4 1/4

1/2 1/230

30300

0

1/4 3/4

1/2 1/220

20200

Page 36: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Captures, e.g., repeated allocation of a resource

35

0

3/4 1/4

1/2 1/230

30300

0

1/4 3/4

1/2 1/220

20200

Allocate to agent 1,who finds no value.

Page 37: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

36

DMD setup• There is a set of actions.

• Each agent has a type represented by an MDP.

• In each period agents report types and the center takes an action.

• A dynamic mechanism specifies two things:

a decision policy: a function that maps a joint type to an action.

a transfer function: that maps a joint type to a payment for each agent.

Page 38: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Basics of the dynamic setting: equilibrium

concepts

37

Page 39: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Within-period ex post Nash equilibrium

If all other agents play the equilibrium strategy in the future, no agent can benefit from deviating – regardless of what the joint state is and regardless of what came before.

38

Page 40: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Within-period ex post incentive compatibility

If all other agents report types truthfully in the future, no agent can benefit from misreporting type – regardless of what the joint type is and regardless of what came before.

39

No incentive to deviate even if agents know everything one can know – without being able to see the future.

Page 41: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

This is the gold standard

• In a dynamic setting, agents needs to make predictions about the future in determining how to maximize utility – and this requires positing some behavior for other agents.

• Weaker than dominant strategy.

• But if others’ future types were irrelevant to the agent’s utility, incentives couldn’t possibly be aligned.

40

Page 42: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Bayes-Nash equilibrium

Given distribution over other agents’ types, no agent can expect to gain from deviating if others don’t.

Within-period ex post also involves expectation, but expectation is over uncertain type transitions, not current types.

41

Page 43: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Mechanism desiderata• Efficiency: social-welfare maximizing decisions

achieved in equilibrium.

• Individual rationality: no agent expects to lose from participating.

Within-period ex post: at every time-step, for every joint type.

Ex ante: from beginning of the mechanism, for whatever the joint type is then.

• Budget-balance / no-deficit.

42

Page 44: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

By the way...

• A dynamic analog of the revelation principle holds [Myerson, 1986].

• So we can think only about direct revelation mechanisms, without loss of generality.

43

Page 45: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Some solutions so far

44

Page 46: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A basic efficient dynamic mechanism

•Dynamic team mechanism [Athey & Segal, 07]

Follows efficient policy given agent reports.

In each period, pays each agent the expected immediate value obtained by other agents given reported types (“Groves payment”).

45

Page 47: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic team mechanismexample

200

1/43/4

1/40

020

0 1/2 1/2600

03/4

0

20 20

0

100

Agent 2

Agent 1

20

200

0 9/10

1/10

A B C

J K L

D E F G H I

M N O P Q

*

*

46

Page 48: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic team mechanismexample

optimal policy (γ close to 1): * → blueAJ → red or blueBJ → red or blueCK → redCL → blue

200

1/43/4

1/40

020

0 1/2 1/2600

03/4

0

20 20

0

100

Agent 2

Agent 1

20

200

0 9/10

1/10

A B C

J K L

D E F G H I

M N O P Q

*

*

47

Page 49: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic team mechanismexample

• T1(*) = 0, T2(*) = 0

200

1/43/4

1/40

020

0 1/2 1/2600

03/4

0

20 20

0

100

Agent 2

Agent 1

20

200

0 9/10

1/10

A B C

J K L

D E F G H I

M N O P Q

*

*

48

Page 50: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic team mechanismexample

• T1(*) = 0, T2(*) = 0• T1(CL) = 100, T2(CL) = 0

200

1/43/4

1/40

020

0 1/2 1/2600

03/4

0

20 20

0

100

Agent 2

Agent 1

20

200

0 9/10

1/10

A B C

J K L

D E F G H I

M N O P Q

*

*

49

Page 51: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic team mechanism

Theorem: The dynamic team mechanism is truthful and efficient in within-period ex post Nash equilibrium.

[Athey & Segal, 07]

50

Page 52: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-Groves mechanism class

• Follows efficient policy given agent reports; defines payments such that:

Each agent’s expected sum of payments when he follows strategy σ equals the expected value other agents obtain when he follows σ, minus some quantity independent of σ.

51

Page 53: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-Groves mechanism class

52

Theorem: Every dynamic-Groves mechanism is truthful and efficient in within-period ex post Nash equilibrium.[Cavallo, Parkes, & Singh, 07]

Proof: Each agent obtains social utility (aligns incentives) minus some constant (doesn’t distort).

Page 54: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-Groves: all efficient mechanisms

Theorem: For unrestricted types, the dynamic-Groves class exactly corresponds to the history-independent dynamic mechanisms that are truthful and efficient in within-period ex post Nash equilibrium. [Cavallo, 08]

For within-period ex post efficient (and history-independent) dynamic mechanism design, dynamic-Groves is the only game in town.

53

Page 55: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-Groves: all efficient mechanisms

Theorem: For unrestricted types, the dynamic-Groves class exactly corresponds to the history-independent dynamic mechanisms that are truthful and efficient in within-period ex post Nash equilibrium. [Cavallo, 08]

Generalizes [Green & Laffont, 77] (Groves class unique for static settings).

Proof idea: If non-Groves, there is always some type for which incentives are sufficiently distorted from efficiency.

54

Page 56: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Budget & participation

• Given characterization theorem, if we demand efficiency in strongest sense, we know what the possibilities are.

• Now pick mechanisms in class with desirable budget/participation properties.

basic “team mechanism” won’t fly – extreme budget imbalance

need to recover payments...

55

Page 57: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Recovering payments: ex ante charge (EAC)

Charge agents some quantity computed “ex ante” of anything they report.

56

Page 58: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Recovering payments: ex ante charge (EAC)• At every time-step:

Choose efficient decision given reported types.

Make Groves payments.

Charge each agent a quantity based only on the reported types of other agents in the first time-step: (1-γ) times total value other agents would obtain, in expectation from beginning of mechanism, if policy optimal for them was chosen.

[Cavallo, Parkes, & Singh, 06]

Chapter 5: Dynamic mechanism design 92

The dynamic-basic-Groves mechanism pays each agent the (reported) value ob-tained by other agents, and it is thus a dynamic-Groves mechanism where the con-stant C charge functions are null. But we can see that any mechanism that modifiesdynamic-basic-Groves by imposing a charge on each agent at each time period that isindependent of anything that agent has ever reported will also be a dynamic-Grovesmechanism. We can specify the charge for each agent in a way such that, in expecta-tion from the beginning of the mechanism, a deficit will not result and at the sametime agent payo!s will be non-negative. I will refer to this as the dynamic-EAC (exante charge) mechanism:

Definition 5.12 (dynamic-EAC). The dynamic-EAC mechanism executes de-cision policy !! and, !i " I and "t " ", transfers:

Ti("t) = r"i("

t"i, !

!("t)) # (1 # #)V"i("0"i) (5.39)

Theorem 5.4. The dynamic-EAC mechanism is truthful and e!cient in within-period ex post Nash equilibrium, ex ante individual rational, and ex ante no-deficit.

Proof. First observe that for any i " I, "0 " ", and any two strategies $#i and

$##i , (1 # #)V"i("0

"i, !!"i, $

#i) = (1 # #)V"i("0

"i, !!"i, $

##i ), since V"i("0

"i) depends onlyon the states of other agents at the beginning of the mechanism, which i cannotpossibly impact. Thus dynamic-EAC is a dynamic-Groves mechanism with Ci("t) =(1##)V"i("0

"i), !"t. Therefore, by Theorem 5.3, dynamic-EAC is truthful and e#cientin within-period ex post Nash equilibrium.

The mechanism is also ex ante IR. Consider the truthful reporting strategy profile.In expectation from the beginning of the mechanism the reward obtained intrinsicallyby any agent plus the payment he receives is greater than the charge he must pay:

E

!

K"

k=0

#k#

ri("ki , !

!("k)) + r"i("k"i, !

!("k)) # (1 # #)V"i("0"i)

$%

%

%"0, !!

&

(5.40)

= Vi("0) + V"i("

0) # V"i("0"i) (5.41)

= V ("0) # V"i("0"i) (5.42)

$ 0, (5.43)

where the final inequality holds by optimality of !!.Finally, note that in expectation from the beginning of the mechanism, in the

truthful equilibrium the payments made by the mechanism to any agent i equalV"i("0) # V"i("0

"i). V"i("0"i) $ V"i("0) by optimality (for agents other than i) of !!

"i,and thus the mechanism is ex ante no-deficit

57

Page 59: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-EAC mechanismexample

• γ = 0.9• V1(*-2) = 50 + γ10 = 59• V2(*-1) = γ90 = 81

• T1(*) = -8.1, T2(*) = -5.9• T1(CL) = 100 - 8.1, T2(CL) = -5.9• ...

200

1/40

0 30

03/4

0

20 20

0

100

Agent 2

Agent 1

20 0 9/10

1/10

A B C

J K L

E F G I

M P Q

*

*

10

58

0

N

Page 60: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Recovering payments: ex ante charge (EAC)

Theorem: The dynamic-EAC mechanism is truthful and efficient in within-period ex post Nash equilibrium, ex ante individual rational, and ex ante no-deficit.

[Cavallo, Parkes, & Singh, 06]

59

Page 61: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Weak IR and budget-balance properties

• With dynamic-EAC scheme agents will “sign up” at beginning of mechanism, but may wish to back out...

• Same for center.

• Can we strengthen?

60

Page 62: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG[Bergemann & Valimaki, 08]

• At each time-step, pay each agent i the expected value other agents would obtain if i were ignored after one step, minus the value they’d obtain if i were always ignored.

Each agent has to pay the amount he inhibits other agents from obtaining value (now and in the future) by his current report.

61

Page 63: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG[Bergemann & Valimaki, 08]

• At each time-step, pay each agent i the expected value other agents would obtain if i were ignored after one step, minus the value they’d obtain if i were always ignored.

Chapter 5: Dynamic mechanism design 93

There are a couple additional interesting things to note about this mechanism.First, the properties (incentives and budget) do not depend at all on when the chargeterms imposed on the agents are executed, as long as they are scaled appropriatelyaccording to the discount factor. For instance, the mechanism could just as well havebeen defined to have each agent i pay V!i(!0

!i) at time 0 with no further charges inthe periods to follow.

Also, if the initial state !0 is common knowledge (and only the realized statetransitions are private), each i can be charged (1!")V!i(!0) rather than (1!")V!i(!0

!i)each period; Theorem 5.4 continues to hold, and in fact the expected revenue from thebeginning of the mechanism equals 0, since the expected aggregate value of paymentsmade to any agent are 0.

5.5 The dynamic-VCG mechanism

The results of the previous section are positive, but leave room for improvement.In many scenarios the distinction between ex ante individual rationality or no-deficitand within-period ex post individual rationality or guaranteed no-deficit will be sig-nificant. The mechanism I present in this chapter strengthens the dynamic-EACmechanism in just these ways. Bergemann & Valimaki’s [2006] dynamic-VCG mech-anism, we will see, is e!cient, IC, and IR in within-period ex post Nash equilibrium,and is also no-deficit. I will demonstrate e!ciency and IC, again, by showing thatdynamic-VCG is a dynamic-Groves mechanism and then referring to Theorem 5.3.The nature of the proof will at the same time demonstrate the IR and no-deficitproperties of the mechanism.

Finally, the revenue a mechanism generates—or, how much of the value from asequence of decisions is acquired by the center rather than kept by the agents—is alsoan important evaluation metric. Of course in many business settings a mechanismdesigner would seek to implement a mechanism in which revenue is high, extracting asmuch value as possible; I will show that dynamic-VCG is optimal here (if e!ciency isrequired). In the next chapter I follow the approach of Chapter 3 and try to minimizerather than maximize revenue.

Definition 5.13 (dynamic-VCG). [Bergemann and Valimaki, 2006] Thedynamic-VCG mechanism executes policy #" and, "i # I and !t # ", transfers:

Ti(!t) = r!i(!

t!i, #

"(!t)) + "E[V!i($(!t!i, #

"(!t)))] ! V!i(!t!i) (5.44)

Recall that V!i(!t!i) denotes V!i(!t

i, !t!i, #

"!i, %i), and thus E[V!i($(!t

!i, #"(!t)))]

is the expected value that agents other than i would obtain from a policy that isoptimized for them forward from the joint type that results when the socially optimal

62

Page 64: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG mechanismexample

• γ = 0.9

• T1(*) = γ90 - γ90• T2(*) = γ30 - (50 + γ10)

• T1(CL) = 100 - 100• T2(CL) = 0 - 30• ...

200

1/40

0 30

03/4

0

20 0

0

100

Agent 2

Agent 1

20 0 9/10

1/10

A B C

J K L

E F G I

M P Q

*

*

10

63

0

N

Page 65: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG[Bergemann & Valimaki, 08]

• No payment to any agent in any period is positive.

• Expected future payoff to every agent i, from any joint state, at any time t, is:

64(NB: assumes no negative values)

Chapter 5: Dynamic mechanism design 93

There are a couple additional interesting things to note about this mechanism.First, the properties (incentives and budget) do not depend at all on when the chargeterms imposed on the agents are executed, as long as they are scaled appropriatelyaccording to the discount factor. For instance, the mechanism could just as well havebeen defined to have each agent i pay V!i(!0

!i) at time 0 with no further charges inthe periods to follow.

Also, if the initial state !0 is common knowledge (and only the realized statetransitions are private), each i can be charged (1!")V!i(!0) rather than (1!")V!i(!0

!i)each period; Theorem 5.4 continues to hold, and in fact the expected revenue from thebeginning of the mechanism equals 0, since the expected aggregate value of paymentsmade to any agent are 0.

5.5 The dynamic-VCG mechanism

The results of the previous section are positive, but leave room for improvement.In many scenarios the distinction between ex ante individual rationality or no-deficitand within-period ex post individual rationality or guaranteed no-deficit will be sig-nificant. The mechanism I present in this chapter strengthens the dynamic-EACmechanism in just these ways. Bergemann & Valimaki’s [2006] dynamic-VCG mech-anism, we will see, is e!cient, IC, and IR in within-period ex post Nash equilibrium,and is also no-deficit. I will demonstrate e!ciency and IC, again, by showing thatdynamic-VCG is a dynamic-Groves mechanism and then referring to Theorem 5.3.The nature of the proof will at the same time demonstrate the IR and no-deficitproperties of the mechanism.

Finally, the revenue a mechanism generates—or, how much of the value from asequence of decisions is acquired by the center rather than kept by the agents—is alsoan important evaluation metric. Of course in many business settings a mechanismdesigner would seek to implement a mechanism in which revenue is high, extracting asmuch value as possible; I will show that dynamic-VCG is optimal here (if e!ciency isrequired). In the next chapter I follow the approach of Chapter 3 and try to minimizerather than maximize revenue.

Definition 5.13 (dynamic-VCG). [Bergemann and Valimaki, 2006] Thedynamic-VCG mechanism executes policy #" and, "i # I and !t # ", transfers:

Ti(!t) = r!i(!

t!i, #

"(!t)) + "E[V!i($(!t!i, #

"(!t)))] ! V!i(!t!i) (5.44)

Recall that V!i(!t!i) denotes V!i(!t

i, !t!i, #

"!i, %i), and thus E[V!i($(!t

!i, #"(!t)))]

is the expected value that agents other than i would obtain from a policy that isoptimized for them forward from the joint type that results when the socially optimalr!i(!t

!i,""(!t)) + # [V!i($(!t

!i,""(!t)))] ! V!i(!t

!i)

V (!t)" V!i(!t!i) # 0

1

r!i(!t!i,"

"(!t)) + # [V!i($(!t!i,"

"(!t)))] ! V!i(!t!i)

V (!t)" V!i(!t!i) # 0

1

Page 66: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG[Bergemann & Valimaki, 08]

Theorem: The dynamic-VCG mechanism is truthful and efficient in within-period ex post Nash equilibrium, within-period ex post individual rational, and ex post no-deficit.

65

Page 67: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG: good social-welfare?

66

0

10

20

30

40

50

60

70

80

90

100

3 4 5 6 7 8 9 10

% value retained by agents

number of agents

dynamic-VCG

In a single-item allocation setting, with values normally distributed.

Page 68: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG: good social-welfare?

Theorem: Among all history-independent mechanisms that are efficient in within-period ex post Nash equilibrium and within-period ex post individual rational, dynamic-VCG yields the most expected revenue, for every joint type.

[Cavallo, 08]

67

Page 69: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-VCG: good social-welfare?

• Since dynamic-VCG can be so bad for the agents, what do we do?

• Think back to the static setting... better budget balance was achieved by redistribution mechanisms; strong budget-balance by moving to Bayes-Nash equilibrium.

68

Page 70: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A dynamic redistribution mechanism?• Redistribution much more complicated in

the dynamic setting. Now redistribution payment computed in later time periods can potentially be influenced via an agent’s reports in earlier periods... in subtle ways.

Focus on worlds representable as multi-armed bandits.

69

Page 71: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

70

Dynamic-VCG for MABs reduces to:

• Determine optimal agent i to activate.

i pays (1-γ) times the expected value other agents would get if i were always ignored.

Other agents pay nothing.

Page 72: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

71

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

Page 73: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

72

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

T1 = -(1-γ) (10 + γ10)

Page 74: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

73

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

T1 = -(1-γ) (10 + γ10)

Page 75: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

74

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

T1 = -(1-γ) (10 + γ10)

T2 = -(1-γ) 7.5

Page 76: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

75

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

T1 = -(1-γ) (10 + γ10)

T2 = -(1-γ) 7.5

Page 77: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

76

Dynamic-VCG for MABs:• Winner pays (1-γ) times the

expected value other agents would get if he were always ignored.

• Other agents pay nothing.

0

3/4 1/4

1/2 1/230

30300

01/2 1/2

20

200

T1 = -(1-γ) (10 + γ10)

T2 = -(1-γ) 7.5

T2 = -(1-γ) 7.5

Page 78: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

77

Dynamic-RM for MABs[Cavallo, 08]

• Modify dynamic-VCG by adding the following payments to the agents each period:

For agent i receiving item: (1-γ)/n times the expected total discounted revenue that would result if i were ignored going forward.

For every other agent j: 1/n times the expected immediate revenue that would have resulted this period if j were ignored.

Page 79: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-RM for MABs[Cavallo, 08]

Lemma: Whatever strategy an agent follows, his expected redistribution payments over time equal: a 1/n share of the expected total (over time) revenue that would result if the agent were not present.

(This is the hard part to prove. Once we have, it follows that dynamic-RM is a dynamic-Groves mechanism, and thus efficient.)

78

Page 80: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic-RM for MABs[Cavallo, 08]

Theorem: Dynamic-RM is efficient in within-period ex post Nash equilibrium, within-period ex post IR, and never runs a deficit.

And yields significantly more value for the agents than dynamic-VCG.

Examples with three or more agents are tough to illustrate, so let’s just look at aggregate results:

79

Page 81: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Value retained: normal distribution

80

0

10

20

30

40

50

60

70

80

90

100

3 4 5 6 7 8 9 10

% value retained by agents

number of agents

dynamic-RMdynamic-VCG

Page 82: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Value retained: uniform distribution

81

0

10

20

30

40

50

60

70

80

90

100

3 4 5 6 7 8 9 10

% value retained by agents

number of agents

dynamic-RMdynamic-VCG

Page 83: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

82

efficiency IR budget-balance

team mechanism w.p. ex post w.p. ex post huge deficit

dynamic-EAC w.p. ex post ex ante ex ante no-deficit

dynamic-VCG w.p. ex post w.p. ex post ex post no-deficit

dynamic-RM(only for MABs)

w.p. ex post w.p. ex post ex post no-deficit,much closer to perfect BB

balanced-mechanism

Bayes-Nash ex ante perfect

Page 84: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

(Balanced team mechanism presented by Susan Athey)

83

Page 85: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Extensions

84

Page 86: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamically changing populations of agents

• What’s new: agents may – either temporarily or permanently – become “inaccessible”, i.e., unable to communicate with the center or make/receive payments.

• Generalizes arrival/departure dynamics.

85

Page 87: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

For instance:

• Imagine selling theater tickets to tourists who plan to see multiple shows over a period of days.

New tourists always arriving, others leaving (dynamic population).

A tourist may see a show, realize she likes the theater more/less (dynamic types).

86

Page 88: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Related area:online mechanism design• Dynamic population (arrivals and departures),

but static types – all private information an agent will ever obtain can be reported in arrival period.

[Friedman & Parkes, 03]

[Parkes & Singh, 03]

[Lavi & Nisan, 04]

[Porter, 04]

87

Page 89: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Online-VCG mechanism[Parkes & Singh, 03]

• Collects a single payment from each agent in her “arrival period”.

Within-period ex post efficient.

Ex post individual rational

Ex post no-deficit.

88

Page 90: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Online-VCG mechanism[Parkes & Singh, 03]

• Collects a single payment from each agent in her “arrival period”.

Within-period ex post efficient.

Ex post individual rational

Ex post no-deficit.

88

But only for static types.

Page 91: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Dynamic populations, dynamic types[Cavallo, Parkes, & Singh, 07]

• Unifies dynamic mechanism design and online mechanism design.

• The new challenges:

Optimal policy must consider accessibility/inaccessibility dynamics

Agents may not be available for payment while still exerting influence on welfare of other agents.

89

Page 92: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

90

Page 93: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• Imagine agent 1 accessible at t = 1, and agent 2 inaccessible at t = 1 but very likely to become accessible at t = 2.

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

90

Page 94: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• Imagine agent 1 accessible at t = 1, and agent 2 inaccessible at t = 1 but very likely to become accessible at t = 2.

• In “naive” dynamic-VCG mechanism, agent 1 better off “hiding” to improve social-welfare.

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

90

Page 95: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• Imagine agent 1 accessible at t = 1, and agent 2 inaccessible at t = 1 but very likely to become accessible at t = 2.

• In “naive” dynamic-VCG mechanism, agent 1 better off “hiding” to improve social-welfare.

• In non-naive mechanism that makes dynamic-VCG payments only to accessible agents, agent 2 can benefit by hiding.

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

90

Page 96: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

A fix

• For any inaccessible agent, keep log of payments dynamic-VCG would impose on agent; when the agent becomes accessible, execute “lump sum” payment, appropriately scaled for discounting.

• Requires that all agents eventually “come back”.

91

Page 97: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Imagine both agents accessible in all periods. Should agent 2 feign inaccessibility until t = 2?

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

92

Page 98: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Imagine both agents accessible in all periods. Should agent 2 feign inaccessibility until t = 2?

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

T2 = -6 - 2 = -8, same whether he hides at t = 1 or not.

92

Page 99: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Imagine both agents accessible in all periods. Should agent 2 feign inaccessibility until t = 2?

! 1 ! 1

A B C ! 2

t = 1 t = 2 t = 3

8 2

(a) Agent 1’s Type.

! 1 ! 1

E G ! 2

D ! 1 ! 1

F H ! 2

t = 1 t = 2 t = 3

0.2

4

0.8 20

(b) Agent 2’s Type.

Figure 1: Two-agent, 3 time-step problem with a single item to allocate. Initial jointstate is AD. Decisions ({allocate to 1, allocate to 2, don’t allocate}) are implicit in thestate transitions. Agent 1’s type has deterministic transitions, while agent 2’s type hasnon-determinism only in the first period.

or not an agent transitions to an inaccessible state can also depend on both theaction taken and the agent’s current type.

This model of periodic inaccessibility applies, for instance, to environments in whichan agent might periodically lose contact with the center due to faulty or limited com-munication, or because of bounded attention where an agent needs to periodicallyattend to other decisions or apply full attention to utilizing a resource just assignedby the center. We will see that it also generalizes earlier arrival-departure modelsof online mechanism design, in which an agent is inaccessible and then accessibleand then inaccessible again.

E!ciency is constrained by the new communication constraints, which preclude asocial planner from knowing the private type of an agent while an agent is inacces-sible. Moreover, inaccessible agents have no opportunity to misreport their type tothe center, and therefore incentive-compatibility requirements will also be modifiedso that they need hold only while an agent is accessible.

To gain some intuition, we can consider a simple mechanism in which the decisionpolicy is defined in a way that simply ignores inaccessible agents, and selects theaction in each period that is e!cient as though the population consists of onlyaccessible agents. In addition, suppose the mechanism makes a payment to eachaccessible agents in every period that is equal to the (reported) value obtained bythe other accessible agents in the period. The following example shows that such asimple mechanism is not incentive-compatible for this environment.

Example 2. Consider again the problem in Figure 1, but now extended with thefollowing periodic-inaccessibility dynamics: with very high probability agent 1 is

16

T2 = -6 - 2 = -8, same whether he hides at t = 1 or not.

difference in optimal value for agent 1, with and without

agent 2 present at t=1

difference in optimal value for agent 1, with and without

agent 2 present at t=292

Page 100: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

What if agents don’t always come back?

• In general, the scheme won’t work.

• For an arrival/departure model, within-period ex post efficiency is recovered if agent arrivals are independent conditioned on actions chosen.

93

Page 101: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Randomly arriving agents, revenue maximization[Gershkov and Moldovanu, 09]

• Goal is not efficiency, but rather revenue maximization.

• Agents arrive randomly over time.

• Set of resources to be allocated before a deadline.

94

Page 102: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Computation

95

Page 103: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

The big bad secret

• Computing optimal policies is, in general, very hard... but often necessary.

• What can we do?

Approximations, yielding approximate equilibria (even this is hard)?

Identify tractable special cases.

Thankfully, MABs are such a case.

96

Page 104: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Computing optimal policies in MABs [Gittins & Jones, 74]

• At each period, compute Gittins index for each agent’s Markov chain.

• “Activate” (e.g., allocate resource to) agent with highest index.

Complexity: Gittins indices are independent, so linear in number of agents.

97

Page 105: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Beyond simple repeated allocation

Coordination of value information acquisition preceding one-time allocation of a single item (“metadeliberation auctions”).

98

Page 106: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Metadeliberation auction[Cavallo & Parkes, 08]

• A resource is to be allocated. Agents have initial valuations for the resource. Valuations can potentially be increased by costly “deliberation” (e.g., researching new ways of using the resource).

• How to coordinate deliberation/allocation to maximize social welfare?

99

Page 107: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Metadeliberation auction[Cavallo & Parkes, 08]

• Given optimal policy, dynamic-VCG mechanism can be applied to deal with incentives.

• Computing optimal deliberation/allocation policy is tractable (reduction to multi-armed bandits problem).

• Note: even in this one-time allocation scenario, a realistic analysis of the problem reveals the need for dynamic solution.

100

Page 108: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

ComputationBeyond bandits:

heuristics for special cases

101

Page 109: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Self-correcting dynamic multi-unit auctions[Constantin & Parkes, here]

• When computing optimal policy is infeasible...

• Propose heuristic method that is strategyproof, yet achieves social-welfare ~90% of optimal.

• See talk tomorrow for details.

102

Page 110: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Auctions with online supply[Babaioff, Blumrosen, & Roth, workshop here]

• Dynamically arriving items – unknown total quantity.

• Approximate mechanisms.

• Nonetheless truthful – possible due to the restricted setting.

103

Page 111: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Open problems, future directions

104

Page 112: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Interdependent values

• Interestingly, sequential nature of problem kind of helps here: ex post payments become natural.

Version of the team mechanism is still within-period ex post efficient.

But no apparent way to extend dynamic-VCG...

Can we achieve no-deficit, IR, and efficienct in interdependent settings?

105

Page 113: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

Computation

• The general case looks hopeless.

• Continue to identify tractable special cases?

• Adopt more realistic equilibrium notions?

106

Page 114: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

References• [Vickrey, 61] William Vickrey. Counterspeculations, auctions, and competitive

sealed tenders. Journal of Finance, 16:8–37, 1961.

• [Clarke, 71] Edward Clarke. Multipart pricing of public goods. Public Choice, 8:19–33, 1971.

• [Groves, 73] Theodore Groves. Incentives in teams. Econometrica, 41:617–631, 1973.

• [Green & Laffont, 77] Jerry Green and Jean-Jacques Laffont. Characterization of satisfactory mechanisms for the revelation of preferences for public goods. Econometrica, 45:427–438, 1977.

• [Holmstrom, 79] Bengt Holmstrom. Groves’ scheme on restricted domains. Econometrica, 47(5):1137–1144, 1979.

107

Page 115: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• [Arrow, 79] Kenneth J. Arrow. The property rights doctrine and demand revelation under incomplete information. In M. Boskin, editor, Economics and Human Welfare. Academic Press, 1979.

• [d’Aspremont & Gerard-Varet, 79] C. D’Aspermont and L.A. Gerard-Varet. Incentives and incomplete information. Journal of Public Economics, 11:25–45, 1979.

• [Bailey, 97] Martin J. Bailey. The demand revealing process: To distribute the surplus. Public Choice, 91:107–126, 1997.

• [Cavallo, 06] Ruggiero Cavallo. Optimal decision-making with minimal waste: Strategyproof redistribution of VCG payments. In Proceedings of the 5th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’06), pages 882–889, 2006.

• [Guo & Conitzer, 07] Mingyu Guo and Vincent Conitzer. Worst-case optimal redistribution of VCG payments. In Proceedings of the 8th ACM Conference on Electronic Commerce (EC-07), San Diego, CA, USA, pages 30–39, 2007.

• [Moulin, 2007] Herv´e Moulin. Efficient, strategy-proof and almost budget-balanced assignment. unpublished, 2007.

108

Page 116: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• [Guo & Conitzer, 08] Mingyu Guo and Vincent Conitzer. Optimal-in-expectation redistribution mechanisms. In Proceedings of the Seventh International Conference on Autonomous Agents and Multiagent Systems (AAMAS-08), 2008.

• [Hartline & Roughgarden, 2008] Jason D. Hartline and Tim Roughgarden. Optimal mechanism design and money burning. In Proceedings of the 40th annual ACM symposium on Theory of Computing (STOC’08), 2008.

• [de Clippel, Naroditskiy, Greenwald, here]. Geoffroy de Clippel, Victor Naroditskiy, and Amy Greenwald. Destroy to Save. In Proceedings of the 10th ACM Conference on Electronic Commerce (EC-09), 2009.

• [Myerson, 1986] Roger Myerson. Multistage games with communication. Econometrica, 54(2):323–358, 1986.

• [Athey & Segal, 07] Susan Athey and Ilya Segal. An efficient dynamic mechanism. Working paper, http://www.stanford.edu/~isegal/agv.pdf, 2007.

• [Cavallo, Parkes, & Singh, 06] Ruggiero Cavallo, David C. Parkes, and Satinder Singh. Optimal coordinated planning amongst self-interested agents with private state. In Proceedings of the Twenty-second Annual Conference on Uncertainty in Artificial Intelligence (UAI’06), 2006.

109

Page 117: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• [Cavallo, 08] Ruggiero Cavallo. Efficiency and redistribution in dynamic mechanism design. In Proceedings of the 9th ACM Conference on Electronic Commerce (EC-08) (to appear), 2008.

• [Bergemann & Valimaki, 08] Dirk Bergemann and Juuso Valimaki. Efficient dynamic auctions. Cowles Foundation Discussion Paper 1584, http://cowles.econ.yale.edu/P/cd/d15b/d1584.pdf, 2006.

• [Cavallo, Parkes, & Singh, 07] Ruggiero Cavallo, David C. Parkes, and Satinder Singh. Online mechanisms for persistent, periodically inaccessible self-interested agents. In DIMACS Workshop on the Boundary between Economic Theory and Computer Science, 2007.

• [Friedman & Parkes, 03] E. Friedman and D. C. Parkes. Pricing WiFi at Starbucks– issues in online mechanism design. In Proc. Fourth ACM Conference on Electronic Commerce (EC’03), pages 240–241, 2003.

• [Parkes & Singh, 03] David C. Parkes and Satinder Singh. An MDP-based approach to Online Mechanism Design. In Proceedings of the 17th Annual Conf. on Neural Information Processing Systems (NIPS’03), 2003.

• [Lavi & Nisan, 04] Ron Lavi and Noam Nisan. Competitive analysis of incentive compatible on-line auctions. Theoretical Computer Science, 310:159–180, 2004. Earlier version in ACMEC 2000.

110

Page 118: A tutorial on: Dynamic Mechanism Designcavallo/cavallo-dmdTut-ec09.pdf · A tutorial on: Dynamic Mechanism Design Ruggiero Cavallo University of Pennsylvania Department of Computer

• [Porter, 04] Ryan Porter. Mechanism design for online real-time scheduling. In Proceedings of the ACM Conference on Electronic Commerce (EC’04), pages 61–70, 2004.

• [Gershkov and Moldovanu, 09] Alex Gershkov and Benny Moldovanu. Dynamic Revenue Maximization with Heterogeneous Objects: A Mechanism Design Approach. Forthcoming in American Economic Journal: Microeconomics.

• [Gittins & Jones, 74] J. C. Gittins and D. M. Jones. A dynamic allocation index for the sequential design of experiments. In In Progress in Statistics, pages 241–266. J. Gani et al., 1974.

• [Cavallo & Parkes, 08] Ruggiero Cavallo and David C. Parkes. Efficient metade-liberation auctions. In Proceedings of the 26th Annual Conference on Artificial Intelligence (AAAI-08), 2008.

• [Constantin & Parkes, here] Florin Constantin and David C. Parkes. Self-Correcting Sampling-Based Dynamic Multi-Unit Auctions. In the 10th ACM Electronic Commerce Conference (EC'09), 2009

• [Babaioff, Blumrosen, & Roth, workshop here] Moshe Babaioff, Liad Blumrosen, and Aaron Roth. Auctions with Online Supply. In Fifth Workshop on Ad Auctions, 2009.

111