Simultaneous Move Games - UC Santa...

30
Simultaneous Move Games Simultaneous Move Games Decision making without knowledge of the strategy choice of opponents

Transcript of Simultaneous Move Games - UC Santa...

Page 1: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Simultaneous Move GamesSimultaneous Move Games

Decision making without knowledge of the strategy choice of opponents

Page 2: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Simultaneous MovesSimultaneous Moves• Arise when players have to make their strategy choices 

simultaneously without knowing the strategies that have beensimultaneously, without knowing the strategies that have been chosen by the other player(s).– Student studies for a test; the teacher writes questions.

f d d l d d h h d l d k– Two firms independently decide whether or not to develop and market a new product.

• While there is no information about what other players will actually choose, we assume that the strategic choices available to each player are known by all players.

• Players must think not only about their own best strategic• Players must think not only about their own best strategic choice but also the best strategic choice of the other player(s).

• We will consider both discrete and continuous strategy spaces.

Page 3: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Normal or Strategic Formg• A simultaneous move game is depicted in “Normal” or “Strategic” form using a game table that relates the g g gstrategic choices of the players to their payoffs.

• The convention is that the row player’s payoff is listed fi t d th l l ’ ff i li t d dfirst and the column player’s payoff is listed second.

Column Player

R S C1 S C2RowPlayer Strategy R1

Strategy C1a, b

Strategy C2c, d

Strategy R2 e, f g, h

• For example, if Row player chooses R2 and Column player chooses C1, the Row player’s payoff is e and the Column player’s payoff is f.

Page 4: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Special Zero‐Sum Formp• For zero (or constant sum games), knowing the payoffs sum to zero (or some otherthe payoffs sum to zero (or some other constant) allows us to write a simultaneous move game in normal form more simply:g p y

Warden

P i Cli b W llGuard Wall

1Inspect Cells

1

• Payoffs are shown only for the Prisoner; the

Prisoner Climb Wall ‐1  1

Dig Tunnel 1 ‐1

• Payoffs are shown only for the Prisoner; the Warden’s payoffs are the negative of the prisoner’s payoffprisoner s payoff

Page 5: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

The Role of Beliefs• When players move simultaneously, what does it mean to say that in equilibrium strategies are a mutual best 

?response?– You cannot see what the other is doing and condition your  behavior on their move.

• In simultaneous move games, rational players consider all of the strategies their opponents may take and they form beliefs (subjective probabilities) about the ( j p )likelihood of each strategy their opponent(s) could take.

• After forming these beliefs rational players maximizeAfter forming these beliefs, rational players maximize their expected payoff by choosing the strategy that is a best response to their beliefs about the play of their opponent(s) The same is true of the opponent(s)opponent(s).  The same is true of the opponent(s).

Page 6: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Coordination Game ExampleCoordination Game Example

How would you play this game? y p y g

Page 7: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Example of the Role of Beliefsp• Consider the pure coordination game.

Column                    Player

Row X Y

• Suppose Row player assigns probability p> 5 to column

Player X 0, 0 1, 1

Y 1, 1 0, 0

• Suppose Row player assigns probability p>.5 to column player playing Y. Then Row’s best response to this belief is to play X: 

Row’s expected payoff from playing X is 0(1‐p)+1(p)=p, while Row’s expected payoff from playing Y is 1(1‐p) + 0(p)=1‐p. Since we assumed p>.5, the expected payoff to Row from playing X, p, 

h h ffis greater than the expected payoff to Y, 1‐p.

Page 8: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

How Might Such Beliefs be Formed?g• Players’ subjective beliefs about the play of an opponent in a simultaneous move game may beopponent in a simultaneous move game may be formed in one of several ways:– Introspection:  given my knowledge of the opponent’s payoffs what would I do if I were the other player?payoffs what would I do if I were the other player?

– History (repeated games only): what strategy has the same opponent played in the past.Imitation/learning from others: what strategies have– Imitation/learning from others: what strategies have players (other than my current opponent) chosen in this type of strategic setting? 

– Pre‐play communication– Pre‐play communication.– Other type of signaling.

• We focus for now on the first, introspective method.

Page 9: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Pure vs. Mixed Strategiesg• A player pursues a pure strategy if she always chooses the same strategic action out of all thechooses the same strategic action out of all the strategic action choices available to her in every round.

Al f t l th t t h– e.g. Always refuse to clean the apartment you share with your roommate.

• A player pursues a mixed strategy if she p y p gyrandomizes in some manner among the strategic action choices available to her in every round.

e g Sometimes pitch a curveball sometimes a– e.g. Sometimes pitch a curveball, sometimes a slider (“mix it up,” “keep them guessing”).

• We focus for now on pure strategies only.We focus for now on pure strategies only.

Page 10: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Example: Battle of the Networksp f• Suppose there are just two television networks. Both are 

battling for shares of viewers (0‐100%). Higher shares are preferred ( higher advertising revenues)preferred (= higher advertising revenues).

• Network 1 has an advantage in sitcoms. If it runs a sitcom, it always gets a higher share than if it runs a game show.

• Network 2 has an advantage in game shows. If it runs a game show it always gets a higher share than if it runs a sitcom.

kNetwork 2

Network 1 SitcomSitcom

55% 45%Game Show52% 48%Network 1 Sitcom 55%, 45% 52%, 48%

Game Show 50%, 50% 45%, 55%

Page 11: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Nash Equilibrium• We cannot use rollback in a simultaneous move game, so 

how do we find a solution?• We determine the “best response” of each player to a 

particular choice of strategy by the other player.• We do this for both players. Note that in thinking of an 

opponent’s best response, we are using introspection to form beliefs about what the (rational) opponent will do.

• If each player’s strategy choice is a best response to the strategy choice of the other player, then we have found a solution or equilibrium to the game.

• This solution concept is know as a Nash equilibrium, after p qJohn Nash who first proposed it.

• A game may have 0, 1 or more Nash equilibria.

Page 12: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Best Response Analysis• Best response analysis (a.k.a. cell‐by‐cell inspection) is the most reliable method for finding Nash equilibria.First find Network 1's best response to Network 2'spossible strategies:– If Network 2 runs a sitcom, Network 1’s best response is to prun a sitcom. Circle Network 1’s payoff in this case, 55%

– If Network 2 runs a game show, Network 1’s best response is to run a sitcom Circle Network 1’s payoff in this case 52%to run a sitcom. Circle Network 1 s payoff in this case, 52%

Network 2

Network 1 SitcomSitcom

55%, 45%Game Show52%, 48%

Game Show 50%, 50% 45%, 55%

Page 13: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Best Response Analysis, Continued• Next, we find Network 2’s best response.

– If Network 1 runs a sitcom, Network 2’s best response is to run a game show. Circle Network 2’s payoff in this case, 48%p y ,

– If Network 1 runs a game show, Network 2’s best response is to run a game show. Circle Network 2’s payoff in this case, 55%

• The unique Nash equilibrium is for Network 1 to run a sitcom and Network 2The unique Nash equilibrium is for Network 1 to run a sitcom and Network 2 to run a game show. 

• This is found by the cell with the two circled payoffs.   This is the method of best response analysis for locating Nash equilibriabest response analysis for locating Nash equilibria.

Network 2

Network 1 SitcomSitcom

55%, 45%Game Show52%, 48%

Game Show 50%, 50% 45%, 55%

Page 14: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Dominant Strategies• A player has a dominant strategy if it outperforms(has higher payoff than) all other strategies regardless( g p y ) g gof the strategies chosen by the opposing player(s).• For example, in the battle of the networks game, p , g ,Network 1 has a dominant strategy of always choosing to run a sitcom. Network 2 has a dominant strategy of always choosing to run a game showstrategy of always choosing to run a game show.

• Why?• Successive elimination of non‐dominant or

•Successive elimination of “dominated” strategies can help us to find a N.E. equum.

Page 15: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Successive Elimination of Dominated StrategiesStrategies

• Another way to find Nash equilibria• Draw lines through (successively eliminate) each player’s dominated 

strategy(s).• If successive elimination of dominated strategies results in a unique

outcome, that outcome is the Nash equilibrium of the game.• We call such games dominance solvable• We call such games dominance solvable.• But, not all games have unique equilibria/are dominance solvable, so this 

method will not work as generally as best response analysis.

kNetwork 2

Network 1 SitcomSitcom

55% 45%Game Show52% 48%Network 1 Sitcom 55%, 45% 52%, 48%

Game Show 50%, 50% 45%, 55%

Page 16: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Minimax Method• For zero, or constant‐sum games only, so not so general

• Each player reasons that what’s good for me is bad for my opponent.

• Suppose payoffs are written for row only (Network game is constant‐sum).

• Row looks only at the lowest payoff in each row and chooses the row with the highest of these lowest payoffs (maximizes the minimum)g p y ( )– Network 1 chooses sitcom because 52% > 48%.

• Column looks only at the highest payoffs in each column and chooses the row with the lowest of these highest payoffs (minimizes the maximum).g p y ( )– Network 2 chooses game show because 52%<55%.

Network 2

Network 1 SitcomSitcom55%

GameShow52%

Row Min52%

Game Show 50% 45% 45%Game Show 50% 45% 45%

Column Max 55% 52%

Page 17: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Adding More Strategiesg g• Suppose we add a third choice of a “talent show” to Battle of 

the Networks.

Network 2

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%

Talent Show 51%, 49%

Game Show 50% 50% 45% 55% 46% 54%Network 1 Game Show 50%, 50% 45%, 55% 46%, 54%

Talent Show 52%, 48% 49%, 51% 48%, 52%

• What is the Nash equilibrium in this case? First ask: are there any dominated strategies? If so, eliminate them from consideration.

Page 18: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Eliminating the Dominated Strategies Reduces theS t f St t i th t M C i N h E ilib iSet of Strategies that May Comprise Nash Equilibria.

Network 2

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%

Talent Show 51%, 49%

Game Show 50% 50% 45% 55% 46% 54%Network 1 Game Show 50%, 50% 45%, 55% 46%, 54%

Talent Show 52%, 48% 49%, 51% 48%, 52%

• A talent show  is  a dominated strategy for Network 1.  • A sitcom is a dominated strategy for Network 2• A sitcom is a dominated strategy for Network 2.

• A game show is a dominated strategy for Network 1.  

Page 19: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Continuing the search for dominated strategies th i i h iamong the remaining choices…

Network 2Nash Equilibrium

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%

Talent Show 51%, 49%

Game Show 50% 50% 45% 55% 46% 54%Network 1 Game Show 50%, 50% 45%, 55% 46%, 54%

Talent Show 52%, 48% 49%, 51% 48%, 52%

• Game show is now a dominated strategy for Network 2

Page 20: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Best Response Analysis Also WorksBest Response Analysis Also Works

Network 2Nash Equilibrium

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%

Talent Show 51%, 49%

Game Show 50% 50% 45% 55% 46% 54%Network 1 Game Show 50%, 50% 45%, 55% 46%, 54%

Talent Show 52%, 48% 49%, 51% 48%, 52%

Page 21: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Adding a Third Playerg y• Consider again the case of two strategies, sitcom and game 

show and suppose there is a third player Network 3show and suppose there is a third player, Network 3.

• The normal‐form representation of this three‐player game is:

kNetwork 3

Sitcom

Network 2

Game Show

Network 2

Network 1

SitcomSitcom

34%,25%,41%Game Show32%,32%,36%

Network 1

SitcomSitcom

34%,29%,37%Game Show38%,32%,30%

Game Show 32% 30% 38% 33% 31% 36% Game Show 35% 38% 27% 36% 39% 25%Game Show 32%,30%,38% 33%,31%,36% Game Show 35%,38%,27% 36%,39%,25%

• Network 3’s payoff is now the third percentage given.

Wh t i th N h ilib i f thi ?• What is the Nash equilibrium of this game?

Page 22: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Use Best Response AnalysisUse Best Response Analysis

kNetwork 3

Sitcom

Network 2

Game Show

Network 2

Network 1

SitcomSitcom

34%,25%,41%Game Show32%,32%,36%

Network 1

SitcomSitcom

34%,29%,37%Game Show38%,32%,30%

Game Show 32% 30% 38% 33% 31% 36% Game Show 35% 38% 27% 36% 39% 25%Game Show 32%,30%,38% 33%,31%,36% Game Show 35%,38%,27% 36%,39%,25%

• Pure strategy Nash equilibrium to this game is for Network 1 to run a game show Network 2 to run a game show andto run a game show, Network 2 to run a game show and Network 3 to run a sitcom.

Page 23: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Non‐Constant‐Sum Games• The Network Game is an example of a constant sum game.

• The payoffs to both players always add up to the constant sum of 100%• The payoffs to both players always add up to the constant sum of 100%.

• We could make that game zero sum by redefining payoffs relative to a 50%‐50% share for each network.

• Nash eq ilibria also e ist in non constant s m or variable sum games• Nash equilibria also exist in non‐constant sum or variable sum games, where players may have some common interest.

• For example, prisoner’s dilemma type games. Payoffs are “profits”, so more is bettermore is better:

Burger King

Value Meals No Value MealsMcDonald’s Value Meals

Value Meals2, 2

No Value Meals4, 1

No Value Meals 1,4 3, 3

Page 24: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Asymmetry In PayoffsAsymmetry In Payoffs• Payoffs need not be symmetric in equilibrium.• Consider this version of Prisoner’s Dilemma:• Consider this version of Prisoner s Dilemma:

Burger King

McDonald’s Value MealsValue Meals

3, 1No Value Meals

5, 0

N V l M l 1 6 4 5

• Both players still have a dominant strategy of 

No Value Meals 1,6 4, 5

Value Meals: Game is Still a PD, as the mutually efficient (most jointly profitable) outcome is not a Nash equilibriumNash equilibrium

Page 25: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

No Equilibrium in Pure StrategiesNo Equilibrium in Pure Strategies

• Not every simultaneous move game has an y gequilibrium in pure strategies.

• Consider again the Prisoner‐Warden game.Warden

Guard Wall Inspect Cells

Prisoner Climb Wall ‐1 1 1 ‐1

• Cell by cell inspection yields NO pure strategy NE

Prisoner Climb Wall ‐1, 1 1, ‐1

Dig Tunnel 1, ‐1 ‐1, 1

• Cell‐by‐cell inspection yields NO pure strategy NE.• There is, however, an equilibrium in mixed strategies.g

Page 26: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

RationalizabilityRationalizability• An alternative solution concept that is a generalization of 

Nash equilibrium.• Identify strategies that are never a best response for a player 

given any beliefs about the play of his opponent.  Strictly dominated strategies are never a best response, but there 

b t t i th t t t i tl d i t d b t hi hcan be strategies that are not strictly dominated but which are never a best response.

• The set of strategies that survive elimination on the grounds f b i b t th t f “ ti li blof never being a best response are the set of “rationalizable

equilibria” they can be rationalized via certain beliefs.• Every Nash equilibrium is a rationalizable equilibriuim, but 

t ti li bl ilib i i N h ilib inot every rationalizable equilibrium is a Nash equilibrium.• In this sense, rationalizability is a more general solution 

concept. 

Page 27: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Rationalizability ExampleRationalizability ExampleColumn

X Y Z

Row A 3,2 0,3 2,0

B 1 3 2 0 1 2B 1,3 2,0 1,2

C 2,1 4,3 0,2

h h l h d d• Note that neither player has any dominated strategies.• Nash equilibrium is: CY• For Row, Strategy B is never a best response.• For Column, Strategy Z is never a best response., gy p• Eliminating these, we see that X is never a best response for Column.• It further follows that A is not a best response for Row, leaving only CY.• More generally, sequential elimination of never best responses need not lead t N h ilib ito Nash equilibrium

Page 28: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

Elimination of Never Best Responses d l lNeed Not Select a Solution

• Consider this example:Column

X Y Z

Row A 1 5 2 4 5 1Row A 1,5 2,4 5,1

B 4,2 3,3 4,2   

C 5,1 2,4 1,5

• For Row, A is rationalizable if Row thinks Column will play Z; B is rationalizable if Row thinks Column will play Y; C is rationalizable if Row thinks Column will play X.F C l X i i li bl if C l hi k R ill l A Y i• For Column, X is rationalizable if Column thinks Row will play A; Y is rationalizable if Column thinks Row will play B, Z is rationalizable if Column thinks Row will play C.  

• Thus all strategies are rationalizable in this exampleThus, all strategies are rationalizable in this example.• However, the Nash equilibrium, using  best response analysis is BY

Page 29: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

How to Negotiate a Pay Raise with Game Theory

anuzis

http://www.youtube.com/watch?v=ikE1pn034WA

Page 30: Simultaneous Move Games - UC Santa Barbarafaculty.econ.ucsb.edu/~garratt/Econ171/Lect04_Slides.pdf · Simultaneous Moves • Arise when players have to make their strategy choices

http://www.youtube.com/watch?v=rExm2FbY-BE

Can you solve this three player game?