Simultaneous Move Games - UC Santa...

Simultaneous Move GamesSimultaneous Move Games

Decision making without knowledge of the strategy choice of opponents

Simultaneous MovesSimultaneous Moves• Arise when players have to make their strategy choices

simultaneously without knowing the strategies that have beensimultaneously, without knowing the strategies that have been chosen by the other player(s).– Student studies for a test; the teacher writes questions.

f d d l d d h h d l d k– Two firms independently decide whether or not to develop and market a new product.

• While there is no information about what other players will actually choose, we assume that the strategic choices available to each player are known by all players.

• Players must think not only about their own best strategic• Players must think not only about their own best strategic choice but also the best strategic choice of the other player(s).

• We will consider both discrete and continuous strategy spaces.

Normal or Strategic Formg• A simultaneous move game is depicted in “Normal” or “Strategic” form using a game table that relates the g g gstrategic choices of the players to their payoffs.

• The convention is that the row player’s payoff is listed fi t d th l l ’ ff i li t d dfirst and the column player’s payoff is listed second.

Column Player

R S C1 S C2RowPlayer Strategy R1

Strategy C1a, b

Strategy C2c, d

Strategy R2 e, f g, h

• For example, if Row player chooses R2 and Column player chooses C1, the Row player’s payoff is e and the Column player’s payoff is f.

Special Zero‐Sum Formp• For zero (or constant sum games), knowing the payoffs sum to zero (or some otherthe payoffs sum to zero (or some other constant) allows us to write a simultaneous move game in normal form more simply:g p y

Warden

P i Cli b W llGuard Wall

1Inspect Cells

1

• Payoffs are shown only for the Prisoner; the

Prisoner Climb Wall ‐1 1

Dig Tunnel 1 ‐1

• Payoffs are shown only for the Prisoner; the Warden’s payoffs are the negative of the prisoner’s payoffprisoner s payoff

The Role of Beliefs• When players move simultaneously, what does it mean to say that in equilibrium strategies are a mutual best

?response?– You cannot see what the other is doing and condition your behavior on their move.

• In simultaneous move games, rational players consider all of the strategies their opponents may take and they form beliefs (subjective probabilities) about the ( j p )likelihood of each strategy their opponent(s) could take.

• After forming these beliefs rational players maximizeAfter forming these beliefs, rational players maximize their expected payoff by choosing the strategy that is a best response to their beliefs about the play of their opponent(s) The same is true of the opponent(s)opponent(s). The same is true of the opponent(s).

Coordination Game ExampleCoordination Game Example

How would you play this game? y p y g

Example of the Role of Beliefsp• Consider the pure coordination game.

Column Player

Row X Y

• Suppose Row player assigns probability p> 5 to column

Player X 0, 0 1, 1

Y 1, 1 0, 0

• Suppose Row player assigns probability p>.5 to column player playing Y. Then Row’s best response to this belief is to play X:

Row’s expected payoff from playing X is 0(1‐p)+1(p)=p, while Row’s expected payoff from playing Y is 1(1‐p) + 0(p)=1‐p. Since we assumed p>.5, the expected payoff to Row from playing X, p,

h h ffis greater than the expected payoff to Y, 1‐p.

How Might Such Beliefs be Formed?g• Players’ subjective beliefs about the play of an opponent in a simultaneous move game may beopponent in a simultaneous move game may be formed in one of several ways:– Introspection: given my knowledge of the opponent’s payoffs what would I do if I were the other player?payoffs what would I do if I were the other player?

– History (repeated games only): what strategy has the same opponent played in the past.Imitation/learning from others: what strategies have– Imitation/learning from others: what strategies have players (other than my current opponent) chosen in this type of strategic setting?

– Pre‐play communication– Pre‐play communication.– Other type of signaling.

• We focus for now on the first, introspective method.

Pure vs. Mixed Strategiesg• A player pursues a pure strategy if she always chooses the same strategic action out of all thechooses the same strategic action out of all the strategic action choices available to her in every round.

Al f t l th t t h– e.g. Always refuse to clean the apartment you share with your roommate.

• A player pursues a mixed strategy if she p y p gyrandomizes in some manner among the strategic action choices available to her in every round.

e g Sometimes pitch a curveball sometimes a– e.g. Sometimes pitch a curveball, sometimes a slider (“mix it up,” “keep them guessing”).

• We focus for now on pure strategies only.We focus for now on pure strategies only.

Example: Battle of the Networksp f• Suppose there are just two television networks. Both are

battling for shares of viewers (0‐100%). Higher shares are preferred ( higher advertising revenues)preferred (= higher advertising revenues).

• Network 1 has an advantage in sitcoms. If it runs a sitcom, it always gets a higher share than if it runs a game show.

• Network 2 has an advantage in game shows. If it runs a game show it always gets a higher share than if it runs a sitcom.

kNetwork 2

Network 1 SitcomSitcom

55% 45%Game Show52% 48%Network 1 Sitcom 55%, 45% 52%, 48%

Game Show 50%, 50% 45%, 55%

Nash Equilibrium• We cannot use rollback in a simultaneous move game, so

how do we find a solution?• We determine the “best response” of each player to a

particular choice of strategy by the other player.• We do this for both players. Note that in thinking of an

opponent’s best response, we are using introspection to form beliefs about what the (rational) opponent will do.

• If each player’s strategy choice is a best response to the strategy choice of the other player, then we have found a solution or equilibrium to the game.

• This solution concept is know as a Nash equilibrium, after p qJohn Nash who first proposed it.

• A game may have 0, 1 or more Nash equilibria.

Best Response Analysis• Best response analysis (a.k.a. cell‐by‐cell inspection) is the most reliable method for finding Nash equilibria.First find Network 1's best response to Network 2'spossible strategies:– If Network 2 runs a sitcom, Network 1’s best response is to prun a sitcom. Circle Network 1’s payoff in this case, 55%

– If Network 2 runs a game show, Network 1’s best response is to run a sitcom Circle Network 1’s payoff in this case 52%to run a sitcom. Circle Network 1 s payoff in this case, 52%

Network 2


55%, 45%Game Show52%, 48%

Game Show 50%, 50% 45%, 55%

Best Response Analysis, Continued• Next, we find Network 2’s best response.

– If Network 1 runs a sitcom, Network 2’s best response is to run a game show. Circle Network 2’s payoff in this case, 48%p y ,

– If Network 1 runs a game show, Network 2’s best response is to run a game show. Circle Network 2’s payoff in this case, 55%

• The unique Nash equilibrium is for Network 1 to run a sitcom and Network 2The unique Nash equilibrium is for Network 1 to run a sitcom and Network 2 to run a game show.

• This is found by the cell with the two circled payoffs. This is the method of best response analysis for locating Nash equilibriabest response analysis for locating Nash equilibria.

Network 2


55%, 45%Game Show52%, 48%

Game Show 50%, 50% 45%, 55%

Dominant Strategies• A player has a dominant strategy if it outperforms(has higher payoff than) all other strategies regardless( g p y ) g gof the strategies chosen by the opposing player(s).• For example, in the battle of the networks game, p , g ,Network 1 has a dominant strategy of always choosing to run a sitcom. Network 2 has a dominant strategy of always choosing to run a game showstrategy of always choosing to run a game show.

• Why?• Successive elimination of non‐dominant or

•Successive elimination of “dominated” strategies can help us to find a N.E. equum.

Successive Elimination of Dominated StrategiesStrategies

• Another way to find Nash equilibria• Draw lines through (successively eliminate) each player’s dominated

strategy(s).• If successive elimination of dominated strategies results in a unique

outcome, that outcome is the Nash equilibrium of the game.• We call such games dominance solvable• We call such games dominance solvable.• But, not all games have unique equilibria/are dominance solvable, so this

method will not work as generally as best response analysis.

kNetwork 2


55% 45%Game Show52% 48%Network 1 Sitcom 55%, 45% 52%, 48%

Game Show 50%, 50% 45%, 55%

Minimax Method• For zero, or constant‐sum games only, so not so general

• Each player reasons that what’s good for me is bad for my opponent.

• Suppose payoffs are written for row only (Network game is constant‐sum).

• Row looks only at the lowest payoff in each row and chooses the row with the highest of these lowest payoffs (maximizes the minimum)g p y ( )– Network 1 chooses sitcom because 52% > 48%.

• Column looks only at the highest payoffs in each column and chooses the row with the lowest of these highest payoffs (minimizes the maximum).g p y ( )– Network 2 chooses game show because 52%<55%.

Network 2

Network 1 SitcomSitcom55%

GameShow52%

Row Min52%

Game Show 50% 45% 45%Game Show 50% 45% 45%

Column Max 55% 52%

Adding More Strategiesg g• Suppose we add a third choice of a “talent show” to Battle of

the Networks.

Network 2

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%

Talent Show 51%, 49%

Game Show 50% 50% 45% 55% 46% 54%Network 1 Game Show 50%, 50% 45%, 55% 46%, 54%

Talent Show 52%, 48% 49%, 51% 48%, 52%

• What is the Nash equilibrium in this case? First ask: are there any dominated strategies? If so, eliminate them from consideration.

Eliminating the Dominated Strategies Reduces theS t f St t i th t M C i N h E ilib iSet of Strategies that May Comprise Nash Equilibria.

Network 2

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%



Talent Show 52%, 48% 49%, 51% 48%, 52%

• A talent show is a dominated strategy for Network 1. • A sitcom is a dominated strategy for Network 2• A sitcom is a dominated strategy for Network 2.

• A game show is a dominated strategy for Network 1.

Continuing the search for dominated strategies th i i h iamong the remaining choices…

Network 2Nash Equilibrium

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%



Talent Show 52%, 48% 49%, 51% 48%, 52%

• Game show is now a dominated strategy for Network 2

Best Response Analysis Also WorksBest Response Analysis Also Works

Network 2Nash Equilibrium

Network 1

SitcomSitcom

55%, 45%Game Show52%, 48%



Talent Show 52%, 48% 49%, 51% 48%, 52%

Adding a Third Playerg y• Consider again the case of two strategies, sitcom and game

show and suppose there is a third player Network 3show and suppose there is a third player, Network 3.

• The normal‐form representation of this three‐player game is:

kNetwork 3

Sitcom

Network 2

Game Show

Network 2

Network 1

SitcomSitcom

34%,25%,41%Game Show32%,32%,36%

Network 1

SitcomSitcom

34%,29%,37%Game Show38%,32%,30%

Game Show 32% 30% 38% 33% 31% 36% Game Show 35% 38% 27% 36% 39% 25%Game Show 32%,30%,38% 33%,31%,36% Game Show 35%,38%,27% 36%,39%,25%

• Network 3’s payoff is now the third percentage given.

Wh t i th N h ilib i f thi ?• What is the Nash equilibrium of this game?

Use Best Response AnalysisUse Best Response Analysis

kNetwork 3

Sitcom

Network 2

Game Show

Network 2

Network 1

SitcomSitcom

34%,25%,41%Game Show32%,32%,36%

Network 1

SitcomSitcom

34%,29%,37%Game Show38%,32%,30%

Game Show 32% 30% 38% 33% 31% 36% Game Show 35% 38% 27% 36% 39% 25%Game Show 32%,30%,38% 33%,31%,36% Game Show 35%,38%,27% 36%,39%,25%

• Pure strategy Nash equilibrium to this game is for Network 1 to run a game show Network 2 to run a game show andto run a game show, Network 2 to run a game show and Network 3 to run a sitcom.

Non‐Constant‐Sum Games• The Network Game is an example of a constant sum game.

• The payoffs to both players always add up to the constant sum of 100%• The payoffs to both players always add up to the constant sum of 100%.

• We could make that game zero sum by redefining payoffs relative to a 50%‐50% share for each network.

• Nash eq ilibria also e ist in non constant s m or variable sum games• Nash equilibria also exist in non‐constant sum or variable sum games, where players may have some common interest.

• For example, prisoner’s dilemma type games. Payoffs are “profits”, so more is bettermore is better:

Burger King

Value Meals No Value MealsMcDonald’s Value Meals

Value Meals2, 2

No Value Meals4, 1

No Value Meals 1,4 3, 3

Asymmetry In PayoffsAsymmetry In Payoffs• Payoffs need not be symmetric in equilibrium.• Consider this version of Prisoner’s Dilemma:• Consider this version of Prisoner s Dilemma:

Burger King

McDonald’s Value MealsValue Meals

3, 1No Value Meals

5, 0

N V l M l 1 6 4 5

• Both players still have a dominant strategy of

No Value Meals 1,6 4, 5

Value Meals: Game is Still a PD, as the mutually efficient (most jointly profitable) outcome is not a Nash equilibriumNash equilibrium

No Equilibrium in Pure StrategiesNo Equilibrium in Pure Strategies

• Not every simultaneous move game has an y gequilibrium in pure strategies.

• Consider again the Prisoner‐Warden game.Warden

Guard Wall Inspect Cells

Prisoner Climb Wall ‐1 1 1 ‐1

• Cell by cell inspection yields NO pure strategy NE

Prisoner Climb Wall ‐1, 1 1, ‐1

Dig Tunnel 1, ‐1 ‐1, 1

• Cell‐by‐cell inspection yields NO pure strategy NE.• There is, however, an equilibrium in mixed strategies.g

RationalizabilityRationalizability• An alternative solution concept that is a generalization of

Nash equilibrium.• Identify strategies that are never a best response for a player

given any beliefs about the play of his opponent. Strictly dominated strategies are never a best response, but there

b t t i th t t t i tl d i t d b t hi hcan be strategies that are not strictly dominated but which are never a best response.

• The set of strategies that survive elimination on the grounds f b i b t th t f “ ti li blof never being a best response are the set of “rationalizable

equilibria” they can be rationalized via certain beliefs.• Every Nash equilibrium is a rationalizable equilibriuim, but

t ti li bl ilib i i N h ilib inot every rationalizable equilibrium is a Nash equilibrium.• In this sense, rationalizability is a more general solution

concept.

Rationalizability ExampleRationalizability ExampleColumn

X Y Z

Row A 3,2 0,3 2,0

B 1 3 2 0 1 2B 1,3 2,0 1,2

C 2,1 4,3 0,2

h h l h d d• Note that neither player has any dominated strategies.• Nash equilibrium is: CY• For Row, Strategy B is never a best response.• For Column, Strategy Z is never a best response., gy p• Eliminating these, we see that X is never a best response for Column.• It further follows that A is not a best response for Row, leaving only CY.• More generally, sequential elimination of never best responses need not lead t N h ilib ito Nash equilibrium

Elimination of Never Best Responses d l lNeed Not Select a Solution

• Consider this example:Column

X Y Z

Row A 1 5 2 4 5 1Row A 1,5 2,4 5,1

B 4,2 3,3 4,2

C 5,1 2,4 1,5

• For Row, A is rationalizable if Row thinks Column will play Z; B is rationalizable if Row thinks Column will play Y; C is rationalizable if Row thinks Column will play X.F C l X i i li bl if C l hi k R ill l A Y i• For Column, X is rationalizable if Column thinks Row will play A; Y is rationalizable if Column thinks Row will play B, Z is rationalizable if Column thinks Row will play C.

• Thus all strategies are rationalizable in this exampleThus, all strategies are rationalizable in this example.• However, the Nash equilibrium, using best response analysis is BY

How to Negotiate a Pay Raise with Game Theory

anuzis

http://www.youtube.com/watch?v=ikE1pn034WA

http://www.youtube.com/watch?v=ikE1pn034WA

http://www.youtube.com/watch?v=rExm2FbY-BE

Can you solve this three player game?

Simultaneous Move Games - UC Santa...

Documents

Transcript of Simultaneous Move Games - UC Santa...