Literature Game Theory - University of Liverpool...

20
Game Theory 1 Game Theory Wiebe van der Hoek Computer Science University of Liverpool United Kingdom wvdh 2 Literature Fun and Games Ken Binmore A Course in Game Theory Martin Osborne en Ariel Rubinstein wvdh 3 What is it all about? game = interaction traffic supermarket employee, employer, board, union student / teacher judge and lawyers George en Osama marriage and career wvdh 4 What is it all about? terminology of chess and bridge logic and systematics of interaction analysis takes you from irrational issues strategic interaction is difficult because reasoning is circular If J and M play a game, J’s strategy will typically depend on his prediction of M’s strategy, which, on its turn depends on M’s expectation of J’s.... wvdh 5 Surprise and Paradox does it make sense to vote for a candidate you fancy least? for a general, to toss a coin? in poker, place a maximal bid with the worst cards? to throw some goods away before starting to negotiate about them to sell your house to the second best bidder? YES!! YES!! YES!! YES!! YES!! wvdh 6 Strategic voting Boris, Horace and Maurice determine who can be a member of the Dead Poet Society proposal: allow Alice amendment: allow Bob, rather than Alice first vote over amendment, then over proposal

Transcript of Literature Game Theory - University of Liverpool...

Page 1: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 1

Game Theory

Wiebe van der Hoek

Computer Science

University of Liverpool

United Kingdom

wvdh 2

Literature

� Fun and Games

� Ken Binmore

� A Course in Game Theory

� Martin Osborne en Ariel Rubinstein

wvdh 3

What is it all about?

� game = interaction� traffic

� supermarket

� employee, employer, board, union

� student / teacher

� judge and lawyers

� George en Osama

� marriage and career

wvdh 4

What is it all about?

� terminology of chess and bridge

� logic and systematics of interaction

� analysis takes you from irrational issues

� strategic interaction is difficult

� because reasoning is circular

If J and M play a game, J’s strategy will typically depend on his prediction of M’s strategy, which, on its

turn depends on M’s expectation of J’s....

wvdh 5

Surprise and Paradox

� does it make sense� to vote for a candidate you fancy least?

� for a general, to toss a coin?

� in poker, place a maximal bid with the worst cards?

� to throw some goods away before starting to negotiate about them

� to sell your house to the second best bidder? YES

!!

YES!!

YES!!

YES!!

YES!!

wvdh 6

Strategic voting

� Boris, Horace and Maurice determine who can be a member of the Dead Poet Society

� proposal: allow Alice

� amendment: allow Bob, rather than Alice

� first vote over amendment, then over proposal

Page 2: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 2

wvdh 7

Strategic voting

Bob

Nobody

Alice

Bob

Alice

Nobody

Nobody

Alice

Bob

Borice Horace Maurice

wvdh 8

Strategic voting

� first between A, B� winner Alice

� then between A, N� winner Alice

� strategic voting H:� first vote for Bob!

� result … B, N

� M anticipates: vote for A

Bob

Nobody

AliceBob

Alice

Nobody

Nobody

Alice

Bob

Borice

Horace

Maurice

wvdh 9

History

� Von Neumann and Morgenstern

� The Theory of Games and EconomicBehaviour (1944)

� ideas from economics and mathematics

� initially very optimistic, then draw-back

� revival since 1970’s

� Nash, Aumann, Shapley, Selten, Harsanyi

wvdh 10

Game Theory

� theory of decision makers� are rational:� aware of alternatives

� form expectations

� have preferences

� optimize after deliberation

� set A of actions;

� set C consequences;

� g: A → C� consequence function

� preference relation ≥on A

� or: utility function

� u: C → R

wvdh 11

Abstracts from `emotions’

� suppose you’re offered £ 1.000

� you make a deal with the first person you encounter: (1.000-x,x) x = 1, 2 ...� if he accepts: (you,person) get (1.000-x,x)

� else (0,0)

� only money counts, and that is known

� both are rational: prefer y+1 over y

� what will you offer?ONE

£!!!!!!!

wvdh 12

Game Theory

� theory of decision makers� are rational

� reason strategically

� players anticipate on knowledge and expectations about behaviour of other decision makers

Page 3: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 3

wvdh 13

Strategic Games

� Definition G = ⟨ N‚(Ai)‚(≥i)⟩� finite set N (players)

� set Ai (actions) for every player i

� preference relation ≥i for every player i� ui is utility function: A→ R with

� a ≥i b ⇔ ui(a) ≥ ui(b)� also called payoff-function

� (although not the same)

wvdh 14

Representation strategic games

� N = {1,2}

� A1 = {T,B}

� A2= {L,R}

� u1(T) =w1, etcz1‚z2y1‚y2

x1‚x2w1‚w2T

B

L R

wvdh 15

Representation strategic games

� Interpretation� one-shot

� simultaneous

� independent

� utilities are known

� not the choice of others

z1‚z2y1‚y2

x1‚x2w1‚w2T

B

L R

wvdh 16

Example: BoS

� N = {1,2}

� A1 = {B,S}

� A2= {B,S}

� u1, u2 see figure� B: Bach

� S: Strawinsky

� Battle of the Sexes

1,20,0

0,02‚1B

S

B S

wvdh 17

Profiles

� A1, A2 ,... , An are the action sets

� (a1, a2 ,... , an) ∈ A1x A2 x ... x An is a profile

� notation: (x), or a*

� x-i ∈ A1x A2 x ... x Ai-1 x Ai+1 x ... x An � (x-i,xi) = (x)

� focus on i, given the profile of the others

wvdh 18

Profiles: example

� A1, A2 ,... , A7 are bids (∈ R)

� (a1, a2 ,... , a7) is a concrete bid

� notation: (x)i =(25,22,20,12,0,27,22)=a*

� x-6 ∈(25,22,20,12,0,22)� (x-6,x6) = ((25,22,20,12,0,22),27)

� (x-6,x’6) = ((25,22,20,12,0,22),26) would have been better for player 6, given the profiles of the others

Page 4: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 4

wvdh 19

Nash equilibrium

� John Nash

� equilibrium (“solution”)

� every player is rational

� ever player plays optimally

� no use to devert individually

� not an algorithmic approach

wvdh 20

Nash equilibrium (definitie)

� Given G = ⟨ N‚(Ai)‚(≥i)⟩� a* ∈ A = A1x A2 x ... x An is Nash equilibrium iff

� ∀i∈N ∀ai∈Ai (a*i-1,a*i) ≥i (a*i-1,ai) � `no player i can improve in a*, if the other players still play a*-i’

wvdh 21

Nash equilibrium (alternative)

� define for every a-i ∈A-i the best response for i, Bi(a-i)

� Bi(a-i)={ai∈Ai | ∀a’i ∈Ai (a-i,ai) ≥i (a-i,a’i)}

� a* is N.eq iff ∀i∈N a*i∈ Bi(a*-i)

wvdh 22

Example: BoS (N.eq)

� N = {1,2}

� A1 = {B,S}

� A2= {B,S}

� u1, u2 see figure� B: Bach

� S: Strawinsky

� two equilibria:

� (bach,bach) and

� (strawinsky, strawinsky)

1,20,0

0,02‚1B

S

B S

wvdh 23

Example: coordination game

� Mozart or Mahler?

� same preferences

� two equilibria:

� (Mozart,Mozart) and

� (Mahler,Mahler)

� N.eq right concept?

1,10,0

0,02,2Mo

Ma

Mo Ma

wvdh 24

Pareto Efficiency

� (Mozart,Mozart) and

� (Mahler,Mahler)

� N.eq right concept?

� (2,2) is (strongly) Pareto efficient:

� ¬∃x¬∃y (x,y) > (2,2)

1,10,0

0,02,2Mo

Ma

Mo Ma

Page 5: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 5

wvdh 25

Pareto Optimality (definition)

� Given G = ⟨ N‚(Ai)‚(≥i)⟩� a* = (a1, a2 ,... , an) ∈ A1x A2 x ... x Anis Pareto optimal iff

� ∀i∈N ∀b* ∈ A1x A2 x ... x An (a*) ≥i (b*)

wvdh 26

Example: prisoner’s dilemma

� C: cooperate with the other, keep silent

� D: justify against the other -1,-13,-2

-2,30,0C

D

C D

wvdh 27

Example: prisoner’s dilemma

� C: cooperate with the other, keep silent

� D: justify against the other

� Altough cooperate would be better, every player has a preference for defeat

-1,-13,-2

-2,30,0C

D

C D

wvdh 28

Example: hawk-dove

� preference:� hawkish if other is dovish

� dovish if oher is hawkish

� N.eq: (Dove,Hawk)

� and (Hawk,Dove)

0,04,1

1,43,3D

H

D H

wvdh 29

Example: Matching Pennies

� Head and Tail

� if different, 1 pays a Pound to 2 if the same, 2 pays a Pound to 1

� no equilibrium!

� game is strictly competatief

1,-1-1,1

-1,11,-1H

T

H T

wvdh 30

SCSG

� Strictly Compatitive Strategic Game

� if G = ⟨{1,2}‚(Ai)‚(≥i)⟩, � and ∀a,b ∈A: a ≥1 b ⇔ b ≥2 a� also called zero-sum game:

� with u1 and u2 we have

� u1 + u2 = 0

Page 6: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 6

wvdh 31

maxminimizer

� let G = ⟨{1,2}‚(Ai)‚(≥i)⟩ an SCSG� action x* ∈ A1 is maxminimizer for 1:

� ∀x∈A1 min u1(x*,y) ≥1 min u1(x,y)

� action y* ∈ A2 is maxminimizer for 2:

� ∀y∈A2 min u2(x,y*) ≥2 min u2(x,y)x∈A1

y∈A2 y∈A2

x∈A1

wvdh 32

maxminimizer

� action x* ∈ A1 is maxminimizer for 1:

� ∀x∈A1 min u1(x*,y) ≥1 min u1(x,y)

� solves for 1 maxxminy u1(x,y)

� solves for 2 maxyminx u2(x,y)

� ‘maximises the minimum that i can guarantee’

� x* is a security strategy for 1

y∈A2 y∈A2

wvdh 33

Equilibria and maxminimizers

� (x*,y*) is N.eq for G, iff:

� x* is a maxminimizer for 1;

� y* is a maxminimizer for 2

� maxxminyu1(x,y) =

� minymaxxu1(x,y) =u1(x*,y*)

wvdh 34

maxminimizers

� solves for 1

� maxxminy u1(x,y) =� max{

� min{u1(x,y)|y∈A2} � |x ∈A1} =

4,-45,-56,-63,-34,-4x6

3,-32,-24,-45,-53,-3x5

5,-57,-75,-58,-86,-6x4

3,-33,-34,-42,-25,-5x3

4,-46,-64,-45,-53,-3x2

1,-11,-13,-32,-22,-2x1

y5y4y3y2y1...

wvdh 35

maxminimizers

� solves for 1

� maxxminy u1(x,y) =� max{

� min{u1(x,y)|y∈A2} � |x ∈A1} =

4,-45,-56,-63,-34,-4x6

3,-32,-24,-45,-53,-3x5

5,-57,-75,-58,-86,-6x4

3,-33,-34,-42,-25,-5x3

4,-46,-64,-45,-53,-3x2

1,-11,-13,-32,-22,-2x1

y5y4y3y2y1...

x1: miny u(x1,y) = 1

wvdh 36

maxminimizers

� solves for 1

� maxxminy u1(x,y) =� max{

� min{u1(x,y)|y∈A2} � |x ∈A1} =

4,-45,-56,-63,-34,-4x6

3,-32,-24,-45,-53,-3x5

5,-57,-75,-58,-86,-6x4

3,-33,-34,-42,-25,-5x3

4,-46,-64,-45,-53,-3x2

1,-11,-13,-32,-22,-2x1

y5y4y3y2y1...

x1: miny u(x1,y) = 1

x2: miny u(x2,y) = 3

xn: miny u(xn,y) = 3

..: .......... .. = .. max = 5 for x* = x4

Page 7: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 7

wvdh 37

maxminimizers

� solves for 1

� maxxminy u1(x,y) =� max{

� min{u1(x,y)|y∈A2} � |x ∈A1} = 5

� solves for 2

� maxxminy u2(x,y) =� max{

� min{u1(x,y)|x∈A1} � |y ∈A2} =

4,-45,-56,-63,-34,-4x6

3,-32,-24,-45,-53,-3x5

5,-57,-75,-58,-86,-6x4

3,-33,-34,-42,-25,-5x3

4,-46,-64,-45,-53,-3x2

1,-11,-13,-32,-22,-2x1

y5y4y3y2y1...

wvdh 38

maxminimizers

� solves for 1

� maxxminy u1(x,y) =� max{

� min{u1(x,y)|y∈A2} � |x ∈A1} = 5

� solves for 2

� maxxminy u2(x,y) =� max{

� min{u1(x,y)|x∈A1} � |y ∈A2} = -5!

� Equilibrium (5,-5)

4,-45,-56,-63,-34,-4x6

3,-32,-24,-45,-53,-3x5

5,-57,-75,-58,-86,-6x4

3,-33,-34,-42,-25,-5x3

4,-46,-64,-45,-53,-3x2

1,-11,-13,-32,-22,-2x1

y5y4y3y2y1...

wvdh 39

security level vs equilibria

� consider cooperative game G

� (2,2) looks like `the optimal’ solution

� security strategy of1 is r, gives 1!

� Nash equilibria?

1,11,1

0,02,2l

r

L R

wvdh 40

security level vs equilibria

� consider game G

� (2,2) looks like `the optimal’ solution

� security strategy of 1 is r, gives 1!

� Nash equilibria?

1,11,1

0,02,2l

r

L R

wvdh 41

bimatrix games

� m x n matrix

� 1 has strategies s1and s2, 2 has t1, t2and t3

� payoff π1(si,tj) = ij� π2(si,tj) = (i-2)(j-2)

6,04,02,0

3,-12,01,1s1

s2

t1 t2 t3

wvdh 42

bimatrix games

� m x n matrix

� 1 has strategies s1and s2, 2 has t1, t2and t3

� Nash equilibrium (σ,τ):� ∀s,t π1(σ,τ) ≥ π1(s,τ) � ∀s,t π2(σ,τ) ≥ π2(σ,t)

6,04,02,0

3,-12,01,1s1

s2

t1 t2 t3

Page 8: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 8

wvdh 43

domination

� strategy sd of 1 dominates sistrongly:

� ∀t π1(sd,t) > π1(si,t)� and weakly if:

� ∀t π1(sd,t) ≥ π1(si,t)� ∃t π1(sd,t) > π1(si,t)

� t1 dominates t2weakly

6,04,02,0

3,-12,01,1s1

s2

t1 t2 t3

wvdh 44

Iterated elimination

� s2 of 1 strongly dominates s1

� No further (weak) domination: all is left are Nash Equilibria

� this is not generally so

6,04,02,0

3,-12,01,1s1

t1 t2 t3

wvdh 45

Order of elimination

3,33,3BF

3,33,3BE

1,10,2AF

1,12,0AE

DC3,33,3BF

3,33,3BE

1,10,2AF

1,12,0AE

DC

3,33,3BF

3,33,3BE

1,10,2AF

1,12,0AE

DC

lost equilibrium!

wvdh 46

elimination: conclusions

� strict strategies: no problem

� with weakly dominated strategies:

� some equilibria can get lost

� order of elimination is important

wvdh 47

Example: BoS

� N = {1,2}

� A1 = {B,S}

� A2= {B,S}

� u1, u2 see figure� B: Bach

� S: Strawinsky

� Battle of the Sexes

1,20,0

0,02‚1B

S

B S

no dominant strategies

still (two) Nash equilibira

wvdh 48

Ex: coordination game

� Mozart of Mahler?

� Same preference

1,10,0

0,02,2Mo

Ma

Mo Ma

No dominant strategy

still two Nash equilibria

Page 9: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 9

wvdh 49

Ex: prisoner’s dilemma

� C: cooperate, and be silent

� D: justify against the other

� D dominates C

� D dominates C

� gives Nash equilibrium (-1,-1)

-1,-13,-2

-2,30,0C

D

C D

wvdh 50

Ex: Matching Pennies

� Head and Tail

� if different, 1 pays a Pound to 2; if the same, 2 pays a Pound to 1 1,-1-1,1

-1,11,-1H

T

H T

no dominant strategy

No pure Nash equilibrium

wvdh 51

mixed strategies

� don’t always bid in the same way with poker

� being inpredictable can be an advantage

� sometimes a strategy is not dominated by another pure strategy, but by a mixed one

wvdh 52

security level and strategy

� maximin for 1:

� 2, via s2� note: (s2,t2) is a saddlepoint

� then 2 is also security level of player 1

0,40,39,0s3

3,22,04,6s2

7,31,20,1s1

t3t2t1

wvdh 53

security level and strategy

� maximin for 1: 2 via s3� (s3,t2) is not a saddlepoint

� security level of 1 is indeed not 2, but 22/3

� How?4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1

wvdh 54

mixed strategies

� remove strictly dominated strategy s2

� 2 has no pure dominating strategy, but, .....

4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1

Page 10: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 10

wvdh 55

mixed strategies� 2 has no pure dominating strategy, but, .....

� q = (1/2,0,1/2) dominates t2strongly!

� π2(s1,q) = � (1/2)�0 + (1/2)�9 =4.5 > 4

� π2(s3,q) = � (1/2)�7 + (1/2)�0 =3.5 > 3

4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1

wvdh 56

mixed strategies� 2 has no pure dominating strategy, but, .....

� q = (1/2,0,1/2) dominates t2strongly!

� π2(s1,q) = � (1/2)�0 + (1/2)�9 =4.5 > 4

� π2(s3,q) = � (1/2)�7 + (1/2)�0 =3.5 > 3

4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1

wvdh 57

mixed strategies

� after iterated elimination

� what is security level of 1?

� suppose 1 plays mixed strategy (1-r,0,r)

4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1s

wvdh 58

mixed strategies

� suppose 1 playes mixed strategy (1-r,0,r)

� let Ek(r) be payoff of 1 if 2 plays tk:

� E1(r)= 1(1-r) + 3r = 1 + 2r

� E2(r)= 6(1-r) + 2r = 6 – 4r

� E3(r)= 0(1-r) + 4r = 4r

4,02,33,7s3

3,00,22,1s2

0,96,41,0s1

t3t2t1s

wvdh 59

mixed strategies

� E1(r)= 1(1-r) + 3r = 1 + 2r

� E2(r)= 6(1-r) + 2r = 6 – 4r

� E3(r)= 0(1-r) + 4r = 4r

� m(r) = min{E1,E2, E3}

0

1

2

3

4

5

6

1r →

1/2 5/63/4

max for r = 5/6payoff is E1(r) = 22/3

wvdh 60

Ex: Matching Pennies

� Head and Tail

� if different, 1 pays a Pound to 2; if the same, 2 pays a Pound to 1 1,-1-1,1

-1,11,-1H

T

H T

no dominant strategy

No Nash equilibrium

Page 11: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 11

wvdh 61

Ex: Matching Pennies

� Suppose r plays (0.5,0.5), and

� c plays (0.5,0.5)

1,-1-1,1

-1,11,-1H

T

H T

wvdh 62

Ex: Matching Pennies

� Suppose r plays (0.5,0.5), and

� c plays (0.5,0.5) � π1 would then be:

� 0.5 ⋅ 0.5 ⋅ 1 + 0.5 ⋅ 0.5 ⋅ -1 +� 0.5 ⋅ 0.5 ⋅ 1 + 0.5 ⋅ 0.5 ⋅ -1 = 0

� this is Nash: � would r play (q,1-q) then π1((q,1-q),(0.5,0.5)) =

� (q ⋅ 0.5 ⋅ 1) + (q ⋅ 0.5 ⋅ -1) +� ((1-q) ⋅ 0.5 ⋅ -1) + ((1-q) ⋅ 0.5 ⋅ 1) = 0 is not better

1,-1-1,1

-1,11,-1H

T

H T

wvdh 63

Ex: Matching Pennies

� Suppose r plays (0.5,0.5), and

� c plays (0.5,0.5) � π1 would then be:

� 0.5 ⋅ 0.5 ⋅ 1 + 0.5 ⋅ 0.5 ⋅ -1 +� 0.5 ⋅ 0.5 ⋅ 1 + 0.5 ⋅ 0.5 ⋅ -1 = 0

� no other Nash: � would r play (q,1-q) with q > 0.5 then c plays T; π1=

� (q ⋅ 0 ⋅ 1) + (q ⋅ 1 ⋅ -1) +� ((1-q) ⋅ 0 ⋅ -1) + ((1-q) ⋅ 1 ⋅ 1) = 1-(2 ⋅ q) < 0

1,-1-1,1

-1,11,-1H

T

H T

Game Theory: Part II: Extensive Games

Wiebe van der Hoek

Computer Science

University of Liverpool

United Kingdom

wvdh 65

Rules of the Game

root

terminal

When?

wvdh 66

Rules of the Game

When?

r

l

l

r

L

RR

R

MLLWhat?

I

I

II II

II

Who?w w

w

w

w

l

lHow much?

Page 12: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 12

wvdh 67

Example: tictactoe

ox

I

ox

o

II

ox

o

x

I

ox

o

x o

oxo

II

oxo xo

II

oxo xo

x o

II

oxo x

I

oxo xo

x

I

oxo xo

x o

xI

I

II

o o

x

ox

o x

o

x

o

xo

o

x

o o

x

o

x

I

II

II

o

I

I

wo

xo xo

x o

x o

d

o

II I

wvdh 68

Strategies

� A pure strategie for player p specifies for every decision note of p what he will do there

� If all players choose such a strategy, the outcome of the game is determined

wvdh 69

Strategies

� A pure strategie for player p specifies for every decision note of p what he will do there

� Strategies for I:r

l

l

r

L

RR

R

MLL

I

I

II II

IIw w

w

w

w

l

lllllllll lrlrlrlr rlrlrlrl rrrrrrrr

b

a

c d

e

wvdh 70

Strategies

� A pure strategie for player p specifies for every decision note of p what he will do there

� Strategies for II:

LLL, LLR, LML, LMR, LRL, LRR

RLL, RLR, RML, RMR, RRL, RRR

r

l

l

r

L

RR

R

MLL

I

I

II II

IIw w

w

w

w

l

l

b

a

c d

e

wvdh 71

Strategy profile

r

l

l

r

L

RR

R

MLL

I

I

II II

IIw w

w

w

w

l

l

b

a

c d

e

Example:

[lr,RMR]

wvdh 72

extensive to strategic

1

1

2A B

C D

E F

a b

c

d

dd

dd

cb

caAE

AF

BE

BF

DC

dd

cb

caAE

AF

B

DC

reduced strategic form

Page 13: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 13

wvdh 73

Equilibria: example

� Nash equilibria?

� via strategic form:1

2A B

L R

0,0 2,1

1,2

1,21,2

2,10,0A

B

L R

wvdh 74

Strategy profile

r

l

l

r

L

RR

R

MLL

I

I

II II

IIw w

w

w

w

l

l

b

a

c d

e

LL

w

w

w

w

RR RRRLMRMLLRLLRLMRMLLR

wwwwwwwwwwwrr

wwwwwlwwwwlrl

llllllwwwwwlr

llllllwwwwwll

RL G

strategic form of G extensive form of G

a strategy profile: [rr,RLL]

wvdh 75

Backward InductionI

w

II

w l

II

wwII

I

wl

G

w

I

w

II

wII

I

wl

G

l

I

w

II

wwII

w

G

l

I

w

II

ww

G

l

I

w

G

l

G I

w

wvdh 76

backward induction

I

w

II

w l

II

wwII

I

wl

G

wvdh 77

I can win

LLL

w

w

w

w

LRR RRRRRLRMRRMLRLRRLLLRLLMRLMLLLR

wwwwwwwwwwwrr

wwwwwlwwwwlrl

llllllwwwwwlr

llllllwwwwwll

again: rr is winning strategy,since that row only contains a w

wvdh 78

extensive games: definitions

� extensive games: G = ⟨ N‚H‚P,(≥i)⟩� N: set of players

� H histories: ∅, (ak)k=1..K (may be infinite)� closed under prefixes

� terminals Z: no successor or infinite

� P: H\Z → N player who is to move

� ≥i: preference relation on Z

Page 14: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 14

wvdh 79

extensive games: definitions

� N: set of players

� H histories: ∅, (ak)k=1..K (may be infinite)� closed under prefixes

� terminals Z: no successor or infinite

� h∈H, a action ⇒ (h,a) ∈H

� H is finite ⇒ G is finite

� H only contains finite h ⇒ G has finite horizon

wvdh 80

Subgame perfect solutions

� extensive games: Γ = ⟨ N‚H‚P,(≥i)⟩� N: set of player

� H histories: ∅, (ak)k=1..K (may be infinite)� P: H\Z → N player to play

� ≥i: preference relation on Z

� subgames: Γ(h) = ⟨ N‚H|h‚P|h,(≥i|h)⟩� all continuations of h

wvdh 81

Subgames

� history h

� subgame Γ(h)1

1

2A B

C D

E F

a b

c

d

wvdh 82

subgame perfect N.-eq

� let Γ = ⟨N‚H‚P,(≥i)⟩ extensive� s* is N.-eq if ∀i∀si O(s-i*,si*) ≥i O(s-i*,si)

� s* is subgame perfect N.-eq if� ∀i∀h∈H\Z (P(h)=i ⇒� Oh(s-i*|h,si *|h) ≥i|hO(s-i*|h,si))� for all strategies si for i in Γ(h)

� s*|h is N.-eq for all Γ(h)

wvdh 83

Equilibria: Example

� Nash equilibria?1

2A B

L R

0,0 2,1

1,2

wvdh 84

equilibria (ctd)

� so: (A,R) and (B,L)

� interpretation (B,L):

� given that 2 plays Lafter A, 1 better choose B

� intuitive?

� what is optimal for 1?

1

2A B

L R

0,0 2,1

1,2

1

2A B

L R

0,0 2,1

1,2

Page 15: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 15

wvdh 85

equilibria (ctd)

� so: (A,R) and (B,L)

� interpretation (B,L):

� given that 2 plays L after A, 1 better choose B

� AR is the only subgame perfect equilibrium

1

2A B

L R

0,0 2,1

1,2

1

2A B

L R

0,0 2,1

1,2

wvdh 86

equilibria (ctd)

� so: (A,R) and (B,L)

� interpretation (B,L):

� given that 2 plays L after A, 1 better choose B

� AR is the only subgame perfect equilibrium

� not BL!

1

2A B

L R

0,0 2,1

1,2

1

2A B

L R

0,0 2,1

1,2

wvdh 87

shop-chain game

� chain k and n competitors

� every competitor can either enter challenge k (i), or not (o)

� if so, k chooses between cooperate (c) and fight (f)

wvdh 88

shop-chain game

� chain k and n competitors

� every competitor can either enter challenge k (i), or not (o)

� if so, k chooses between cooperate (c) and fight (f)

n

C

i o

k

F 5,1

2,20,0

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

Page 16: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 16

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

k

F C

i o

2

k

F C

i o

2

10110

1511112

112

F C

i o

k

3

7120

12121

9122

F C

i o

3

k

5100

101017

10

2

F C

i o

k

3

5010

100117

012

F C

i o

k

3

2020

7021

4022

F C

i o

3

k

0000

50012

00

2

F C

i o

k

3

4220

92216

222

F C

i o

k

3

2200

7201

4202

F C

i o

3

k

7110

12211

9212

F C

i o

3

k

k

F C

i o

2

k

F C

1

i o

wvdh 96

shop-chain game

� subgame perfect equilibrium:

� all shops play i, chain k playc c

� not realistic, if many more shops to fight

� solution: shops should be uncertain about the motives of k

Page 17: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 17

wvdh 97

Backward Induction

1 12

1,1 2,2 3,3

r r r

d d d

0,0

wvdh 98

Backward Induction

1 12

1,1 2,2 3,3

r r r

d d d

0,0

wvdh 99

Backward Induction

1 12

1,1 2,2 3,3

r r r

d d d

0,0

wvdh 100

Backward Induction

1 12

1,1 2,2 3,3

r r r

d d d

0,0

wvdh 101

Centipede

� 1 and 2 divide n marbles; they choose in turn, if somebody picks two, the game is over

11 12 22

2,0 1,2 3,1 2,3 4,2 3,4

3,3e e e e e e

t t t t t t

wvdh 102

Centipede

� Intuitively correct?

11 12 22

2,0 1,2 3,1 2,3 4,2 3,4

3,3e e e e e e

t t t t t t

Page 18: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 18

wvdh 103

Strategic voting

� Boris, Horace and Maurice determine who can be a member of the Dead Poet Society

� proposal: allow Alice

� counterprop: allow Bob, rather than Alice

� first vote over counterprop, then over proposal

wvdh 104

Strategic voting

� first betwee A, B� winner Alice

� then between A, N� winner Alice

� strategic voting H:� first vote Bob!

� solution… B, N

Bob

Nobody

AliceBob

Alice

Nobody

Nobody

Alice

Bob

Borice

Horace

Maurice

wvdh 105

Strategic voting

� first between A, B� winner Alice

� then between A, N� winner Alice

� strategic voting H:� first vote Bob!

� solution…

� B, N

� M anticipates: vote forA

Bob

Nobody

AliceBob

Alice

Nobody

Nobody

Alice

Bob

Borice

Horace

Maurice

wvdh 106

Strategic voting

� first between A, B� winner Alice

� then between A, N� winnner Alice

Bob

Nobody

AliceBob

Alice

Nobody

Nobody

Alice

Bob

Borice

Horace

Maurice

132n

311b

223a

MHButility

wvdh 107

Strategic voting: extensive

132n

311b

223a

MHBuaab

aaa

aba

baa

bbb

bba

bab

abb

a b

nnb

nnn

nbn

bnn

bbb

bbn

bnb

nbb

aan

aaa

ana

naa

nnn

nna

nan

ann

a nb

312

213

231

wvdh 108

Strategic voting: extensive

aab

aaa

aba

baa

bbb

bba

bab

abb

a b

nnb

nnn

nbb

bnn

bbb

bbn

bnb

nbb

aan

aaa

ana

naa

nnn

nna

nan

ann

a nb

312

213

231

1

2 3

(aaa,aaa,xyz) is Nash

Page 19: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 19

wvdh 109

Strategic voting: extensive

aab

aaa

aba

baa

bbb

bba

bab

abb

a b

nnb

nnn

nba

bnn

bbb

bbn

bnb

nbb

aan

aaa

ana

naa

nnn

nna

nan

ann

a nb

312

213

231

1

2 3

H can do better: bxb

(aab,aab,nnb) is not Nash!

wvdh 110

Pirates on an island

� Five pirates p1, .... , p5 are on an island

� There is also a bag of 100 diamonds

� And hence, a need to distribute them

wvdh 111

Five Pirates: procedure

player i proposes a division Di over pi, ..., p5� with a majority for Di: so be it done

� no majority for Di: pi gets shot, we move on to pi+1

Now you are p1. What will D1 be? wvdh 112

Pirates on an island

� Assumptions: Any pirate

� values his life higher than 100 diamonds

� values 1 diamond higher than another’s life

� votes in favour of a proposal iff others are worse

wvdh 113

Voting agenda paradox

� 1: x > z > y; 2: y > x > z; 3: z > y > x

� 40% type 1, 30% type 2, 30% type 3� majority rule: x wins

XX YY

XX YY ZZZZ

XX

XX YY YY

ZZ

ZZ XX XX

YY

YY

ZZ

ZZ

binary protocol: chair decides!binary protocol: chair decides!

wvdh 114

Voting agenda paradox

� 1: x > z > y; 2: y > x > z; 3: z > y > x

� 40% type 1, 30% type 2, 30% type 3� majority rule: x wins

XX YY

XX YY ZZZZ

XX

XX YY YY

ZZ

ZZ XX XX

YY

YY

ZZ

ZZ

binary protocol: chair decides!binary protocol: chair decides!

Page 20: Literature Game Theory - University of Liverpool ...cgi.csc.liv.ac.uk/~wiebe/Teaching/Masters/gametheory.pdf · Game Theory 1 Game Theory ... wvdh 2 Literature Fun and Games Ken Binmore

Game Theory 20

wvdh 115

Pareto dominated paradox� 1: x > y > b > a

� 2: a > x > y > b

� 3: b > a > x > y

xx bb

xx bb yyyy

aa bb

aa bb yyyy

xx aa

wvdh 116

Pareto dominated paradox

� 1: x > y > b > a

� 2: a > x > y > b

� 3: b > a > x > y

xx bb

xx bb yyyy

aa bb

aa bb yyyy

xx aa

but for all, x > y !!but for all, x > y !!

wvdh 117

Borda protocol

� allocate points: 4, 3, 2, 1.� 1: x > c > b > a

� 2: a > x > c > b

� 3: b > a > x > c

� 4: x > c > b > a

� 5: a > x > c > b

� 6: b > a > x > c

� 7: x > c > b > a

Σ: Σ: x : 22, a : 17, b: 16, c: 15x : 22, a : 17, b: 16, c: 15

If x withdrIf x withdrawsaws::

c: 15, b: 14, a:13 !!!c: 15, b: 14, a:13 !!!

wvdh 118

Arrow's theorem

� m agents, each with preference ≤i over D

� Wanted: � G(≤1, ...... , ≤m , D) = ≤

wvdh 119

Arrow's theorem

1 completeness: x ≤ y or y ≤ x

2 transitivity: if x ≤ y ≤ z, then x ≤ z

3 unrestricted domain: all ≤ satifsy 1 and 2

4 Pareto: if ∀ i, x ≤i y, then x ≤ y

5 independece of irrelevant choices if ≤i is as ≤i’ regarding x and y,

then ≤ = ≤‘ regarding x and y

6 no dictator: no i completely determines ≤

ii≤≤

ii<<

ii≤≤ ''

ii≤≤

It is impossible to generate suchIt is impossible to generate suchaa ≤ !