CSC304 Lecture 4 Game Theory
Transcript of CSC304 Lecture 4 Game Theory
CSC304 Lecture 4
Game Theory (Cost sharing & congestion games,
Potential function, Braessβ paradox)
CSC304 - Nisarg Shah 1
Recap
CSC304 - Nisarg Shah 2
β’ Nash equilibria (NE)
β’ No agent wants to change their strategy
β’ Guaranteed to exist if mixed strategies are allowed
β’ Could be multiple
β’ Pure NE through best-response diagrams
β’ Mixed NE through the indifference principle
Worst and Best Nash Equilibria
CSC304 - Nisarg Shah 3
β’ What can we say after we identify all Nash equilibria?
β’ Compute how βgoodβ they are in the best/worst case
β’ How do we measure βsocial goodβ?
β’ Game with only rewards?Higher total reward of players = more social good
β’ Game with only penalties?Lower total penalty to players = more social good
β’ Game with rewards and penalties?No clear consensusβ¦
Price of Anarchy and Stability
CSC304 - Nisarg Shah 4
β’ Price of Anarchy (PoA)
βWorst NE vs optimumβ
Max total reward
Min total reward in any NE
or
Max total cost in any NE
Min total cost
β’ Price of Stability (PoS)
βBest NE vs optimumβ
Max total reward
Max total reward in any NE
or
Min total cost in any NE
Min total cost
PoA β₯ PoS β₯ 1
Revisiting Stag-Hunt
CSC304 - Nisarg Shah 5
β’ Max total reward = 4 + 4 = 8
β’ Three equilibria
β’ (Stag, Stag) : Total reward = 8
β’ (Hare, Hare) : Total reward = 2
β’ ( Ξ€13 Stag β Ξ€2
3 Hare, Ξ€13 Stag β Ξ€2
3 Hare)
o Total reward = 1
3β
1
3β 8 + 1 β
1
3β
1
3β 2 β (2,8)
β’ Price of stability? Price of anarchy?
Hunter 1Hunter 2 Stag Hare
Stag (4 , 4) (0 , 2)
Hare (2 , 0) (1 , 1)
Revisiting Prisonerβs Dilemma
CSC304 - Nisarg Shah 6
β’ Min total cost = 1 + 1 = 2
β’ Only equilibrium:
β’ (Betray, Betray) : Total cost = 2 + 2 = 4
β’ Price of stability? Price of anarchy?
SamJohn Stay Silent Betray
Stay Silent (-1 , -1) (-3 , 0)
Betray (0 , -3) (-2 , -2)
Cost Sharing Game
CSC304 - Nisarg Shah 7
β’ π players on directed weighted graph πΊ
β’ Player π
β’ Wants to go from π π to π‘π
β’ Strategy set ππ = {directed π π β π‘π paths}
β’ Denote his chosen path by ππ β ππ
β’ Each edge π has cost ππ (weight)
β’ Cost is split among all players taking edge π
β’ That is, among all players π with π β ππ
1
1 1
1π 1
π‘1
10
π 2
π‘2
1010
Cost Sharing Game
CSC304 - Nisarg Shah 8
β’ Given strategy profile π, cost ππ π to
player π is sum of his costs for edges π β ππ
β’ Social cost πΆ π = Οπ ππ π
β’ Note: πΆ π = ΟπβπΈ π
ππ, whereβ¦
β’ πΈ(π)={edges taken in π by at least one player}
β’ Why?
1
1 1
1π 1
π‘1
10
π 2
π‘2
1010
Cost Sharing Game
CSC304 - Nisarg Shah 9
β’ In the example on the right:
β’ What if both players take direct paths?
β’ What if both take middle paths?
β’ What if one player takes direct path and the other takes middle path?
β’ Pure Nash equilibria?
1
1 1
1π 1
π‘1
10
π 2
π‘2
1010
Cost Sharing: Simple Example
CSC304 - Nisarg Shah 10
β’ Example on the right: π players
β’ Two pure NE
β’ All taking the n-edge: social cost = π
β’ All taking the 1-edge: social cost = 1o Also the social optimum
β’ Price of stability: 1
β’ Price of anarchy: π
β’ We can show that price of anarchy β€ π in every cost-sharing game!
s
t
π 1
Cost Sharing: PoA
CSC304 - Nisarg Shah 11
β’ Theorem: The price of anarchy of a cost sharing game is at most π.
β’ Proof:
β’ Suppose the social optimum is (π1β, π2
β, β¦ , ππβ), in which
the cost to player π is ππβ.
β’ Take any NE with cost ππ to player π.
β’ Let ππβ² be his cost if he switches to ππ
β.
β’ NE β ππβ² β₯ ππ (Why?)
β’ But : ππβ² β€ π β ππ
β (Why?)
β’ ππ β€ π β ππβ for each π β no worse than π Γ optimum
β
Cost Sharing
CSC304 - Nisarg Shah 12
β’ Price of anarchy
β’ Every cost-sharing game: PoA β€ π
β’ Example game with PoA = π
β’ Bound of π is tight.
β’ Price of stability?
β’ In the previous game, it was 1.
β’ In general, it can be higher. How high?
β’ Weβll answer this after a short detour.
Cost Sharing
CSC304 - Nisarg Shah 13
β’ Nashβs theorem shows existence of a mixed NE.
β’ Pure NE may not always exist in general.
β’ But in both cost-sharing games we saw, there was a PNE.
β’ What about a more complex game like the one on the right?
10 players: πΈ β πΆ27 players: π΅ β π·19 players: πΆ β π·
ED
A
7
B
C60
12
32
10
20
Good News
CSC304 - Nisarg Shah 14
β’ Theorem: Every cost-sharing game have a pure Nash equilibrium.
β’ Proof:
β’ Via βpotential functionβ argument
Step 1: Define Potential Fn
CSC304 - Nisarg Shah 15
β’ Potential function: Ξ¦ βΆ Οπ ππ β β+
β’ This is a function such that for every pure strategy profile π = π1, β¦ , ππ , player π, and strategy ππ
β² of π,
ππ ππβ², πβπ β ππ π = Ξ¦ ππ
β², πβπ β Ξ¦ π
β’ When a single player π changes her strategy, the change in potential function equals the change in cost to π!
β’ In contrast, the change in the social cost πΆ equals the total change in cost to all players.o Hence, the social cost will often not be a valid potential function.
Step 2: Potential Fn β pure Nash Eq
CSC304 - Nisarg Shah 16
β’ A potential function exists β a pure NE exists.
β’ Consider a π that minimizes the potential function.
β’ Deviation by any single player π can only (weakly) increase the potential function.
β’ But change in potential function = change in cost to π.
β’ Hence, there is no beneficial deviation for any player.
β’ Hence, every pure strategy profile minimizing the potential function is a pure Nash equilibrium.
Step 3: Potential Fn for Cost-Sharing
CSC304 - Nisarg Shah 17
β’ Recall: πΈ(π) = {edges taken in π by at least one player}
β’ Let ππ (π) be the number of players taking π in π
Ξ¦ π =
πβπΈ(π)
π=1
ππ(π)ππ
π
β’ Note: The cost of edge π to each player taking π is
ππ/ππ(π). But the potential function includes all
fractions: ππ/1, ππ/2, β¦, ππ/ππ π .
CSC304 - Nisarg Shah 18
Ξ¦ π =
πβπΈ(π)
π=1
ππ(π)ππ
π
β’ Why is this a potential function?
β’ If a player changes path, he pays ππ
ππ π +1for each new
edge π, gets back ππ
ππ πfor each old edge π.
β’ This is precisely the change in the potential function too.
β’ So Ξππ = ΞΞ¦.
β
Step 3: Potential Fn for Cost-Sharing
Potential Minimizing Eq.
CSC304 - Nisarg Shah 19
β’ Minimizing the potential function gives some pure Nash equilibrium
β’ Is this equilibrium special? Yes!
β’ Recall that the price of anarchy can be up to π.
β’ That is, the worst Nash equilibrium can be up to π times worse than the social optimum.
β’ A potential-minimizing pure Nash equilibrium is better!
Potential Minimizing Eq.
CSC304 - Nisarg Shah 20
πβπΈ(π)
ππ β€ Ξ¦ π =
πβπΈ(π)
π=1
ππ(π)ππ
πβ€
πβπΈ(π)
ππ β
π=1
π1
π
Social cost
βπ, πΆ π β€ Ξ¦ π β€ πΆ π β π» π
πΆ πβ β€ Ξ¦ πβ β€ Ξ¦ πππ β€ πΆ πππ β π»(π)
Harmonic function π»(π)= Οπ=1
π 1/π = π(log π)
Potential minimizing eq. Social optimum
Potential Minimizing Eq.
CSC304 - Nisarg Shah 21
β’ Potential-minimizing PNE is π(log π)-approximation to the
social optimum.
β’ Thus, in every cost-sharing game, the price of stability is
π log π .
β’ Compare to the price of anarchy, which can be π
Congestion Games
CSC304 - Nisarg Shah 22
β’ Generalize cost sharing games
β’ π players, π resources (e.g., edges)
β’ Each player π chooses a set of resources ππ (e.g., π π β π‘π
paths)
β’ When ππ player use resource π, each of them get a cost
ππ(ππ)
β’ Cost to player is the sum of costs of resources used
Congestion Games
CSC304 - Nisarg Shah 23
β’ Theorem [Rosenthal 1973]: Every congestion game is a potential game.
β’ Potential function:
Ξ¦ π =
πβπΈ(π)
π=1
ππ π
ππ π
β’ Theorem [Monderer and Shapley 1996]: Every potential game is equivalent to a congestion game.
Potential Functions
CSC304 - Nisarg Shah 24
β’ Potential functions are useful for deriving various results
β’ E.g., used for analyzing amortized complexity of algorithms
β’ Bad news: Finding a potential function that works may be hard.
The Braessβ Paradox
CSC304 - Nisarg Shah 25
β’ In cost sharing, ππ is decreasing
β’ The more people use a resource, the less the cost to each.
β’ ππ can also be increasing
β’ Road network, each player going from home to work
β’ Uses a sequence of roads
β’ The more people on a road, the greater the congestion, the greater the delay (cost)
β’ Can lead to unintuitive phenomena
The Braessβ Paradox
CSC304 - Nisarg Shah 26
β’ Parkes-Seuken Example:
β’ 2000 players want to go from 1 to 4
β’ 1 β 2 and 3 β 4 are βcongestibleβ roads
β’ 1 β 3 and 2 β 4 are βconstant delayβ roads
1 4
2
3
The Braessβ Paradox
CSC304 - Nisarg Shah 27
β’ Pure Nash equilibrium?β’ 1000 take 1 β 2 β 4, 1000 take 1 β 3 β 4β’ Each player has cost 10 + 25 = 35β’ Anyone switching to the other creates a greater
congestion on it, and faces a higher cost
1 4
2
3
The Braessβ Paradox
CSC304 - Nisarg Shah 28
β’ What if we add a zero-cost connection 2 β 3?
β’ Intuitively, adding more roads should only be helpful
β’ In reality, it leads to a greater delay for everyone in the unique equilibrium!
1 4
2
3
π23 π23 = 0
The Braessβ Paradox
CSC304 - Nisarg Shah 29
β’ Nobody chooses 1 β 3 as 1 β 2 β 3 is better irrespective of how many other players take it
β’ Similarly, nobody chooses 2 β 4
β’ Everyone takes 1 β 2 β 3 β 4, faces delay = 40!
1 4
2
3
π23 π23 = 0
The Braessβ Paradox
CSC304 - Nisarg Shah 30
β’ In fact, what we showed is:
β’ In the new game, 1 β 2 β 3 β 4 is a strictly dominant strategy for each player!
1 4
2
3
π23 π23 = 0