Transition Models of Equilibrium Assessment in Bayesian Game

December 18 Fri., 2015, 09:10-09:30, Regular Session: Modeling 1, Frb04.3 @ 802

Transition Models of Equilibrium Assessment in Bayesian Game

Kiminao KogisoUniversity of Electro-Communications

Tokyo, Japan

The 54 Conference on Decision and ControlOsaka International Convention Center, Osaka, Japan

December 15 to 18, 2015

Supported by JSPS Grant-in-Aid for Challenging Exploratory Research

2014 to 2016

Outline

2

Introduction Static Bayesian Game Novel Form in Bayesian Nash Equilibrium Dynamics in Equilibrium Assessment Simulation Conclusion

Introduction

3

Strategic game enabling to consider uncertainties in player’s decisions. player: a reasonable decision maker

action: what a player chooses

utility: a player’s preference over the actions

type: a label of player’s private valuation (what the player really feels)

belief: a probability distribution over the types (degree of feeling, tendency, proclivity,…)

Static Bayesian Game[1]

[1] Harsanyi, 1967. [2] Alpcan and Basar, et al., 2011, 2013. [3] Roy, et al., 2010. [4] Liu, et al., 2006. [5] Akkarajitsakul, et al., 2011.

A Bayesian game used in engineering problems to analyze a Bayesian Nash equilibrium or to design a game mechanism. network security[2,3], intrusion detection[4,5,6], belief learning[7]

electricity pricing[8,9], mechanism design[10]

[6] Sedjelmachi, et al., 2014, 2015. [7] Nachbar, 2008. [8] Li, et al., 2011, 2014. [9] Yang, et al., 2013. [10] Tao, et al., 2015.

Introduction

4

Insufficient tools and concepts[11]

Bayesian Nash equilibrium plays key roles in game analysis & design. equilibrium analysis: for given belief, find a Bayesian Nash Equilibrium(BNE).

belief learning: for given BNE, find a corresponding belief.

mechanism design: for given utility, find rules to achieve a desired BNE.

Objective of this talkDerive a dynamical state-space model whose state involves a BNE.derive a novel condition related to the BNE,

discover a map (discrete-time system) defined by the novel condition,

confirm a time response of the map.

[11] Powell, 2011.

Challenge: prepare tools & concepts to apply our model-based fashion to analysis and design of the game.

Bayesian Game

Player set

Action set

Type set

Utility

Strategy (mixed)

Belief

Static Bayesian Game: General

5

Two-player two-action Bayesian game w/ two types G(N ,A,⇥, u, µ, S)

N := {1, 2}

A := A1 ⇥A2

⇥ := ⇥1 ⇥⇥2

u := (u1, u2)

µ := (µ1, µ2)

S := (S1, S2)

ai 2 Ai := {a, a} 8i 2 N

✓i 2 ⇥i := {✓, ✓} 8i 2 N

µi 2 ⇧(⇥i) 8i 2 N

Si : ⇥i ! ⇧(Ai) 8i 2 Nsi 2 Si(⇥i) 8i 2 N

⇧(X) : a probability distribution over a finite set X

Ui(✓i, ✓�i) :=

ui(a, a, ✓i, ✓�i) ui(a, a, ✓i, ✓�i)ui(a, a, ✓i, ✓�i) ui(a, a, ✓i, ✓�i)

�: utility matrix8i 2 N , 8✓ 2 ⇥

ui : A⇥⇥ ! < 8i 2 N

i 2 N

Static Bayesian Game: Example

6

Service of tennis

2, 2 0, 1

1, 21, 1

flat

spin

flat spin

0, 1 1, 2

0, 11, 2

flat

spin

flat spin

side

line 1, 0 1, 1

2, 00, 1

flat

spin

flat spin

1, 3 1, 2

0, 32, 2

flat

spin

flat spin

cent

er li

ne s1(a|✓)

s1(a|✓)

s1(a|✓)

s1(a|✓)

s2(a|✓) s2(a|✓)s2(a|✓)s2(a|✓)center line ✓ side line ✓

✓✓

a

a a

a

a a

aaa

a a

a

µ1(✓)

µ1(✓)

µ2(✓)µ2(✓)

a

a a

a

typebelief

Bayesian Nash Equilibrium

7

Equilibrium assessmentdefinitions of Bayesian Nash Equilibrium(BNE) using an ex-ante expected utility:

using a best response to opponent strategy:

EUi(si, s�i) � EUi(s0, s�i) 8s0i 2 Si, s0i 6= si

is denoted as the Bayesian Nash equilibrium.

A strategy profile , satisfying , is also a BNE.s = (si, s�i)

Given a prior common probability , for any , the strategy satisfyingi 2 N sp(µ)

si 2 BRi(s�i, µ) 8i 2 N

the pair of considered as key variables of the Bayesian game.

Equilibrium Assessment : a pair of a belief and the corresponding BNE.(µ, s)

(µ, s)

equilibrium analysis[10]: find a BNE .s9 µ,

[10] Y. Shoham and K. Leyton-Brown, Multiagent Systems, Cambridge University Press, 2009.

then the pair is an Equilibrium Assessment, where ,✏ :=⇥1 �1

⇤(µ, s)

Novel Form Satisfying BNE

8

If the game satisfies the following condition (simultaneous polynomial in ):

Sufficient condition to be BNELemma

8✓i 2 ⇥i, 8i 2 N

G

✏⇣i(s�i, ✓i) (✓i)p(µ) = 0

⇣i(s�i, ✓i) :=⇥Ui(✓i, ✓)s�i(✓) Ui(✓i, ✓)s�i(✓)

⇤,

(✓) :=

1 0 0 00 1 0 0

�, (✓) :=

0 0 1 00 0 0 1

�.

idea: derived from KKT condition of BNE by cancelation of Lagrangian variables.

point: # of the polynomials: 4, # of the variables: 6; D.O.F. in determining their values.

note: a BNE (mixed strategy) holds the above equation, but some of pure strategy BNEs do not hold it.

µ

all of EAs

Discover Dynamics!

9

Map from EA to EAIdea to derive dynamics in EA

all of EAs

⇥EA

(µ, s)

satisfyingthe Lemma

all of EAs

⇥EA

(µ+ �µ, s+ �s)

satisfyingthe Lemma

Given an initial EA, if there exists such that the game satisfies the following condition w.r.t. utility matrices: ,

Dynamics in Equilibrium Assessment

10

Main resultTheorem

⇥1 �1

⇤Ui(✓i, ✓)

�1

1� �1

�= 0

⇥1 �1

⇤Ui(✓i, ✓)

�2

1� �2

�= 0

8✓i 2 ⇥i8i 2 N

µ(k + 1) = diag(A1, A2)µ(k)

s(k + 1) = A�(ci(k))si(k)

ci(k) :=µi(✓i, k + 1)

µi(✓i, k), and is a row stochastic matrix.Ai 2 <2⇥2 8i 2 N

� = [�1 �2]T 2 <2

then a nonlinear autonomous system in terms of the equilibrium assessment:

transfers from an EA to another EA , where (µ(k), s(k)) (µ(k + 1), s(k + 1))

ci(k) ! 1

A�(1) = I

ci(k) :=µi(✓i, k + 1)

µi(✓i, k)µ(k + 1) = diag(A1, A2)µ(k) s(k + 1) = A�(ci(k))si(k)µ(k)

µ(k + 1)stable linear system: time-varying system:· s(k)

µ(k)

Simulation

11

Trajectory of equilibrium assessment

0.3

0.4

0.5

0.6

0.7

10

0.2

0.4

0.6

0.8

probability

probability

step0 1 2 3 4 5 6 7 8 9

100 1 2 3 4 5 6 7 8 9

s1( )a|θ−− s1( )a|θ−−

s1( )a|θ−− s1( )a|θ−−

s2( )a|θ−− s2( )a|θ−−

s2( )a|θ−− s2( )a|θ−−

0.4

0.6

0.8

1.0

1.2

1.4

1.6

1.8

the proposed model

the computation of the best respose (4)

10

expec

ted u

tility

val

ue

step0 1 2 3 4 5 6 7 8 9

EU1

EU2

0.4

0.6

0.8

1.0

1.2

1.4

1.6

1.8

10

step0 1 2 3 4 5 6 7 8 9

valu

e of

ci(

)θ i

c ( )1 θ−c ( )1 θ−

c ( )2 θ−c ( )2 θ−

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

10

probability

step0 1 2 3 4 5 6 7 8 9

1( )η θ−

1( )η θ−

2( )η θ−

2( )η θ−

belief strategy

c expectedutility

Conclusion

12

Introduction Static Bayesian Game two-players two-actions game with two-types

New Form in Bayesian Nash Equilib. polynomial conditions in equilibrium assessment

Dynamics in Equilibrium Assessment discrete-time autonomous time-varying system convergence of the EA (stability)

Simulation confirms states updated become EA and converge.

Future works estimate player’s belief for a given BNE, and realize a control-theoretic mechanism design method.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

10

probability

step0 1 2 3 4 5 6 7 8 9

1( )η θ−

1( )η θ−

2( )η θ−

2( )η θ−

belief

0.3

0.4

0.5

0.6

0.7

10

0.2

0.4

0.6

0.8

probability

probability

step0 1 2 3 4 5 6 7 8 9

100 1 2 3 4 5 6 7 8 9

s1( )a|θ−− s1( )a|θ−−

s1( )a|θ−− s1( )a|θ−−

s2( )a|θ−− s2( )a|θ−−

s2( )a|θ−− s2( )a|θ−−

strategy

µ(k + 1) = diag(A1, A2)µ(k)

s(k + 1) = A�(ci(k))si(k)

Dynamics in EA:

Transition Models of Equilibrium Assessment in Bayesian Game

Engineering

Transcript of Transition Models of Equilibrium Assessment in Bayesian Game