S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria...

S. Ceppi, N. Gatti, and N. BasilicoDEI, Politecnico di Milano

Computing Bayes-Nash Equilibria through Support Enumeration Methods in Bayesian

Two-Player Strategic-Form Games

Sofia Ceppi, Nicola Gatti, and Nicola BasilicoDipartimento di Elettronica e Informazione,

Politecnico di Milano{ceppi, ngatti, basilico}@elet.polimi.it

Outline

• State of the Art

– What is a Bayesian game

– Why to study Bayesian games

• Original Contributions

– Extensions of existing algorithms for Bayesian games

– B-PNS algorithm

• Experimental Evaluation

• Conclusions and Future Contributions

Bayesian Games

• What is a Bayesian Game?

– Non-cooperative game

– A game wherein information is uncertain

Type 2.1

2, 72, 7 9, 49, 4

3, 53, 5 2, 32, 3

ω2.1 = 0.3 Type 2.2

2, 72, 7 9, 89, 8

3, 53, 5 1, 31, 3

ω2.2 = 0.7

Player 2 Player 2

Bayesian Games

• Why to study Bayesian Games?

– Most real world strategic situations present uncertainty and therefore can be modeled as Bayesian games, e.g.,

• Negotiation settings: bilateral bargaining and auctions

• Security settings: strategic mobile robot patrolling

– The literature does not study algorithms for computing Bayes-Nash equilibria in depth [Shoham and Leyton-Brown, 2008]

State of the Art

• Solution concept for Bayesian games is Bayes-Nash equilibrium

• A Bayesian game is solved by reducing it to a complete-information game and then computing a Nash equilibrium in this game

• The literature provides a detailed comparison of the algorithms for the computation of Nash equilibria in complete-information games

• The exact algorithms for two-player complete-information strategic-form games are:

– LH: based on linear complementary programming [Lemke-Howson, 1964]

– PNS: based on support enumeration [Porter et al., 2004]

– SGC: based on mixed integer linear programming [Sandholm et al., 2005]

Bayesian Game Peculiarities

• The experimental results provided from the literature for computing Nash equilibria cannot be generalized to Bayesian case. The main reasons are:

– Bayesian games can present characteristics (e.g., existence of equilibria with small supports) different from those of complete-information games

– The reduction to complete-information games raises several problems in the application of algorithms for computing Nash equilibria [Koller and Megiddo, 1996]

Original Contributions

• Extension of the algorithms existing in the literature for the computation of Bayes-Nash equilibrium

– PNS → B-PNS (the main result)

– LH → B-LC

– SGC → B-SGC

PNS Algorithm

• The support Si of an agent i is the set of actions played by i with non-null probability

• The joint support S is the set of single agents’ support

• To derive the B-PNS algorithm we modified all the three parts of PNS algorithm

STEP 1:Choosing S

(EnumerationCriteria)

STEP 1:Choosing S

(EnumerationCriteria)

STEP 2:Pruning

(ConditionalDominance)

STEP 2:Pruning

(ConditionalDominance)

STEP 3:Equilibrium

Checking(FeasibilityProblem)

STEP 3:Equilibrium

Checking(FeasibilityProblem)

notdominated

feasible

dominated notfeasible

Supports

• Supports for player 1: S1=(a), S1=(b), S1=(a,b)

• Supports for type 1 of player 2: S2.1=(c), S2.1=(d), S2.1=(c,d)

• Supports for type 2 of player 2: S2.2=(c), S2.2=(d), S2.2=(c,d)

• Joint support: S={S1,S2.1,S2.2} → S={ (a), (d), (c,d) }

• Goal: enumerate the joint supports and check if they are of equilibrium

• How to enumerate the joint supports?

Type 2.1

2, 72, 7 9, 49, 4

3, 53, 5 2, 32, 3

ω2.1 = 0.3 Type 2.2

2, 72, 7 9, 89, 8

3, 53, 5 1, 31, 3

ω2.2 = 0.7

Player 2 Player 2P

Step 1: Heuristics

• Balance

– Non Bayesian games: |S1|-|S2| → If S1=(a), S2=(c) the balance is 0

– We call – In Bayesian games the balance is

If S1=(a,b), S2.1=(c), S2.2=(c,d) the balance is 0

– Increasing order of balance

• Size– The size of a player is the sum of all the actions played with non-

null probability by all the types of the player– Increasing order of size

Step 1: Peak Criterion (1)

• Open Issue: – Given the values of balance and size, ranking a player’s supports– Example:

Balance = 0

Size = 7

Player 1’s types = 3

Actions = {a,b,c,d,e}

→ S1 = { (a), (a,b,c,d,e), (c) }

→ S1 = { (a,c), (a,b,c), (c,e) }

• Peak Criterion– Based on the size of types’ supports– The peak is the size of the maximum possible support– Decreasing criterion and increasing criterion

• We use an enumeration tree to order the supports where each node defines the size of all the types. (e.g. |S1|, |S2.1|,|S2.2|)

Size = 7 Types = 3 Available Actions = 5

Step 1: Peak Criterion (2)

Step 2: Pruning Techniques

• The problem of checking whether or not an action is strictly conditionally Bayesian dominated by another action can be formulated as a linear feasibility problem

• In our case, it can be formulated as a fractional knapsack problem and then solved in linear time in the number of variables

Type 2.1

2, 72, 7 9, 49, 4

3, 53, 5 2, 32, 3

ω2.1 = 0.3 Type 2.2

2, 72, 7 9, 89, 8

3, 53, 5 1, 31, 3

ω2.2 = 0.7

• Action a is strictly conditionally Bayesian dominated by action a’ if for every σ-i | S-i

Player 2 Player 2

• Given S-i = {S2.1 = (c), S2.2 = (d)}

EU1(a) = ω2.1 · 2 + ω2.2 · 9

EU1(b) = ω2.1 · 3 + ω2.2 · 1

EU1(a) > EU1(b)

Step 3: B-PNS Feasibility Problem (1)

• Linear feasibility problem used for checking if a joint support is of equilibrium

Step 3: B-PNS Feasibility Problem (2)

• The problem with support S = { (a), (c,d), (d) } is infeasible

• The problem with support S = { (a,b), (c), (c,d) } is feasible:

– the probabilities of the actions are:

player 1: p(a) = 0.667 p(b)= 0.333

type 1 player 2: p(c) = 1 p(d) = 0

type 2 player 2: p(c) = 0.841 p(d) = 0.159

Type 2.1

2, 72, 7 9, 49, 4

3, 53, 5 2, 32, 3

ω2.1 = 0.3 Type 2.2

2, 72, 7 9, 89, 8

3, 53, 5 1, 31, 3

ω2.2 = 0.7

Player 2 Player 2P

Experimental Evaluation

• We developed a tool based on GAMUT to generate Bayesian games

• We compared computational time in:

– Different configurations of B-PNS

– PNS and B-PNS

– B-PNS, B-SGC, and B-LC

Experimental Results

Conclusions

• We focus on the computation of equilibria in Bayesian games

• This class of game is important since most strategic real-world situations can be modeled as a Bayesian game

• Computing Nash equilibria in complete-information games is inefficient when the game is Bayesian

• We extend the algorithms used for the computation of Nash equilibria for the Bayesian games

• We focus on B-PNS

• We experimentally evaluate the Bayesian algorithms

Future Contributions

• Improvement of support enumeration methods using algorithms based on local search techniques

– Non-Stochastic

– Stochastic

• Application to open problems

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria...

Documents

Transcript of S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria...

Alterazioni Video Giarre Gabriele Basilico Paul Virilio Wu ... · PDF filericerca di / research by photo byAlterazioni Video foto di / Gabriele Basilico Sicilian Incompletion Collective

Anatomy in Nature By Juan Gatti

Sontag, Song to the Dark Virgin, and Si tra i ceppi

Flavio Gatti,CSN1, Feb 6 th, 06 1 TC status. Flavio Gatti UNIVERSITY and INFN of Genoa, Italy.

entrevista Bernardete Gatti

The Basilico Villa · VILLA MAGNOLIA PORTO HELI, GREECE info@brightbluevillas.com tel: +30 6944 796 796 Basilico Villa is the main house of Vineyard Villa set …

Gatti, Las Modalidades Débiles de La Identidad

Cat Duet (Duetto di duo buffo gatti)

Circolo dei gatti libri 2014

Artigo Grupo Focal Gatti 1 e 2

Kt gatti view's on childeren litterature

PR Omaggio a basilico - studiolacitta.it

GAS PROCESSING - GATTI S.p.A

A. Ceppi, G. Ravazzani , M. Mancini - Arpae · Il modello idrologico distribuito FEST-WB A. Ceppi, G. Ravazzani , M. Mancini Bologna, 30 maggio 2011 Politecnico di Milano – D.I.I.A.R.

Sandra Johnson, M.A. & M.S. Susie Ceppi- Bussmann, Ph.D. New Mexico State University.

Alterazioni Video Giarre Gabriele Basilico Paul Virilio Wu ... · ricerca di / research by photo byAlterazioni Video foto di / Gabriele Basilico Sicilian Incompletion Collective Alterazioni

TABLE OF CONTENTS ARTURO GATTI AND MICKEY WARD …Gatti came roaring back but Ward stayed right with him as they traded belts until the bell. I scored this close round for Gatti but

Navigation Strategies for Exploration and Patrolling - Nicola Basilico

Fresa trincia ceppi Stumps cutter - Home | Agri World...FTC Fresa trincia ceppi Stumps cutter Las Fresadoras de tocones son máquinas diseñadas y construidas para triturar, no sólo

Developing a Deterministic Patrolling Strategy for Security Agents Nicola Basilico, Nicola Gatti, Francesco Amigoni.