A Mechanism Design Approach for the Stabilization of Networked dynamical systems

23
A Mechanism Design Approach for the Stabilization of Networked dynamical systems L. Galbusera, N. Gatti, C. Romani Dipartimento di Elettronica e Informazione – Politecnico di Milano e-mail: galbusera, ngatti, [email protected] 48th IEEE Conference on Decision and Control 48th IEEE Conference on Decision and Control Shanghai, China December 16-18, 2009

description

48th IEEE Conference on Decision and Control Shanghai, China December 16-18, 2009. A Mechanism Design Approach for the Stabilization of Networked dynamical systems L. Galbusera, N. Gatti, C. Romani - PowerPoint PPT Presentation

Transcript of A Mechanism Design Approach for the Stabilization of Networked dynamical systems

Page 1: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

A Mechanism Design Approachfor the Stabilization of

Networked dynamical systems

L. Galbusera, N. Gatti, C. Romani

Dipartimento di Elettronica e Informazione – Politecnico di Milanoe-mail: galbusera, ngatti, [email protected]

48th IEEE Conference on Decision and Control48th IEEE Conference on Decision and Control

Shanghai, ChinaDecember 16-18, 2009

Page 2: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

2Networked control system (NCS)Networked control system (NCS)

Elements:

N linear continuous-time subplants with unstable uncontrolled dynamics.

A bus communication medium.

N controllersdesigned to stabilize each subplant.

Standing assumption: at each time instant, only one subplant is connected to its

controller

Control objective:

Synthesis of an effective dynamic scheduling policy (not preassigned).

Page 3: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

3Networked control system (NCS)Networked control system (NCS)

Previous literature on dynamic scheduling policies:

the scheduling is usually assigned in a centralized manner by comparing systems’ states and parameters(e.g., the CLS-ε policy in [Hristu-Varsakelis, CDC 2001]).

Real-world applications:

the subplants can be modeled as strategic players in a game for having access to the communication medium.

t

AUCTION FOR ACCESSING THE

MEDIUM AT TIME t*

S1 S2 SN

t*

PLAYERS REPORT THEIR(NOT-NECESSARILY

TRUE)CURRENT STATES

Page 4: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

4Networked control system (NCS)Networked control system (NCS)

t

AUCTION FOR ACCESSING THE

MEDIUM AT TIME t*

S1

S2

SN

t*

Control objectivesControl objectives

1.Stability of the NCS

2.Efficient allocation of the communication medium

3.Avoiding strategic behaviors of the players

PLAYERS ARESELF-INTERESTED

THEY REPORT THEIR(NOT-NECESSARILY

TRUE)CURRENT STATES

Page 5: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

5Preliminaries: stability in NCSPreliminaries: stability in NCS

t

Dynamical model of subsystem i:

Control law:

Time: T

S1

S2

j-th time interval of lenght T

Page 6: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

6Preliminaries: stability in NCSPreliminaries: stability in NCS

Stability condition:

Further assumption:

Period T is discretized in M regular time intervals for executing the auctions.

Lower bound to control subintervals

tT

1 2 3 4 5 … M

AUCTIONS

Page 7: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

7

if player i pays

Groundings on mechanism designGroundings on mechanism design

ALTERNATIVES(= possible outcomes of the game)

PLAYER i

MECHANISM

TRUE EVALUATIONof player i over the set of alternatives

REPORTED EVALUATION of player i over the set of alternatives

(other players)

PAYMENT of player i

MONETARY RESOURCES

of player i

A player can participate to the auction only if

Reference: [Fudemberg & Tirole, Game theory, The MIT Press, 1991]

Page 8: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

8Groundings on mechanism designGroundings on mechanism design

WINNING ALTERNATIVE

PLAYER 1

MECHANISM

PLAYER N

PLAYER 2

Maximization of the social welfare

Reference: [Fudemberg & Tirole, Game theory, The MIT Press, 1991]

Page 9: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

9Groundings on mechanism designGroundings on mechanism design

WINNING ALTERNATIVE

PLAYER 1

MECHANISM

PLAYER N

PLAYER 2

Maximization of the social welfare

DEFINITION OF PAYMENTS

Reference: [Fudemberg & Tirole, Game theory, The MIT Press, 1991]

Page 10: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

10Groundings on mechanism designGroundings on mechanism design

Key features:

Player i ’s utility:

Truthful mechanism: a mechanism in which each player cannot increase its utility by misreporting its true evaluation, i.e., a mechanism in which

VCG mechanisms (Vickrey, Clarke and Grove):a class of mechanisms which is guaranteed to be truthful by means of a suitable definition of the payment function:

Page 11: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

11Groundings on mechanism designGroundings on mechanism design

Key features:

Clarke’s pivot rule for specifying the payment:

the winner’s payment equals the second-highest bid

VCG mechanisms are weakly budget-balanced, i.e.,

Therefore, the iterated application of the mechanism (non-strictly) decreases the players’ resources.

A solution: Cavallo’s pivot rule

Cavallo’s pivot = Clarke’s pivot + redistribution mechanism

> Truthfulness is preserved> Budget balancing is enhanced The second and third classified in

the bid increase their resources

Page 12: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

12A mechanism for NCSA mechanism for NCS

Two-layer structure:

First layer efficient allocation of the medium

(with no stability guarantees);

Second layer for ensuring stability.

The allocation procedure is governed by two sets of monetary sources:

Standard resources (ci)used at the first layer, in order to allocate the medium;

Stability-preserving resources (csi)used at the second layer, in order to preserve stability.

PLAYER i

Page 13: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

13A mechanism for NCSA mechanism for NCS

What does the mechanism need to know in order to work?

The quantities

The period T

The standard resources andstability-preserving resources of the players

The true value of the state of eachsubsystem (=player) at the beginning of each period

Set of alternatives:

t

a priori information

online information

Page 14: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

14A mechanism for NCSA mechanism for NCS

Evaluation function (common to both layers):

if the subplant is choosenif the subplant is not choosen

Remarks:

Subplant i has a positive evaluation only if it is chosen to be controlled.

The monetary resources do not directly affect the value of the evaluation function, but only enable the participation of the subplants to the mechanism.

VCG mechanism (truthfulnes

s)

Depends on the state evolution of the closed-loop subsystem along the next time subinterval

Page 15: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

15A mechanism for NCSA mechanism for NCS

Social-efficiency based selection criterion:

Payment mechanism (related to standard resources)

In view of truthfulness, the subplant i* with the highest evaluation value maximizes the social efficiency and is thus selected.

Cavallo’s redistributions

Limited communication requirements:each player only sends its own evaluation

Page 16: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

16A mechanism for NCSA mechanism for NCS

Initialization of monetary resources

At the beginning of each period of length T, ci and csi are initialized as follows:

Standard resources (ci)depend on the state at the beginning of the same period

Stability-preserving resources (csi) equal the minimum number of subintervals subsystem i needs to be controlled in order to preserve stability

Page 17: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

17A mechanism for NCSA mechanism for NCS

Update rules for monetary resources

Both standard resources and stability-preserving resources are updated at each execution of the mechanism during the period ( ).

Standard resources (ci)

Stability-preserving resources (csi)

Each time the subplant is chosen, the resources are reduced by one unit until they reach zero.

current resources payment Cavallo’s redistribution

Page 18: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

18A mechanism for NCSA mechanism for NCS

Mechanism design switching rule

IDEAallocation based on standard resources (efficiency-based) until the stabilization requirement becomes critical.

At each time step both resources are updated;

Standard resources are used for the bid until the number of remaining time step before the end of the period are just enough to complete the stabilization of all subsystems (i.e., zeroing the stability-preserving resources).

Page 19: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

19Simulation exampleSimulation example

Three first-order unstable linear subplants, each of them associated with a controller that stabilizes the respective subplant.Open- and closed-loop eigenvalues:

A comparison between different allocation methods over a time period T:

(A) The proposed mechanism-based allocation method;

(B) A modified allocation method obtained by removing induced payments and standard monetary resources.

In order to emphasize the difference in the resulting control action, we assume that subplant S1 reports the following altered evaluation function value:

uncontrolled plants

controlled plants

Page 20: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

20

Solution (A): more marked alternation among subsystems in the

scheduling;

penalization of the subsystem that “lies” (S1), in view of the resource-exhaustion phenomenon.

Simulation exampleSimulation example

(A) Proposed mechanism (B) Alternative solution

Page 21: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

21Simulation exampleSimulation example

Solution (A): Better overall state performance.

(A) Proposed mechanism (B) Alternative solution

Page 22: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

22ConclusionsConclusions

Main features:

Application of mechanism design to the stabilization issue of networked control systems;

synthesis of a dynamic scheduling policy in a game-theoretical setting;

our scheme avoids strategic behaviors of the players and efficiently allocates the communication;

the mechanism needs limited information to properly work.

Page 23: A  Mechanism  Design  Approach for  the  Stabilization of Networked dynamical systems

23Simulation exampleSimulation example

Stability tokens zeroed before the end of the period.

Switching to the second layer does not occur in this example.

Stability tokens Ordinary tokens

(A) Proposed mechanism