On Time with Minimal Expected Cost ! Time with Minimal Expected Cost ! Alexandre David, Peter G...

On Time with Minimal Expected Cost !

Alexandre David, Peter G Jensen, Kim G Larsen, Axel Legay, Didier Lime, Mathias G Sørensen, Jakob H Taankvist

– Aalborg University

DENMARK

TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAA

Motivation

CASSTING, Brussels, May 21-23, 2014 Kim Larsen [2]

Optimal WC strategy:

E=WC=45 (bicycle)

Optimal Expected Strategy:

E=33, WC=71 (car)

Optimal Expected Strategy ensuring WC<=60:

try train upto 3 delays then bike

E=37.5, WC=59

Bruyere, V., Filiot, E., Randour, M., Raskin, J.F.: Meet your expectations with guarantees: Beyond worst-case synthesis in quantitative games. STACS14

2-Player Game (Antagonistic opponent)

Markov Decision Process (probabilistic opponent)

Duration Probabilistic Automata

Pr[<=1000](<> P1.Done && P2.Done)

Kempf, J.F., Bozga, M., Maler, O.: As soon as probable: Optimal scheduling under stochastic uncertainty. In: TACAS. pp. 385{400 (2013)

Uni[2,6]

Uni[2,4]

Strategy that that will minimize

expected completion time??

Motivation

× Priced Timed Game

Timed Game Timed Automata Priced Timed Automata Stochastic (P)TA × Priced Timed MDP ∼ Decision Stochastic Priced Timed Automata

UPPAAL

TIGA/SMC

Minimize expected cost subject to guaranteed time-bound

Uni[0,100]

Nondeterministic choice

Overview

Priced Timed Games

Time bounded reachability strategies

Priced Timed Markov Decision Processes

Minimal expected cost reachability strategy

Optimal Strategy Synthesis Using Reinforcement Learning

Representation of Stochastic Strategies

Experimental Results

PTA and PTG

PTA Semantics

Strategies & Outcome

Cost Bounded Reachability Strategies

Motivation

Objective: 𝐴⟨⟩(END ∧ time ≤ 210)

Deterministic, memoryless strategy:

100 200

𝝀 𝝀

100 200

Most permissive, memoryless strategy

Priced Timed MDPs

Stochastic Strategies

Induced Probability Measure

Minimum Expected Cost

Motivation

Minimal Expected Cost Strategy (0,b) 2*80=160 Expected Cost for TIGA Strategy (100,w) 4*95=380

Minimal Expected Cost while

guaranteeing END is reached

within time 210:

Strat.: t>90 (100,w)

t>70 (0,b) ow (0,a) = 204

Reinforcement Learning

Time Bounded Reachability (G,T)

Strategies

Nondeterministic Strategies (UPPAAL TIGA)

Stochastic Strategies (non-lazy *)

* Non-lazy strategies suffices for DPAs

Classes allowing for

efficient

representation and

learning

Learning

Simulation of 𝑼𝒏𝒊 𝝈𝒑 for 𝐴⟨⟩(END ∧ time ≤ 210) time(𝝅)@Choice

C(𝝅) wait

Strategies – Representation & Manipulation

Covariance Matrices

Logistic Regression

Splitting

Using Learning Determinization

Experiments

Learned Strategies

More plots of runs according to strategies learne.

Covariance

Learned Strategies

Covariance

Learned Strategies

Covariance

Learned Strategies

Covariance

Learned Strategies

Regression

Learned Strategies

Regression

Learned Strategies

Regression

Learned Strategies

Regression

Learned Strategies

Regression

Learned Strategies

Regression

Learned Strategies

Splitting

Learned Strategies

Splitting

Learned Strategies

Splitting

Learned Strategies

Splitting

Learned Strategies

Splitting

Learned Strategies

Splitting

Learned Strategies

Splitting

Experiments /DPA

Kempf, J.F., Bozga, M., Maler, O.: As soon as probable: Optimal scheduling under stochastic uncertainty. In: TACAS. pp. 385{400 (2013)

http://www-verimag.imag.fr/PROJECTS/TEMPO/DATA/201304_dpa/

Experiments /DPA Random

Conclusion & Future Work

Efficient synthesis of strategies for PTMDP ensuring time-bounds and minimizing expected cost.

If not time-bound needed we can omit the UPPAAL TIGA synthesis

Extension to Hybrid MDPs utilizing UPPAAL SMCs support for SHAs.

Make TIGA/SMC available to you! Datastructures supporting general stochastic

strategies – not just non-lazy ones. More clever filtrations of runs.

On Time with Minimal Expected Cost ! Time with Minimal Expected Cost ! Alexandre David, Peter G...

Documents

Transcript of On Time with Minimal Expected Cost ! Time with Minimal Expected Cost ! Alexandre David, Peter G...

MARKING SCHEME SET 55/1/G Q. No. Expected Answer / Value ...

Minimal Space; Minimal EquipmentMinimal Space; Minimal Equipment . 1 1 Minimal Space; Minimal Equipment ... Several nuts and bolts sets Directions: Student will find partner sets of

Minimal Art, Post-Minimal, and Conceptual Art

DEFORMATIONS OF COMPLETE MINIMAL SURFACES...Any minimal surface in R3 yields a minimal surface in R3/G by projection, however periodic minimal surfaces may be simpler in the quotient.

F-MINIMAL SETS€¦ · The Sturmian minimal sets [9] and the minimal set of Jones [8, 14.16 to 14.24] are F-minimal sets. A discrete substitution minimal set is an F-minimal set if

Minimal Space; Minimal Equipmentdoe.sd.gov/title/2018-conference/documents/MinSpace.pdf · Minimal Space; Minimal Equipment Fine Motor Skills I use these fine motor skills as stations,

2017-ICCVW-Structured lerning and detailed interpretation ... · other pairs of minimal and sub-minimal images (Fig. 1F-G), and we evaluated its statistical contribution to the learning

G. Tamoliūnė "What is Expected from Early Stage Researcher in Nowadays Society?"

Determine Cause of Variances Between Expected and Actual Army G-1 Case

Minimal (IPC Phase 1) expected in most areas with average ... FSO_… · UGANDA Food Security Outlook February to September 2018 Minimal (IPC Phase 1) expected in most areas with

Dragonfly Topology and Routing. Outline Background Motivation Topology description Routing – Minimal Routing – Valiant Routing – UGAL/G Adaptive Routing.

Highness and b ounding minimal pairs - math.wisc.edulempp/papers/highbou.pdf · Highness and b ounding minimal pairs Ro dney G. wney Do Victoria y ersit Univ of ellington, W New Zealand

Improved RANSAC performance using simple, iterative minimal-set … · 2012-01-09 · Improved RANSAC performance using simple, iterative minimal-set solvers E. Rosten, G. Reitmayry,

Determine Cause of Variances Between Expected and Actual Army G-1 Case 1.

Minimal nets and minimal minimal surfacespeople.physics.anu.edu.au/~sth110/13_5_MinimalMinimal.pdf · minimal minimal surfaces. 1. Introduction The special periodic nets known as

Sonoran-WEC 138 CEC Application FINAL V6...transmission line operation and maintenance activities are expected to be minimal. I.2 Radio Interference Corona‐generated radio interference

Morgan Stanley Financial Services Conference economic and ... › media › assets › ... · transition hybrid securities Deductions (e.g. DTA, pension) expected to be minimal over

STEREOGRAPHIC PARAMETERS AND PSEUDO- MINIMAL … · Introduction In a recent paperf G. Y. Rainich and the writer showed that the Weier-strass formulas for minimal surfaces in terms

Aon Benfield Analytics | Impact Forecastingthoughtleadership.aonbenfield.com/Documents/... · at the end of May and into June. Total economic losses were expected to be minimal. Earthquakes

Minimal Industrial Needs for a Minimal Mars Colony.