Dynamic Programming

Study Guide for ES205

Yu-Chi Ho, Jonathan T. Lee

Jan. 11, 2001

Outline

• Sample Problem
• General Formulation
• Linear-Quadratic Problem
• General Problems

Path-Cost Problem

Find the path with minimal cost.

[Figure: network of paths, with stages labeled 1, 2, 3, 4 and N]

Principle of Optimality

“An optimal policy has the property that whatever the initial state and initial decision are, the remaining decisions must constitute an optimal policy with regard to the state resulting from the first decision.” (Bellman, 1957)

Path-Cost Problem (cont.)

[Figure: network of paths, with stages labeled 1, 2, 3, 4]

Formulation for Path-Cost Problem

$J_0^0(x(N))$ = terminal cost as a function of position $x(N)$

$J_1^0(x(N-1))$ = cost-to-go at stage $N-1$ (with 1 step left to go) as a function of position $x(N-1)$

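Formulation (cont.)

$$J_1^0(x(N-1)) = \min_{\text{choice of path}} \left[ \text{cost of path segment} + J_0^0(x(N)) \right]$$

More generally,

$$J_i^0(x(N-i)) = \min_{\text{choice of path}} \left[ \text{cost of path segment} + J_{i-1}^0(x(N-i+1)) \right]$$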
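The backward recursion for the path-cost problem can be sketched in Python as follows. This is a minimal illustration, not the study guide's own code: the network layout (`stage_costs`, `terminal_cost`) and the function names are hypothetical, chosen only to show how the cost-to-go is built from the last stage backward.

```python
def min_cost_path(stage_costs, terminal_cost):
    """Backward dynamic programming on a staged network.

    stage_costs[k][i][j] is the cost of the path segment from position i at
    stage k to position j at stage k + 1 (hypothetical data layout).
    terminal_cost[j] is the terminal cost of ending at position j.
    Returns (cost_to_go, choice): cost_to_go[k][i] is the minimal cost from
    position i at stage k to the end, and choice[k][i] is the best next position.
    """
    n_stages = len(stage_costs)
    cost_to_go = [None] * (n_stages + 1)
    choice = [None] * n_stages
    cost_to_go[n_stages] = list(terminal_cost)  # zero steps left to go
    for k in range(n_stages - 1, -1, -1):       # sweep backward in time
        cost_to_go[k], choice[k] = [], []
        for row in stage_costs[k]:
            # cost of path segment + cost-to-go from the position reached
            totals = [c + cost_to_go[k + 1][j] for j, c in enumerate(row)]
            best = min(range(len(totals)), key=totals.__getitem__)
            cost_to_go[k].append(totals[best])
            choice[k].append(best)
    return cost_to_go, choice


def recover_path(choice, start):
    """Follow the stored choices forward from a starting position."""
    path = [start]
    for stage_choice in choice:
        path.append(stage_choice[path[-1]])
    return path


# Hypothetical network: 3 stages of segments, 2 positions per stage.
stage_costs = [
    [[1, 4], [2, 1]],
    [[3, 1], [2, 5]],
    [[2, 2], [1, 3]],
]
terminal_cost = [0, 0]
cost_to_go, choice = min_cost_path(stage_costs, terminal_cost)
best_path = recover_path(choice, start=0)
```

For this made-up network, the minimal cost from position 0 is 3, reached through positions 0, 0, 1, 0. Each stage is visited once, so the work grows linearly with the number of stages rather than with the number of possible paths.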

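General Formulation

Multistage optimization problem:

$$\min_{u(0),\ldots,u(N-1)} J = \phi(x(N)) + \sum_{i=0}^{N-1} L(x(i), u(i), i)$$

The cost-to-go

$$J_i^0(x(N-i)) = \min_{u(N-i)} \left[ L(x(N-i), u(N-i), N-i) + J_{i-1}^0(x(N-i+1)) \right]$$

with initial condition

$$J_0^0(x(N)) = \phi(x(N))$$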
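A minimal sketch of the general cost-to-go recursion for finite state and control sets is shown below. Several names are assumptions introduced for illustration and do not appear in the study guide: `step` stands in for the system dynamics that map x(i) and u(i) to x(i+1), and the grid of states, the controls, and the quadratic costs in the example are hypothetical.

```python
def backward_dp(states, controls, step, stage_cost, terminal_cost, horizon):
    """Finite-horizon dynamic programming by backward recursion.

    J[i][x] is the optimal cost-to-go with i stages left, starting from x.
    policy[k][x] is the minimizing control at time k in state x.
    `step(x, u, k)` is an assumed dynamics function returning the next state.
    """
    J = [{x: terminal_cost(x) for x in states}]   # zero stages left
    policy = []
    for i in range(1, horizon + 1):
        k = horizon - i                           # time index N - i
        J_i, mu = {}, {}
        for x in states:
            best_u, best_val = None, float("inf")
            for u in controls:
                x_next = step(x, u, k)
                if x_next not in J[i - 1]:
                    continue                      # leaves the grid: infeasible
                val = stage_cost(x, u, k) + J[i - 1][x_next]
                if val < best_val:
                    best_u, best_val = u, val
            J_i[x], mu[x] = best_val, best_u
        J.append(J_i)
        policy.insert(0, mu)                      # keep policy indexed by time
    return J, policy


# Hypothetical example: integer states, unit moves, quadratic costs.
states = range(-3, 4)
controls = (-1, 0, 1)
J, policy = backward_dp(
    states,
    controls,
    step=lambda x, u, k: x + u,
    stage_cost=lambda x, u, k: x * x + u * u,
    terminal_cost=lambda x: x * x,
    horizon=3,
)
```

In this example `J[3][2]` evaluates to 7, and `policy[0][2]` is -1, so the first move from state 2 is toward the origin. The table `J[i - 1]` plays the role of the cost-to-go with one fewer stage left, so each stage reuses the previous stage's results instead of re-enumerating control sequences.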

Multistage or Optimal Control Problem

• Can be approached as a static optimization problem with specialized equality (staircase) constraints
• See the study guide titled “Dynamic Systems”
• The equivalence of the two approaches (static optimization and dynamic programming) will be made clear below in the solution of a specific class of problems

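Linear-Quadratic Problem

$$J = \tfrac{1}{2} a(N) x(N)^2 + \tfrac{1}{2} \sum_{i=0}^{N-1} \left[ a(i) x(i)^2 + b(i) u(i)^2 \right]$$

subject to linear system dynamics

$$x(i+1) = f(i) x(i) + g(i) u(i)$$

given the initial state x(0), where

• x(i) is the state variable at time i
• u(i) is the control variable at time i
• a(i) and b(i) are the cost factors at time i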

LQ Problem (cont.)

$$J_0^0(x(N)) = \tfrac{1}{2} a(N) x(N)^2$$

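LQ Problem (cont.)

Substitute

$$x(N) = f(N-1) x(N-1) + g(N-1) u(N-1)$$

into

$$J_1^0(x(N-1)) = \min_{u(N-1)} \tfrac{1}{2} \left[ a(N-1) x(N-1)^2 + b(N-1) u(N-1)^2 + a(N) x(N)^2 \right]$$

and set the derivative with respect to u(N-1) to zero. We get

$$u(N-1) = -\frac{a(N) f(N-1) g(N-1)}{b(N-1) + a(N) g(N-1)^2}\, x(N-1)$$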

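LQ Problem (cont.)

With some work, we have

$$u(N-1) = -\frac{g(N-1)\, a(N)\, x(N)}{b(N-1)}$$

Let

$$\lambda(N) = a(N) x(N)$$

Then, we have

$$u(N-1) = -\frac{g(N-1)\, \lambda(N)}{b(N-1)}$$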

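LQ Problem (cont.)

With $J_0^0(x(N)) = \tfrac{1}{2} \lambda(N) x(N)$, where $\lambda(N) = a(N) x(N)$,

$$J_1^0(x(N-1)) = \min_{u(N-1)} \tfrac{1}{2} \left[ a(N-1) x(N-1)^2 + b(N-1) u(N-1)^2 + \lambda(N) x(N) \right]$$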

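LQ Problem (cont.)

Substitute the optimum u(N-1), then we have

$$J_1^0(x(N-1)) = \tfrac{1}{2} \left[ a(N-1) x(N-1)^2 + f(N-1)\, a(N)\, x(N)\, x(N-1) \right]$$

Define

$$\lambda(N-1) = f(N-1) \lambda(N) + a(N-1) x(N-1)$$

so that

$$J_1^0(x(N-1)) = \tfrac{1}{2} \lambda(N-1) x(N-1)$$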

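LQ Problem (cont.)

$$J_2^0(x(N-2)) = \min_{u(N-2)} \tfrac{1}{2} \left[ a(N-2) x(N-2)^2 + b(N-2) u(N-2)^2 + \lambda(N-1) x(N-1) \right]$$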

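LQ Problem (cont.)

By induction, we have the optimal solution to be

$$u(i) = -\frac{g(i)\, \lambda(i+1)}{b(i)}$$

$$\lambda(i) = f(i) \lambda(i+1) + a(i) x(i)$$

with boundary condition

$$\lambda(N) = a(N) x(N)$$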
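The scalar linear-quadratic solution can be checked numerically with the sketch below. It relies on a substitution that is not in the study guide: writing λ(i) = p(i) x(i) turns the λ recursion into a backward recursion for p(i) alone (a scalar Riccati-type equation), which can be swept backward without knowing the trajectory. The function names and the numerical values are hypothetical, and b(i) > 0 is assumed so that the denominators are nonzero.

```python
def lq_backward_sweep(a, b, f, g):
    """Backward sweep for the scalar LQ problem.

    a has length N + 1; b, f, g have length N.
    Assumes lambda(i) = p(i) * x(i), which gives
        p(N) = a(N)
        p(i) = a(i) + f(i)^2 * p(i+1) * b(i) / (b(i) + p(i+1) * g(i)^2)
    and the feedback law u(i) = -gain[i] * x(i).
    """
    N = len(b)
    p = [0.0] * (N + 1)
    gain = [0.0] * N
    p[N] = a[N]                                  # boundary condition
    for i in range(N - 1, -1, -1):
        denom = b[i] + p[i + 1] * g[i] ** 2
        gain[i] = p[i + 1] * f[i] * g[i] / denom
        p[i] = a[i] + f[i] ** 2 * p[i + 1] * b[i] / denom
    return p, gain


def simulate(x0, f, g, gain):
    """Run the dynamics x(i+1) = f(i) x(i) + g(i) u(i) under the feedback law."""
    xs, us = [x0], []
    for i in range(len(gain)):
        u = -gain[i] * xs[-1]
        us.append(u)
        xs.append(f[i] * xs[-1] + g[i] * u)
    return xs, us


def total_cost(xs, us, a, b):
    """Quadratic cost J for a given state and control trajectory."""
    N = len(us)
    running = sum(a[i] * xs[i] ** 2 + b[i] * us[i] ** 2 for i in range(N))
    return 0.5 * (a[N] * xs[N] ** 2 + running)


# Hypothetical data: constant coefficients over N = 5 stages.
N = 5
a = [1.0] * (N + 1)
b = [2.0] * N
f = [1.1] * N
g = [0.5] * N
x0 = 3.0

p, gain = lq_backward_sweep(a, b, f, g)
xs, us = simulate(x0, f, g, gain)
J_opt = total_cost(xs, us, a, b)
lam = [p[i] * xs[i] for i in range(N + 1)]       # lambda(i) along the trajectory
```

Along the simulated trajectory, `lam` satisfies the λ recursion and `us[i]` equals `-g[i] * lam[i + 1] / b[i]`, while `J_opt` agrees with `0.5 * p[0] * x0 ** 2` up to rounding. The feedback form is convenient because the gains depend only on the coefficients, not on the initial state.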

General Problems

• Stochastic problems
• Combinatorial problems
• Variable termination time
• Constraints in the problem

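Stochastic Problem

$$\min_{u(0),\ldots,u(N-1)} J = E\left[ \phi(x(N), \xi) + \sum_{i=0}^{N-1} L(x(i), u(i), \xi, i) \right]$$

The cost-to-go

$$J_i^0(x(N-i)) = \min_{u(N-i)} E\left[ L(x(N-i), u(N-i), \xi, N-i) + J_{i-1}^0(x(N-i+1)) \right]$$

$$J_0^0(x(N)) = E\left[ \phi(x(N), \xi) \right]$$

where $\xi$ denotes the random disturbance and $E$ the expectation over it.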
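The stochastic cost-to-go can be sketched for a disturbance that takes finitely many values, so that the expectation becomes a weighted sum. Everything specific in this example is an assumption made for illustration: the inventory-style dynamics in `step`, the cost functions, the demand distribution `noise`, and all function names are hypothetical.

```python
def stochastic_backward_dp(states, controls, noise, step, stage_cost,
                           terminal_cost, horizon):
    """Backward recursion with an expectation over a discrete disturbance.

    noise is a list of (value, probability) pairs.
    J[i][x] is the optimal expected cost-to-go with i stages left.
    policy[k][x] is the minimizing control at time k in state x.
    """
    J = [{x: sum(p * terminal_cost(x, w) for w, p in noise) for x in states}]
    policy = []
    for i in range(1, horizon + 1):
        k = horizon - i                          # time index N - i
        J_i, mu = {}, {}
        for x in states:
            best_u, best_val = None, float("inf")
            for u in controls:
                # expected stage cost plus expected cost-to-go
                val = sum(
                    p * (stage_cost(x, u, w, k) + J[i - 1][step(x, u, w, k)])
                    for w, p in noise
                )
                if val < best_val:
                    best_u, best_val = u, val
            J_i[x], mu[x] = best_val, best_u
        J.append(J_i)
        policy.insert(0, mu)
    return J, policy


# Hypothetical inventory example: x = stock, u = order, w = random demand.
states = range(0, 5)
controls = (0, 1, 2)
noise = [(0, 0.25), (1, 0.5), (2, 0.25)]
horizon = 3


def step(x, u, w, k):
    """Next stock level, clamped to the grid 0..4."""
    return min(max(x + u - w, 0), 4)


def stage_cost(x, u, w, k):
    """Ordering cost, shortage penalty, and holding cost."""
    return 1.0 * u + 3.0 * max(w - x - u, 0) + 0.5 * max(x + u - w, 0)


def terminal_cost(x, w):
    """Penalty for unmet demand after the last stage."""
    return 3.0 * max(w - x, 0)


J, policy = stochastic_backward_dp(
    states, controls, noise, step, stage_cost, terminal_cost, horizon
)
```

Here the optimal decision is a policy, a control for every state at every time, rather than a single control sequence, because the state actually reached depends on the disturbance.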

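Combinatorial Problem

$$\min_{x(1),\ldots,x(N)} \left[ \sum_{i=0}^{N-1} L_i(x(i), x(i+1)) + L_N(x(N)) \right]$$

The cost-to-go

$$J_i^0(x(N-i)) = \min_{x(N-i+1)} \left[ L_{N-i}(x(N-i), x(N-i+1)) + J_{i-1}^0(x(N-i+1)) \right]$$

$$J_0^0(x(N)) = L_N(x(N))$$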

References:

• Bellman, R., Dynamic Programming, Princeton University Press, 1957.
• Bryson, Jr., A. E. and Y.-C. Ho, Applied Optimal Control: Optimization, Estimation, and Control, Taylor & Francis, 1975.
• Dreyfus, S. E. and A. M. Law, The Art and Theory of Dynamic Programming, Academic Press, 1977.
• Ho, Y.-C., Lecture Notes, Harvard University, 1997.
• National Institute of Standards and Technology, Dictionary of Algorithms, Data Structures, and Problems, http://hissa.nist.gov/dads/HTML/principle.html
• Ortega, A. and K. Ramchandran, “Rate-Distortion Methods for Image and Video Compression: An Overview,” IEEE Signal Processing Magazine, Nov. 1998. http://sipi.usc.edu/~ortega/RD_Examples/boxDP.html
