TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

22
TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS RODRIGO LECAROS 1,3 AND ENRIQUE ZUAZUA 1,2 Abstract. We analyze a model tracking problem for a 1D scalar conservation law. It consists in optimizing the initial datum so to minimize a weighted distance to a given target during a given finite time horizon. Even if the optimal control problem under consideration is of classical nature, the presence of shocks is an impediment for classical methods, based on linearization, to be directly applied. We adapt the so-called alternating descent method that exploits the generalized lin- earization that takes into account both the sensitivity of the shock location and of the smooth components of solutions. This method was previously applied successfully on the inverse design problem and that of identifying the nonlinearity in the equation. The efficiency of the method in comparison with more classical discrete methods is illustrated by several numerical experiments. Contents 1. Introduction 2 2. Existence of Minimizers 3 3. The Discrete Minimization Problem 4 4. Sensitivity analysis: the continuous approach 6 4.1. Sensitivity without shocks 6 4.2. Sensitivity of the state in the presence of shocks 8 4.3. Sensitivity of the cost in the presence of shocks 9 5. The alternating descent method 12 6. Numerical approximation of the descent direction 13 6.1. The discrete approach 14 6.2. The alternating descent method 15 7. Numerical experiments 16 7.1. Experiment 1 17 7.2. Experiment 2 17 8. Conclusions and perspectives 20 References 21 Partially supported by Grants MTM2008-03541 and MTM2011-29306 of MICINN Spain, Project PI2010-04 of the Basque Government, ERC Advanced Grant FP7-246775 NUMERIWAVES and ESF Research Networking Programme OPTPDE. This work was finished while the second author was visiting the Laboratoire Jacques Louis Lions with the support of the Paris City Hall “Research in Paris” program. 1

Transcript of TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

Page 1: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS INTHE PRESENCE OF SHOCKS

RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Abstract We analyze a model tracking problem for a 1D scalar conservation law Itconsists in optimizing the initial datum so to minimize a weighted distance to a giventarget during a given finite time horizon

Even if the optimal control problem under consideration is of classical nature thepresence of shocks is an impediment for classical methods based on linearization to bedirectly applied

We adapt the so-called alternating descent method that exploits the generalized lin-earization that takes into account both the sensitivity of the shock location and of thesmooth components of solutions This method was previously applied successfully on theinverse design problem and that of identifying the nonlinearity in the equation

The efficiency of the method in comparison with more classical discrete methods isillustrated by several numerical experiments

Contents

1 Introduction 22 Existence of Minimizers 33 The Discrete Minimization Problem 44 Sensitivity analysis the continuous approach 641 Sensitivity without shocks 642 Sensitivity of the state in the presence of shocks 843 Sensitivity of the cost in the presence of shocks 95 The alternating descent method 126 Numerical approximation of the descent direction 1361 The discrete approach 1462 The alternating descent method 157 Numerical experiments 1671 Experiment 1 1772 Experiment 2 178 Conclusions and perspectives 20References 21

Partially supported by Grants MTM2008-03541 and MTM2011-29306 of MICINN Spain ProjectPI2010-04 of the Basque Government ERC Advanced Grant FP7-246775 NUMERIWAVES and ESFResearch Networking Programme OPTPDEThis work was finished while the second author was visiting the Laboratoire Jacques Louis Lions with thesupport of the Paris City Hall ldquoResearch in Parisrdquo program

1

2 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

1 Introduction

There is an extensive literature on the control optimization and inverse design of partialdifferential equations But when dealing with nonlinear models most often the analysisrequires to linearize the system under consideration This is why most of the existing resultsdo not apply to hyperbolic conservation laws since the shock discontinuities of solutionsare an impediment to linearize the system under consideration in a classical manner

This paper is devoted to analyze this issue for 1D scalar conservation laws To comple-ment previous related works by our group we consider the tracking problem in which thegoal is to optimally choose the initial datum so that the solution is as close as possible to aprescribed trajectory Our previous works are devoted to the problem of inverse design inwhich the goal is to determine the initial datum so that the solution achieves a given targetat the final time (see for instance [8] [9]) or the nonlinearity entering in the conservationlaw [10])

To be more precise given a finite time T gt 0 a target function ud isin L2(R times (0 T ))and a positive weight function ρ isin Linfin(Rtimes (0 T )) with compact support in Rtimes (0 T ) weconsider the functional cost to be minimized J over a suitable class of initial data Uaddefined by

J(u0) =1

2

int T

0

intRρ(x t)|u(x t)minus ud(x t)|2dxdt (11)

where u Rx times Rt rarr R is the unique entropy solution of the scalar conservation law

parttu+ partx(f(u)) = 0 in Rtimes (0 T ) u(x 0) = u0(x) x isin R (12)

Thus the problem under consideration reads To find u0min isin Uad such that

J(u0min) = minu0isinUab

J(u0) (13)

Here the flux f R rarr R is assumed to be smooth f isin C1(RR) The initial datumu0 will be assumed to belong to a suitable admissible class Uad to ensure the existence ofa minimizer As a preliminary fact we remind that for u0 isin L1(R) cap Linfin(R) cap BV (R)there exists an unique entropy solution in the sense of Kruzkov (see [15]) in the classC0([0 T ]L1(R)) cap Linfin(Rtimes [0 T ]) cap Linfin([0 T ]BV (R))

As we will see the existence of minimizers can be established under some natural as-sumptions on the class of admissible data Uad using the well-posedness and compactnessproperties of solutions of the conservation law (12) The uniqueness of the minimizersis false in general due in particular to the possible presence of discontinuities in thesolutions of (12)

One of our main goals in this paper is to compare how the discrete approach and thealternating descent methods perform in this new prototypical optimization problem Thisalternating descent method was introduced and developed in the previous works by ourgroup to deal with and exploit the sensitivity of shocks In other words the solutions ofthe conservation law are seen as a multi-physics object integrated by both the solutionitself but also by the geometric location of the shock Perturbing the initial datum of theconservation law produces perturbations on both components Accordingly it is natural to

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 3

analyze and employ both sensitivities to develop descent algorithms leading to an efficientcomputation of the minimizer This is the basis of the alternating descent method ([8])

As we shall see in agreement with the results achieved in previously considered exam-ples the alternating descent method performs better than the classical discrete one whichconsists simply in applying a classical gradient descent algorithm to a discrete version ofthe functional and the conservation law

This paper is limited to the one dimensional case but the alternating descent methodcan also be extended to the multi-dimensional frame although this requires a much morecareful geometric treatment of shocks since for instance in 2minusd these are curves evolvingin time whose location has to be carefully determined and their motion and perturbationhandled carefully to avoid numerical instabilities (see [16])

There are other possible methods that could be employed to deal with these problemsIn the present 1 minus d setting for instance whenever the flux is convex one could use thetechniques developed in [1] based on the Lax-Oleinik representation formula of solutionsOne could also employ the nudging method developed by J Blum and coworkers ([2] [3])It would be interesting to compare in a systematic manner the results obtained in thisarticle with those that could be achieved by these other methods

The rest of this paper is organized as follows In Section 2 we formulate the problemunder consideration more precisely and prove the existence of minimizers

In Section 3 we introduce the discrete approximation of the continuous optimal controlproblem We prove the existence of minimizers for the discrete problem and their Γ-convergence towards the continuous ones as the mesh-size parameters tend to zero

As we shall see purely discrete approaches based on the minimization of the resultingdiscrete functionals by descent algorithms lead to very slow iterative processes We thusneed to introduce an alternated descent algorithm that takes into account the possiblepresence of shock discontinuities in solutions For doing this the first step is to develop acareful sensitivity analysis This is done in Section 4

In Section 5 we present the alternating descent method which combines the advantagesof both the discrete approach and the sensitivity analysis in the presence of shocks

In Section 6 we explain how to implement both descent algorithms The discrete ap-proach consists mainly in applying a descent algorithm to the discrete version J∆ of thefunctional J while the alternating descent method by the contrary is a continuous methodbased on the analysis of the previous section on the sensitivity of shocks

In Section 7 we present some numerical experiments illustrating the overall efficiency ofthe alternating descent method We conclude discussing some possible extensions of theresults and methods presented in the paper

2 Existence of Minimizers

In this section we prove that under certain conditions on the set of admissible initialdata Uad there exists at least one minimizer of the functional J given in (11) To do thiswe consider the class of admissible initial data Uad as

Uad = u0 isin Linfin(R) capBV (R) supp(u0) sub K u0Linfin + TV (u0) le C (21)

4 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where K sub R is a given compact set and C gt 0 a given constant Note however that thesame theoretical results and descent strategies we shall develop here can be applied to amuch wider class of admissible sets

Theorem 21 Assume that ud isin L2(Rtimes (0 T )) let Uad be defined in (21) and f be a C1

function Then the minimization problem

minu0isinUab

J(u0) (22)

has at least one minimizer u0min isin Uad Moreover uniqueness is false in general

The proof follows the classical strategy of the Direct Method of the Calculus of Vari-ations We refer the reader to [8] for a similar proof in the case where the functional Jis replaced by the one in which the distance to a given target at the final time t = T isminimized Note that in particular for the class Uad of admissible initial data consideredsolutions enjoy uniform BV bounds allowing to prove the needed compactness propertiesto pass to the limit In the case of convex fluxes using the well-known one-sided Lipschitzcondition the class of admissible initial data can be further extended

We also observe that as indicated in [8] the uniqueness of the minimizer is in generalfalse for this type of optimization problems

3 The Discrete Minimization Problem

The purpose of this section is to show that discrete minimizers obtained by a numericalapproximation of (11) and (12) converge to a minimizer of the continuous problem asthe mesh-size tends to zero This justifies the usual engineering practice of replacing thecontinuous functional and model by discrete ones to compute an approximation of thecontinuous minimizer

Let us introduce a mesh in Rtimes [0 T ] given by (xi tn) = (i∆x n∆t) (i = minusinfin infin n =

0 N + 1 so that (N + 1)∆t = T ) and let unij be a numerical approximation of u(xi tn)

obtained as solution of a suitable discretization of the equation (12)Let us consider the following approximation of the functional J in (11)

J∆(u0∆) =

∆x∆t

2

Nsumn=0

infinsumi=minusinfin

ρni(uni minus (ud)ni

)2 (31)

where u0∆ = u0

i is the discrete initial datum and ud∆ = (ud)ni = Π∆ud is the discretiza-

tion of the target ud at (xi tn) Π∆ being a discretization operator A common choice

consists in taking

Π∆ud = (ud)ni =

1

∆x∆t

int xi+12

ximinus12

int tn+12

tnminus12

ud(x t) dxdt (32)

where xiplusmn12 = xi plusmn∆x2 and tnplusmn12 = tn plusmn∆t2Moreover we introduce an approximation of the class of admissible initial data Uad

denoted by Uad∆ and constituted by sequences ϕ∆ = ϕiiisinZ for which the associated

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 5

piecewise constant interpolation function that we still denote by ϕ∆ defined by

ϕ∆(x) = ϕi x isin (ximinus12 xi+12)

satisfies ϕ∆ isin Uad Obviously Uad∆ coincides with the class of discrete vectors withsupport on those indices i such that xi isin K and for which the discrete Linfin and TV -normsare bounded above by the same constant C

Let us consider S∆ l1(Z)rarr l1(Z) an explicit numerical scheme for (12) where

un∆ = Sn∆u

0∆ (33)

is the approximation of the entropy solution u(middot t) = S(t)u0 of (12) ie un∆ S(t)u0with t = n∆t Here S L1(R) cap Linfin(R) cap BV (R) rarr L1(R) cap Linfin(R) cap BV (R) is thesemigroup (solution operator)

S u0 rarr u = Su0

which associates to the initial condition u0 isin L1(R)capLinfin(R)capBV (R) the entropy solutionu of (12)

For each ∆ = ∆t (with λ = ∆t∆x fixed typically given by the corresponding CFL-condition for explicit schemes) it is easy to see that the discrete analogue of Theorem21 holds In fact this is automatic in the present setting since Uad∆ only involves afinite number of mesh-points But passing to the limit as ∆ rarr 0 requires a more carefultreatment In fact for that to be done one needs to assume that the scheme underconsideration (33) is a contracting map in l1(Z)

Thus we consider the following discrete minimization problem Find u0min∆ such that

J∆(u0min∆ ) = min

u0∆isinUad∆

J∆(u0∆) (34)

The following holds

Theorem 31 Assume that un∆ is obtained by a numerical scheme (33) which satisfiesthe following

bull For a given u0 isin Uad un∆ = Sn∆Π∆u

0 converges to u(x t) the entropy solution of(12) More precisely if n∆t = t and T0 gt 0 is any given number

max0letleT0

un∆ minus u(middot t)L1(R) rarr 0 as ∆rarr 0 (35)

bull The map S∆ is Linfin-stable ie

S∆u0∆Linfin le u0

∆Linfin (36)

bull The map S∆ is a contracting map in l1(Z) ie

S∆u0∆ minus S∆v

0∆L1 le u0

∆ minus v0∆L1 (37)

Then

bull For all ∆ the discrete minimization problem (34) has at least one solution u0min∆ isin

Uad∆

bull Any accumulation point of u0min∆ with respect to the weak-lowast topology in Linfin as

∆rarr 0 is a minimizer of the continuous problem (22)

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 2: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

2 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

1 Introduction

There is an extensive literature on the control optimization and inverse design of partialdifferential equations But when dealing with nonlinear models most often the analysisrequires to linearize the system under consideration This is why most of the existing resultsdo not apply to hyperbolic conservation laws since the shock discontinuities of solutionsare an impediment to linearize the system under consideration in a classical manner

This paper is devoted to analyze this issue for 1D scalar conservation laws To comple-ment previous related works by our group we consider the tracking problem in which thegoal is to optimally choose the initial datum so that the solution is as close as possible to aprescribed trajectory Our previous works are devoted to the problem of inverse design inwhich the goal is to determine the initial datum so that the solution achieves a given targetat the final time (see for instance [8] [9]) or the nonlinearity entering in the conservationlaw [10])

To be more precise given a finite time T gt 0 a target function ud isin L2(R times (0 T ))and a positive weight function ρ isin Linfin(Rtimes (0 T )) with compact support in Rtimes (0 T ) weconsider the functional cost to be minimized J over a suitable class of initial data Uaddefined by

J(u0) =1

2

int T

0

intRρ(x t)|u(x t)minus ud(x t)|2dxdt (11)

where u Rx times Rt rarr R is the unique entropy solution of the scalar conservation law

parttu+ partx(f(u)) = 0 in Rtimes (0 T ) u(x 0) = u0(x) x isin R (12)

Thus the problem under consideration reads To find u0min isin Uad such that

J(u0min) = minu0isinUab

J(u0) (13)

Here the flux f R rarr R is assumed to be smooth f isin C1(RR) The initial datumu0 will be assumed to belong to a suitable admissible class Uad to ensure the existence ofa minimizer As a preliminary fact we remind that for u0 isin L1(R) cap Linfin(R) cap BV (R)there exists an unique entropy solution in the sense of Kruzkov (see [15]) in the classC0([0 T ]L1(R)) cap Linfin(Rtimes [0 T ]) cap Linfin([0 T ]BV (R))

As we will see the existence of minimizers can be established under some natural as-sumptions on the class of admissible data Uad using the well-posedness and compactnessproperties of solutions of the conservation law (12) The uniqueness of the minimizersis false in general due in particular to the possible presence of discontinuities in thesolutions of (12)

One of our main goals in this paper is to compare how the discrete approach and thealternating descent methods perform in this new prototypical optimization problem Thisalternating descent method was introduced and developed in the previous works by ourgroup to deal with and exploit the sensitivity of shocks In other words the solutions ofthe conservation law are seen as a multi-physics object integrated by both the solutionitself but also by the geometric location of the shock Perturbing the initial datum of theconservation law produces perturbations on both components Accordingly it is natural to

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 3

analyze and employ both sensitivities to develop descent algorithms leading to an efficientcomputation of the minimizer This is the basis of the alternating descent method ([8])

As we shall see in agreement with the results achieved in previously considered exam-ples the alternating descent method performs better than the classical discrete one whichconsists simply in applying a classical gradient descent algorithm to a discrete version ofthe functional and the conservation law

This paper is limited to the one dimensional case but the alternating descent methodcan also be extended to the multi-dimensional frame although this requires a much morecareful geometric treatment of shocks since for instance in 2minusd these are curves evolvingin time whose location has to be carefully determined and their motion and perturbationhandled carefully to avoid numerical instabilities (see [16])

There are other possible methods that could be employed to deal with these problemsIn the present 1 minus d setting for instance whenever the flux is convex one could use thetechniques developed in [1] based on the Lax-Oleinik representation formula of solutionsOne could also employ the nudging method developed by J Blum and coworkers ([2] [3])It would be interesting to compare in a systematic manner the results obtained in thisarticle with those that could be achieved by these other methods

The rest of this paper is organized as follows In Section 2 we formulate the problemunder consideration more precisely and prove the existence of minimizers

In Section 3 we introduce the discrete approximation of the continuous optimal controlproblem We prove the existence of minimizers for the discrete problem and their Γ-convergence towards the continuous ones as the mesh-size parameters tend to zero

As we shall see purely discrete approaches based on the minimization of the resultingdiscrete functionals by descent algorithms lead to very slow iterative processes We thusneed to introduce an alternated descent algorithm that takes into account the possiblepresence of shock discontinuities in solutions For doing this the first step is to develop acareful sensitivity analysis This is done in Section 4

In Section 5 we present the alternating descent method which combines the advantagesof both the discrete approach and the sensitivity analysis in the presence of shocks

In Section 6 we explain how to implement both descent algorithms The discrete ap-proach consists mainly in applying a descent algorithm to the discrete version J∆ of thefunctional J while the alternating descent method by the contrary is a continuous methodbased on the analysis of the previous section on the sensitivity of shocks

In Section 7 we present some numerical experiments illustrating the overall efficiency ofthe alternating descent method We conclude discussing some possible extensions of theresults and methods presented in the paper

2 Existence of Minimizers

In this section we prove that under certain conditions on the set of admissible initialdata Uad there exists at least one minimizer of the functional J given in (11) To do thiswe consider the class of admissible initial data Uad as

Uad = u0 isin Linfin(R) capBV (R) supp(u0) sub K u0Linfin + TV (u0) le C (21)

4 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where K sub R is a given compact set and C gt 0 a given constant Note however that thesame theoretical results and descent strategies we shall develop here can be applied to amuch wider class of admissible sets

Theorem 21 Assume that ud isin L2(Rtimes (0 T )) let Uad be defined in (21) and f be a C1

function Then the minimization problem

minu0isinUab

J(u0) (22)

has at least one minimizer u0min isin Uad Moreover uniqueness is false in general

The proof follows the classical strategy of the Direct Method of the Calculus of Vari-ations We refer the reader to [8] for a similar proof in the case where the functional Jis replaced by the one in which the distance to a given target at the final time t = T isminimized Note that in particular for the class Uad of admissible initial data consideredsolutions enjoy uniform BV bounds allowing to prove the needed compactness propertiesto pass to the limit In the case of convex fluxes using the well-known one-sided Lipschitzcondition the class of admissible initial data can be further extended

We also observe that as indicated in [8] the uniqueness of the minimizer is in generalfalse for this type of optimization problems

3 The Discrete Minimization Problem

The purpose of this section is to show that discrete minimizers obtained by a numericalapproximation of (11) and (12) converge to a minimizer of the continuous problem asthe mesh-size tends to zero This justifies the usual engineering practice of replacing thecontinuous functional and model by discrete ones to compute an approximation of thecontinuous minimizer

Let us introduce a mesh in Rtimes [0 T ] given by (xi tn) = (i∆x n∆t) (i = minusinfin infin n =

0 N + 1 so that (N + 1)∆t = T ) and let unij be a numerical approximation of u(xi tn)

obtained as solution of a suitable discretization of the equation (12)Let us consider the following approximation of the functional J in (11)

J∆(u0∆) =

∆x∆t

2

Nsumn=0

infinsumi=minusinfin

ρni(uni minus (ud)ni

)2 (31)

where u0∆ = u0

i is the discrete initial datum and ud∆ = (ud)ni = Π∆ud is the discretiza-

tion of the target ud at (xi tn) Π∆ being a discretization operator A common choice

consists in taking

Π∆ud = (ud)ni =

1

∆x∆t

int xi+12

ximinus12

int tn+12

tnminus12

ud(x t) dxdt (32)

where xiplusmn12 = xi plusmn∆x2 and tnplusmn12 = tn plusmn∆t2Moreover we introduce an approximation of the class of admissible initial data Uad

denoted by Uad∆ and constituted by sequences ϕ∆ = ϕiiisinZ for which the associated

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 5

piecewise constant interpolation function that we still denote by ϕ∆ defined by

ϕ∆(x) = ϕi x isin (ximinus12 xi+12)

satisfies ϕ∆ isin Uad Obviously Uad∆ coincides with the class of discrete vectors withsupport on those indices i such that xi isin K and for which the discrete Linfin and TV -normsare bounded above by the same constant C

Let us consider S∆ l1(Z)rarr l1(Z) an explicit numerical scheme for (12) where

un∆ = Sn∆u

0∆ (33)

is the approximation of the entropy solution u(middot t) = S(t)u0 of (12) ie un∆ S(t)u0with t = n∆t Here S L1(R) cap Linfin(R) cap BV (R) rarr L1(R) cap Linfin(R) cap BV (R) is thesemigroup (solution operator)

S u0 rarr u = Su0

which associates to the initial condition u0 isin L1(R)capLinfin(R)capBV (R) the entropy solutionu of (12)

For each ∆ = ∆t (with λ = ∆t∆x fixed typically given by the corresponding CFL-condition for explicit schemes) it is easy to see that the discrete analogue of Theorem21 holds In fact this is automatic in the present setting since Uad∆ only involves afinite number of mesh-points But passing to the limit as ∆ rarr 0 requires a more carefultreatment In fact for that to be done one needs to assume that the scheme underconsideration (33) is a contracting map in l1(Z)

Thus we consider the following discrete minimization problem Find u0min∆ such that

J∆(u0min∆ ) = min

u0∆isinUad∆

J∆(u0∆) (34)

The following holds

Theorem 31 Assume that un∆ is obtained by a numerical scheme (33) which satisfiesthe following

bull For a given u0 isin Uad un∆ = Sn∆Π∆u

0 converges to u(x t) the entropy solution of(12) More precisely if n∆t = t and T0 gt 0 is any given number

max0letleT0

un∆ minus u(middot t)L1(R) rarr 0 as ∆rarr 0 (35)

bull The map S∆ is Linfin-stable ie

S∆u0∆Linfin le u0

∆Linfin (36)

bull The map S∆ is a contracting map in l1(Z) ie

S∆u0∆ minus S∆v

0∆L1 le u0

∆ minus v0∆L1 (37)

Then

bull For all ∆ the discrete minimization problem (34) has at least one solution u0min∆ isin

Uad∆

bull Any accumulation point of u0min∆ with respect to the weak-lowast topology in Linfin as

∆rarr 0 is a minimizer of the continuous problem (22)

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 3: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 3

analyze and employ both sensitivities to develop descent algorithms leading to an efficientcomputation of the minimizer This is the basis of the alternating descent method ([8])

As we shall see in agreement with the results achieved in previously considered exam-ples the alternating descent method performs better than the classical discrete one whichconsists simply in applying a classical gradient descent algorithm to a discrete version ofthe functional and the conservation law

This paper is limited to the one dimensional case but the alternating descent methodcan also be extended to the multi-dimensional frame although this requires a much morecareful geometric treatment of shocks since for instance in 2minusd these are curves evolvingin time whose location has to be carefully determined and their motion and perturbationhandled carefully to avoid numerical instabilities (see [16])

There are other possible methods that could be employed to deal with these problemsIn the present 1 minus d setting for instance whenever the flux is convex one could use thetechniques developed in [1] based on the Lax-Oleinik representation formula of solutionsOne could also employ the nudging method developed by J Blum and coworkers ([2] [3])It would be interesting to compare in a systematic manner the results obtained in thisarticle with those that could be achieved by these other methods

The rest of this paper is organized as follows In Section 2 we formulate the problemunder consideration more precisely and prove the existence of minimizers

In Section 3 we introduce the discrete approximation of the continuous optimal controlproblem We prove the existence of minimizers for the discrete problem and their Γ-convergence towards the continuous ones as the mesh-size parameters tend to zero

As we shall see purely discrete approaches based on the minimization of the resultingdiscrete functionals by descent algorithms lead to very slow iterative processes We thusneed to introduce an alternated descent algorithm that takes into account the possiblepresence of shock discontinuities in solutions For doing this the first step is to develop acareful sensitivity analysis This is done in Section 4

In Section 5 we present the alternating descent method which combines the advantagesof both the discrete approach and the sensitivity analysis in the presence of shocks

In Section 6 we explain how to implement both descent algorithms The discrete ap-proach consists mainly in applying a descent algorithm to the discrete version J∆ of thefunctional J while the alternating descent method by the contrary is a continuous methodbased on the analysis of the previous section on the sensitivity of shocks

In Section 7 we present some numerical experiments illustrating the overall efficiency ofthe alternating descent method We conclude discussing some possible extensions of theresults and methods presented in the paper

2 Existence of Minimizers

In this section we prove that under certain conditions on the set of admissible initialdata Uad there exists at least one minimizer of the functional J given in (11) To do thiswe consider the class of admissible initial data Uad as

Uad = u0 isin Linfin(R) capBV (R) supp(u0) sub K u0Linfin + TV (u0) le C (21)

4 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where K sub R is a given compact set and C gt 0 a given constant Note however that thesame theoretical results and descent strategies we shall develop here can be applied to amuch wider class of admissible sets

Theorem 21 Assume that ud isin L2(Rtimes (0 T )) let Uad be defined in (21) and f be a C1

function Then the minimization problem

minu0isinUab

J(u0) (22)

has at least one minimizer u0min isin Uad Moreover uniqueness is false in general

The proof follows the classical strategy of the Direct Method of the Calculus of Vari-ations We refer the reader to [8] for a similar proof in the case where the functional Jis replaced by the one in which the distance to a given target at the final time t = T isminimized Note that in particular for the class Uad of admissible initial data consideredsolutions enjoy uniform BV bounds allowing to prove the needed compactness propertiesto pass to the limit In the case of convex fluxes using the well-known one-sided Lipschitzcondition the class of admissible initial data can be further extended

We also observe that as indicated in [8] the uniqueness of the minimizer is in generalfalse for this type of optimization problems

3 The Discrete Minimization Problem

The purpose of this section is to show that discrete minimizers obtained by a numericalapproximation of (11) and (12) converge to a minimizer of the continuous problem asthe mesh-size tends to zero This justifies the usual engineering practice of replacing thecontinuous functional and model by discrete ones to compute an approximation of thecontinuous minimizer

Let us introduce a mesh in Rtimes [0 T ] given by (xi tn) = (i∆x n∆t) (i = minusinfin infin n =

0 N + 1 so that (N + 1)∆t = T ) and let unij be a numerical approximation of u(xi tn)

obtained as solution of a suitable discretization of the equation (12)Let us consider the following approximation of the functional J in (11)

J∆(u0∆) =

∆x∆t

2

Nsumn=0

infinsumi=minusinfin

ρni(uni minus (ud)ni

)2 (31)

where u0∆ = u0

i is the discrete initial datum and ud∆ = (ud)ni = Π∆ud is the discretiza-

tion of the target ud at (xi tn) Π∆ being a discretization operator A common choice

consists in taking

Π∆ud = (ud)ni =

1

∆x∆t

int xi+12

ximinus12

int tn+12

tnminus12

ud(x t) dxdt (32)

where xiplusmn12 = xi plusmn∆x2 and tnplusmn12 = tn plusmn∆t2Moreover we introduce an approximation of the class of admissible initial data Uad

denoted by Uad∆ and constituted by sequences ϕ∆ = ϕiiisinZ for which the associated

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 5

piecewise constant interpolation function that we still denote by ϕ∆ defined by

ϕ∆(x) = ϕi x isin (ximinus12 xi+12)

satisfies ϕ∆ isin Uad Obviously Uad∆ coincides with the class of discrete vectors withsupport on those indices i such that xi isin K and for which the discrete Linfin and TV -normsare bounded above by the same constant C

Let us consider S∆ l1(Z)rarr l1(Z) an explicit numerical scheme for (12) where

un∆ = Sn∆u

0∆ (33)

is the approximation of the entropy solution u(middot t) = S(t)u0 of (12) ie un∆ S(t)u0with t = n∆t Here S L1(R) cap Linfin(R) cap BV (R) rarr L1(R) cap Linfin(R) cap BV (R) is thesemigroup (solution operator)

S u0 rarr u = Su0

which associates to the initial condition u0 isin L1(R)capLinfin(R)capBV (R) the entropy solutionu of (12)

For each ∆ = ∆t (with λ = ∆t∆x fixed typically given by the corresponding CFL-condition for explicit schemes) it is easy to see that the discrete analogue of Theorem21 holds In fact this is automatic in the present setting since Uad∆ only involves afinite number of mesh-points But passing to the limit as ∆ rarr 0 requires a more carefultreatment In fact for that to be done one needs to assume that the scheme underconsideration (33) is a contracting map in l1(Z)

Thus we consider the following discrete minimization problem Find u0min∆ such that

J∆(u0min∆ ) = min

u0∆isinUad∆

J∆(u0∆) (34)

The following holds

Theorem 31 Assume that un∆ is obtained by a numerical scheme (33) which satisfiesthe following

bull For a given u0 isin Uad un∆ = Sn∆Π∆u

0 converges to u(x t) the entropy solution of(12) More precisely if n∆t = t and T0 gt 0 is any given number

max0letleT0

un∆ minus u(middot t)L1(R) rarr 0 as ∆rarr 0 (35)

bull The map S∆ is Linfin-stable ie

S∆u0∆Linfin le u0

∆Linfin (36)

bull The map S∆ is a contracting map in l1(Z) ie

S∆u0∆ minus S∆v

0∆L1 le u0

∆ minus v0∆L1 (37)

Then

bull For all ∆ the discrete minimization problem (34) has at least one solution u0min∆ isin

Uad∆

bull Any accumulation point of u0min∆ with respect to the weak-lowast topology in Linfin as

∆rarr 0 is a minimizer of the continuous problem (22)

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 4: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

4 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where K sub R is a given compact set and C gt 0 a given constant Note however that thesame theoretical results and descent strategies we shall develop here can be applied to amuch wider class of admissible sets

Theorem 21 Assume that ud isin L2(Rtimes (0 T )) let Uad be defined in (21) and f be a C1

function Then the minimization problem

minu0isinUab

J(u0) (22)

has at least one minimizer u0min isin Uad Moreover uniqueness is false in general

The proof follows the classical strategy of the Direct Method of the Calculus of Vari-ations We refer the reader to [8] for a similar proof in the case where the functional Jis replaced by the one in which the distance to a given target at the final time t = T isminimized Note that in particular for the class Uad of admissible initial data consideredsolutions enjoy uniform BV bounds allowing to prove the needed compactness propertiesto pass to the limit In the case of convex fluxes using the well-known one-sided Lipschitzcondition the class of admissible initial data can be further extended

We also observe that as indicated in [8] the uniqueness of the minimizer is in generalfalse for this type of optimization problems

3 The Discrete Minimization Problem

The purpose of this section is to show that discrete minimizers obtained by a numericalapproximation of (11) and (12) converge to a minimizer of the continuous problem asthe mesh-size tends to zero This justifies the usual engineering practice of replacing thecontinuous functional and model by discrete ones to compute an approximation of thecontinuous minimizer

Let us introduce a mesh in Rtimes [0 T ] given by (xi tn) = (i∆x n∆t) (i = minusinfin infin n =

0 N + 1 so that (N + 1)∆t = T ) and let unij be a numerical approximation of u(xi tn)

obtained as solution of a suitable discretization of the equation (12)Let us consider the following approximation of the functional J in (11)

J∆(u0∆) =

∆x∆t

2

Nsumn=0

infinsumi=minusinfin

ρni(uni minus (ud)ni

)2 (31)

where u0∆ = u0

i is the discrete initial datum and ud∆ = (ud)ni = Π∆ud is the discretiza-

tion of the target ud at (xi tn) Π∆ being a discretization operator A common choice

consists in taking

Π∆ud = (ud)ni =

1

∆x∆t

int xi+12

ximinus12

int tn+12

tnminus12

ud(x t) dxdt (32)

where xiplusmn12 = xi plusmn∆x2 and tnplusmn12 = tn plusmn∆t2Moreover we introduce an approximation of the class of admissible initial data Uad

denoted by Uad∆ and constituted by sequences ϕ∆ = ϕiiisinZ for which the associated

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 5

piecewise constant interpolation function that we still denote by ϕ∆ defined by

ϕ∆(x) = ϕi x isin (ximinus12 xi+12)

satisfies ϕ∆ isin Uad Obviously Uad∆ coincides with the class of discrete vectors withsupport on those indices i such that xi isin K and for which the discrete Linfin and TV -normsare bounded above by the same constant C

Let us consider S∆ l1(Z)rarr l1(Z) an explicit numerical scheme for (12) where

un∆ = Sn∆u

0∆ (33)

is the approximation of the entropy solution u(middot t) = S(t)u0 of (12) ie un∆ S(t)u0with t = n∆t Here S L1(R) cap Linfin(R) cap BV (R) rarr L1(R) cap Linfin(R) cap BV (R) is thesemigroup (solution operator)

S u0 rarr u = Su0

which associates to the initial condition u0 isin L1(R)capLinfin(R)capBV (R) the entropy solutionu of (12)

For each ∆ = ∆t (with λ = ∆t∆x fixed typically given by the corresponding CFL-condition for explicit schemes) it is easy to see that the discrete analogue of Theorem21 holds In fact this is automatic in the present setting since Uad∆ only involves afinite number of mesh-points But passing to the limit as ∆ rarr 0 requires a more carefultreatment In fact for that to be done one needs to assume that the scheme underconsideration (33) is a contracting map in l1(Z)

Thus we consider the following discrete minimization problem Find u0min∆ such that

J∆(u0min∆ ) = min

u0∆isinUad∆

J∆(u0∆) (34)

The following holds

Theorem 31 Assume that un∆ is obtained by a numerical scheme (33) which satisfiesthe following

bull For a given u0 isin Uad un∆ = Sn∆Π∆u

0 converges to u(x t) the entropy solution of(12) More precisely if n∆t = t and T0 gt 0 is any given number

max0letleT0

un∆ minus u(middot t)L1(R) rarr 0 as ∆rarr 0 (35)

bull The map S∆ is Linfin-stable ie

S∆u0∆Linfin le u0

∆Linfin (36)

bull The map S∆ is a contracting map in l1(Z) ie

S∆u0∆ minus S∆v

0∆L1 le u0

∆ minus v0∆L1 (37)

Then

bull For all ∆ the discrete minimization problem (34) has at least one solution u0min∆ isin

Uad∆

bull Any accumulation point of u0min∆ with respect to the weak-lowast topology in Linfin as

∆rarr 0 is a minimizer of the continuous problem (22)

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 5: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 5

piecewise constant interpolation function that we still denote by ϕ∆ defined by

ϕ∆(x) = ϕi x isin (ximinus12 xi+12)

satisfies ϕ∆ isin Uad Obviously Uad∆ coincides with the class of discrete vectors withsupport on those indices i such that xi isin K and for which the discrete Linfin and TV -normsare bounded above by the same constant C

Let us consider S∆ l1(Z)rarr l1(Z) an explicit numerical scheme for (12) where

un∆ = Sn∆u

0∆ (33)

is the approximation of the entropy solution u(middot t) = S(t)u0 of (12) ie un∆ S(t)u0with t = n∆t Here S L1(R) cap Linfin(R) cap BV (R) rarr L1(R) cap Linfin(R) cap BV (R) is thesemigroup (solution operator)

S u0 rarr u = Su0

which associates to the initial condition u0 isin L1(R)capLinfin(R)capBV (R) the entropy solutionu of (12)

For each ∆ = ∆t (with λ = ∆t∆x fixed typically given by the corresponding CFL-condition for explicit schemes) it is easy to see that the discrete analogue of Theorem21 holds In fact this is automatic in the present setting since Uad∆ only involves afinite number of mesh-points But passing to the limit as ∆ rarr 0 requires a more carefultreatment In fact for that to be done one needs to assume that the scheme underconsideration (33) is a contracting map in l1(Z)

Thus we consider the following discrete minimization problem Find u0min∆ such that

J∆(u0min∆ ) = min

u0∆isinUad∆

J∆(u0∆) (34)

The following holds

Theorem 31 Assume that un∆ is obtained by a numerical scheme (33) which satisfiesthe following

bull For a given u0 isin Uad un∆ = Sn∆Π∆u

0 converges to u(x t) the entropy solution of(12) More precisely if n∆t = t and T0 gt 0 is any given number

max0letleT0

un∆ minus u(middot t)L1(R) rarr 0 as ∆rarr 0 (35)

bull The map S∆ is Linfin-stable ie

S∆u0∆Linfin le u0

∆Linfin (36)

bull The map S∆ is a contracting map in l1(Z) ie

S∆u0∆ minus S∆v

0∆L1 le u0

∆ minus v0∆L1 (37)

Then

bull For all ∆ the discrete minimization problem (34) has at least one solution u0min∆ isin

Uad∆

bull Any accumulation point of u0min∆ with respect to the weak-lowast topology in Linfin as

∆rarr 0 is a minimizer of the continuous problem (22)

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 6: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

6 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The proof of this result can be developed as in [8] We also refer to [16] for an extensionto the multi-dimensional case

This convergence result applies for 3-point conservative numerical approximation schemeswhere S∆ is given by

(S∆v∆)i = vi minus∆t

∆x(g(vi+1 vi)minus g(vi viminus1)) (38)

and g is the numerical flux This scheme is consistent with the corresponding equation(12) when g(u u) = f(u)

When the discrete semigroup S∆(u v w) = v minus λ(g(u v)minus g(v w)) with λ = ∆t∆x ismonotonic increasing with respect to each argument the scheme is also monotone Thisensures the convergence to the weak entropy solutions of the continuous conservation lawas the discretization parameters tend to zero under a suitable CFL condition (see Ref[12] Chap 3 Th 42)

All this analysis and results apply to the classical Godunov Lax-Friedfrichs and Engquist-Osher schemes the corresponding numerical flux being

gG(u v) =

minwisin[uv] f(w) if u le vmaxwisin[uv] f(w) if u ge v

(39)

gLF (u v) =(f(u) + f(v))

2minus (v minus u)

2λx (310)

gEO(u v) =f(u) + f(v)minus

int v

u|f prime(τ)|dτ

2 (311)

See Chapter 3 in [12] for more detailsThese 1D methods satisfy the conditions of Theorem 31

4 Sensitivity analysis the continuous approach

We divide this section in three subsections Specifically in the first one we consider thecase where the solution u of (12) has no shocks In the second and third subsections weanalyze the sensitivity of the solution and the functional in the presence of a single shock

41 Sensitivity without shocks In this subsection we give an expression for the sen-sitivity of the functional J with respect to the initial datum based on a classical adjointcalculus for smooth solutions First we present a formal calculus and then we show howto justify it when dealing with a classical smooth solution for (12)

Let C10(R) be the set of C1 functions with compact support and let u0 isin C1

0(R) be agiven initial datum for which there exists a classical solution u(x t) of (12) that can beextended to a classical solution in t isin [0 T + τ ] for some τ gt 0 Note that this imposessome restrictions on u0 other than being smooth

Let δu0 isin C10(R) be any possible variation of the initial datum u0 Due to the finite

speed of propagation this perturbation will only affect the solution in a bounded set ofRtimes [0 T ] This simplifies the argument below that applies in a much more general settingprovided solutions are smooth enough

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 7: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 7

Then for ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum

uε0(x) = u0(x) + εδu0(x) (41)

is also a classical solution in (x t) isin Rtimes (0 T ) and uε isin C1(Rtimes [0 T ]) can be written as

uε = u+ εδu+ o(ε)with respect to the C1 topology (42)

where δu is the solution of the linearized equation

parttδu+ partx(f prime(u)δu) = 0 in Rtimes (0 T ) δu(x 0) = δu0(x) x isin R (43)

Let δJ be the Gateaux derivative of J at u0 in the direction δu0 We have

δJ(u0)[δu0] =

int T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt (44)

where δu solves the linearized system above (43) Now we introduce the adjoint system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Rtimes (0 T ) p(x T ) = 0 x isin R (45)

Multiplying the equations satisfied by δu by p integrating by parts and taking intoaccount that p satisfies (45) we easily obtainint T

0

intRρ(x t)(u(x t)minus ud(x t))δu(x t)dxdt =

intRp(x 0)δu0(x)dx (46)

Thus δJ in (44) can be written as

δJ(u0)[δu0] =

intRp(x 0)δu0(x)dx (47)

This expression provides an easy way to compute a descent direction for the continuousfunctional J once we have computed the adjoint state We just take

δu0(x) = minusp(x 0) (48)

Under the assumptions above on u0 u δu and p can be obtained from their data u0(x)δu0(x) and ρ (u minus ud) by using the characteristic curves associated to (12) For the sakeof completeness we briefly explain this below

The characteristic curves associated to (12) are defined by

xprime(t) = f prime(u(x(t) t)) t isin (0 T ) x(0) = x0 (49)

They are straight lines whose slopes depend on the initial data

x(t) = x0 + tf prime(u0(x0)) t isin (0 T )

As we are dealing with classical solutions u is constant along such curves and by assump-tion two different characteristic curves do not meet each other in R times [0 T + τ ] Thisallows to define u in Rtimes [0 T + τ ] in a unique way from the initial data

For ε gt 0 sufficiently small the solution uε(x t) corresponding to the initial datum(41) has similar characteristics to those of u This allows guaranteeing that two differentcharacteristic lines do not intersect for 0 le t le T if ε gt 0 is small enough Note that uε

may possibly be discontinuous for t isin (T T +τ ] if u0 generates a discontinuity at t = T +τ

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 8: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

8 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

but this is irrelevant for the analysis in [0 T ] we are carrying out Therefore uε(x t) is alsoa classical solution in (x t) isin R times [0 T ] and it is easy to see that the solution uε can bewritten as (42) where δu satisfies (43)

System (43) can be solved again by the method of characteristics In fact as u is aregular function the first equation in (43) can be written as

parttδu+ f prime(u)partxδu = minuspartx(f prime(u)) δu (410)

ied

dtδu(x(t) t) = minuspartx(f prime(u)) δu (411)

where x(t) are the characteristic curves defined by (49) Thus the solution δu along acharacteristic line can be obtained from δu0 by solving this differential equation ie

δu(x(t) t) = δu0(x0) exp

(minusint t

0

partx(f prime(u))(x(s) s)ds

)

Finally the adjoint system (45) is also solved by characteristics ie

minus d

dtp(x(t) t) = ρ(x(t) t)(u(x(t) t)minus ud(x(t) t))

This yields the steepest descent direction in (48) for the continuous functional

p(x0 0) = u0(x0)

int T

0

ρ(x(s) s)dsminusint T

0

ρ(x(s) s)ud(x(s) s)ds

Remark 41 Note that for classical solutions the Gateaux derivative of J at u0 is given by(47) and this provides an obvious descent direction for J at u0 given by (48) Howeverthis fact is not very useful in practice since even when we initialize the iterative descentalgorithm with a smooth u0 we cannot guarantee that the solution remains classical alongthe iterative process

42 Sensitivity of the state in the presence of shocks Inspired in several results onthe sensitivity of solutions of conservation laws in the presence of shocks in one-dimension(see [7 4 5 6 17 13]) we focus on the particular case of solutions having a single shockBut the analysis can be extended to consider more general one-dimensional systems ofconservation laws with a finite number of noninteracting shocks We introduce the followinghypothesis

Hypothesis 42 Assume that u(x t) is a weak entropy solution of (12) with a discon-tinuity along a regular curve Σ = (ϕ(t) t) t isin (0 T ) which is Lipschitz continuousoutside Σ In particular it satisfies the Rankine-Hugoniot condition on Σ

ϕprime(t)[u]Σt = [f(u)]Σt

Here we have used the notation [v]Σt = limε0

v(ϕ(t) + ε t)minus v(ϕ(t)minus ε t) for the jump at

Σt = (ϕ(t) t) of any piecewise continuous function v with a discontinuity at Σt

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 9: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 9

Note that Σ divides Rtimes(0 T ) into two parts Qminus and Q+ the sub-domains of Rtimes(0 T )to the left and to the right of Σ respectively

As we will see in the presence of shocks to deal correctly with optimal control anddesign problems the state of the system needs to be viewed as constituted by the pair(u ϕ) combining the solution of (12) and the shock location ϕ This is relevant in theanalysis of sensitivity of functions below and when applying descent algorithms

We adopt the functional framework based on the generalized tangent vectors (see [7] andDefinition 41 in [8])

Let u0 be the initial datum that we assume to be Lipschitz continuous to both sidesof a single discontinuity located at x = ϕ0 and consider a generalized tangent vector(δu0 δϕ0) isin L1(R) times R for all 0 le T Let u0ε be a path which generates (δu0 δϕ0)For ε sufficiently small the solution uε(middot t) of (12) is Lipschitz continuous with a singlediscontinuity at x = ϕε(t) for all t isin [0 T ] Therefore uε(middot t) generates a generalizedtangent vector (δu(middot t) δϕ(t)) isin L1(R) times R Moreover in [8] it is proved that it satisfiesthe following linearized system

parttδu+ partx(f prime(u)δu) = 0 in Qminus cupQ+ (412)

d

dt([u]Σtδϕ) = [f prime(u)δu]Σt minus [δu]Σt

d

dtϕ t isin (0 T ) (413)

δu(x 0) = δu0(x) x lt ϕ0 cup x gt ϕ0 (414)

δϕ(0) = δϕ0 (415)

43 Sensitivity of the cost in the presence of shocks In this section we study thesensitivity of the functional J with respect to variations associated with the generalizedtangent vectors defined in the previous section We first define an appropriate generaliza-tion of the Gateaux derivative of J

Definition 43 Let J L1(R) rarr R be a functional and u0 isin L1(R) be Lipschitz contin-uous with a discontinuity in Σ0 an initial datum for which the solution of (12) satisfieshypothesis (42) J is Gateaux differentiable at u0 in a generalized sense if for any gener-alized tangent vector (δu0 δϕ0) and any family u0ε associated to (δu0 δϕ0) the followinglimit exists

δJ = limεrarr0

J(u0ε)minus J(u0)

ε

and it depends only on (u0 ϕ0) and (δu0 δϕ0) ie it does not depend on the particularfamily u0ε which generates (δu0 δϕ0) The limit is the generalized Gateux derivative of Jin the direction (δu0 δϕ0)

The following result easily provides a characterization of the generalized Gateaux deriv-ative of J in terms of the solution of the associated adjoint system (417) (418)(419)(420) (421) and (422)

Proposition 44 The Gateaux derivative of J can be written as follows

δJ(u0)[δu0 δϕ0] =

intRp(x 0)δu0(x)dxminus q(0)[u]Σ0δϕ0 (416)

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 10: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

10 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

where the adjoint state pair (p q) satisfies the system

minus parttpminus f prime(u)partxp = ρ (uminus ud) in Qminus cupQ+ (417)

[p]Σt = 0 t isin (0 T ) (418)

q(t) = p(ϕ(t) t) t isin (0 T ) (419)

minus d

dtq =

(1 + (ϕ)2)12[ρ (uminus ud)2]Σt

2[u]Σt

t isin (0 T ) (420)

p(x T ) = 0 x lt ϕ(T ) cup x gt ϕ(T ) (421)

q(T ) = 0 (422)

Let us briefly comment the result of Proposition 44 before giving its proofSystem (417)-(422) has a unique solution In fact to solve the backward system (417)-

(422) we first define the solution q on the shock Σ from the conditions for q (420) and(422) This determines the value of p along the shock We then propagate this informationtogether with (417) and (421) to both sides of Σ by characteristics (see Figure 1 wherewe illustrate this construction)

p is defined by the method of characteristics

p on the region of influence of the characteristics emanating from the shock

Xminus X+t = 0

t = T

Σ

Σ0

Figure 1 Characteristic lines entering on a shock and how they may beused to build the solution of the adjoint system both away from the shockand on its region of influence

Formula (416) provides an obvious way to compute a first descent direction of J at u0We just take

(δu0 δϕ0) = (minusp(middot 0) q(0)[u]Σ0) (423)

Here the value of δϕ0 must be interpreted as the optimal infinitesimal displacement of thediscontinuity of u0

In [8] when considering the inverse design problem it was observed that the solution pof the corresponding adjoint system at t = 0 was discontinuous with two discontinuities

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 11: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 11

one in each side of the original location of the discontinuity at Σ0 This was a reason notto use this descent direction and for introducing the alternating descent method In thepresent setting however the adjoint state p obtained is typically continuous This is due tothe fact that p at both side of the discontinuity is defined by the method of characteristicsand that on the region of influence of the characteristics emanating from the shock thecontinuity is preserved by the fact that on one hand q = q(t) itself is continuous as theprimitive of an integrable function and that the data for p and q at t = T are continuoustoo Despite of this as we shall see the implementation of the alternating descent directionmethod is worth since it significantly improves the results obtained by the purely discreteapproach

We finish this section with the proof of Proposition 44

Proof (of Proposition 44) A straightforward computation shows that J is Gateaux dif-ferentiable in the generalized sense of Definition 43 and that the generalized Gateauxderivative of J in the direction of the generalized tangent vector (δu0 δϕ0) is given by

δJ(u0)[δu0 δϕ0] =

int T

0

intxgtϕ(t)cupxltϕ(t)

ρ(x t)(u(x t)minus ud(x t))δu(x t) dxdt

minusint

Σ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (424)

where the pair (δu δϕ) solves the linearized problem (412)-(415) with initial data (δu0 δϕ0)Let us now introduce the adjoint system (417)-(422) Multiplying the equations of δu

by p and integrating we get

0 =

int T

0

intxgtϕ(t)cupxltϕ(t)

(parttδu+ partx(f prime(u)δu)) p dxdt = minusint T

0

intxgtϕ(t)cupxltϕ(t)

δu (parttp+ f prime(u)partxp) dxdt

+

intxgtϕ(T )cupxltϕ(T )

δu(x T )p(x T )dxminusint

xgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

minusint

Σ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t) (425)

where (nx nt) are the Cartesian components of the normal vector to the curve ΣTherefore since p satisfies the adjoint equation (417) (421) from (424) we obtain

δJ(u0)[δu0 δϕ0] =

intxgtϕ0cupxltϕ0

δu0(x)p(x 0)dx

+

intΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ(t)minus

intΣ

(uminus ud)2

2

]Σt

δϕ(t) dΣ(t) (426)

The last two terms in the right hand side of (426) will determine the conditions that pmust satisfy on the shock

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 12: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

12 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Observe that for any functions f g we have

[fg]Σt = f [g]Σt + g[f ]Σt

where g represents the average of g to both sides of the shock Σt ie

g(t) =1

2limε0

(g(ϕ(t) + ε t) + g(ϕ(t)minus ε t)) forallt isin (0 T )

Thus we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]Σt middot nxΣ

)dΣ =

intΣ

[p]Σt

(δunt

Σ + f prime(u)δu middot nxΣ

)dΣ

+

intΣ

p([δu]Σtnt

Σ + [f prime(u)δu]Σt middot nxΣ

)dΣ

(427)

Now we note that the Cartesian components of the normal vector to Σ are given by

nt =minusϕprime(t)radic

1 + (ϕprime(t))2 nx =

1radic1 + (ϕprime(t))2

and dΣ(t) =radic

1 + (ϕprime(t))2dt Using (413) (418) we haveintΣ

([δu p]Σtnt

Σ + [f prime(u)δu p]ΣtnxΣ

)dΣ =

int T

0

p (minus[δu]Σtϕprime(t) + [f prime(u)δu]Σt) dt

=

int T

0

pd

dt([u]Σtδϕ) dt (428)

Therefore replacing (419) (420) (422) and (428) in (426) we obtain (416)This concludes the proof

5 The alternating descent method

Here we explain how the alternating descent method introduced in [8] can be adapted tothe present optimal control problem (13)

First let us introduce some notation We consider two points

Xminus = ϕ(T )minus Tf prime(uminus(ϕ(T ) T )) X+ = ϕ(T )minus Tf prime(u+(x T )) (51)

(p q) the solutions of (417)-(422) and

p0minus = limxXminus

p(x 0) p0+ = limxX+

p(x 0)

Now we introduce the two classes of descent directions we shall use in our descentalgorithm

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 13: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 13

First directions With this set of directions we mainly modify the profile of u0 Weset the first directions d1 = (δu0 δϕ0) given by

δu0 =

minusp(x 0) x lt Xminusminusp0minus x isin (Xminus ϕ0)minusp0+ x isin (ϕ0 X+)minusp(x 0) x gt X+

δϕ0 =

0 if

X+intXminus

p(x 0)δu0(x)dx le 0

X+intXminus

p(x 0)δu0(x)dx

[u0]Σ0q(0) if

X+intXminus

p(x 0)δu0(x)dx gt 0 and q(0) 6= 0

(52)

We note that if q(0) = 0 andX+intXminus

p(x 0)δu0(x)dx gt 0 these directions are not

definedSecond directions They are aimed to move the shock without changing the profile

of the solution to both sides Then d2 = (δu0 δϕ0) with

δu0 equiv 0 δϕ0(x) = [u0]Σ0q(0) (53)

We observe that d1 given by (52) satisfies

δJ(u0)[d1] = minusintxisin[XminusX+]

|p(x 0)|2dx

minusp0minusint ϕ0

Xminusp(x 0)dxminus p0+

int X+

ϕ0

p(x 0)dxminus [u0]Σ0q(0)δϕ0 le 0 (54)

Note that in those cases where d1 is not defined as indicated above this descent directionis simply not employed in the descent algorithm

And for d2 given by (53) we have

δJ(u0)[d2] = minus([u0]Σ0q(0))2 le 0 (55)

Thus the two classes of descent directions under consideration have three importantproperties

(1) They are both descent directions(2) They allow to split the design of the profile and the shock location(3) They are true generalized gradients and therefore keep the structure of the data

without increasing its complexity

6 Numerical approximation of the descent direction

We have computed the gradient of the continuous functional J in several cases (u smoothor having shock discontinuities) but in practice one has to work with discrete versions of

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 14: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

14 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

the functional J In this section we discuss two possible ways of searching discrete descentdirections based either on the discrete or the continuous approaches and in the later onthe alternating descent method

The discrete approach consists mainly in applying directly a descent algorithm to thediscrete version J∆ of the functional J by using its gradient The alternating descentmethod by the contrary is a method inspired on the continuous analysis of the previoussection in which the two main classes of descent directions that are first identified andlater discretized

Let us first discuss the discrete approach

61 The discrete approach Let us consider the approximation of the functional J byJ∆ defined (11) and (31) respectively We shall use the Engquist-Osher scheme whichis a 3-point conservative numerical approximation scheme for (12) More explicitly weconsider

un+1i = uni minus

∆t

∆x

(g(uni+1 u

ni )minus g(uni u

niminus1)) i isin Z n = 0 N (61)

where g is the numerical flux defined in (311)The gradient of the discrete functional J∆ requires computing one derivative of J∆ with

respect to each node of the mesh This can be done in a cheaper way using the discreteadjoint state We illustrate it for the Engquist-Osher numerical scheme However asthe discrete functionals J∆ are not necessarily convex the gradient methods could possiblyprovide sequences that do not converge to a global minimizer of J∆ But this drawback anddifficulty appears in most applications of descent methods in optimal design and controlproblems

As we will see in the present context the approximations obtained by discrete gradientmethods are satisfactory although convergence is slow due to unnecessary oscillations thatthe descent method introduces

The gradient of J∆ rigorously speaking requires the linearization of the numericalscheme (61) used to approximate the equation (12) Then the linearization correspondsto

δun+1i = δuni minus

∆t

∆x

(partag(uni+1 u

ni )δuni+1 minus partbg(uni u

niminus1)δuniminus1

)minus∆t

∆x

(partbg(uni+1 u

ni )minus partag(uni u

niminus1))δuni

i isin Z n = 0 N (62)

In view of this the discrete adjoint system of (62) can also be written for the differen-tiable flux functions

pN+1i = 0 i isin Z

pni = pn+1i +

∆t

∆xpartbg(uni+1 u

ni )(pn+1i+1 minus pn+1

i

)+

∆t

∆xpartag(uni u

niminus1)

(pn+1i minus pn+1

iminus1

)+ F n

i

i isin Z n = 0 N (63)

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 15: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 15

whereF ni = ∆tρni

(uni minus (ud)ni

) i isin Z n = 0 N (64)

In fact when multiplying the equations in (62) by pn+1i and adding in i isin Z and

n = 0 N the following identity is easily obtained

∆xsumiisinZ

p0i δu

0i = ∆x

Nsumn=0

sumiisinZ

F ni δu

0i (65)

This is the discrete version of formula (46) which allows us to simplify the derivative ofthe discrete cost functional

Thus for any variation δu0∆ the Gateaux derivative of the cost functional defined in

(31) is given by

δJ∆ = ∆x∆tNsum

n=0

sumiisinZ

ρni(uni minus (ud)ni

)δuni (66)

where δunij solves the linearized system (62) If we consider pni the solution of (63) with(64) we obtain that δJ∆ in (66) can be written as

δJ∆ = ∆xsumiisinZ

p0i δu

0i

and this allows to obtain easily the steepest descent direction for J∆ by considering

δu0∆ = minusp0

∆ (67)

Remark 61 We observe that for the Enquis-Osherrsquos flux (311) the system (63) is theupwind method for the continuous adjoint system We do not address here the problemof the convergence of this adjoint scheme towards the solution of the continuous adjointsystem Of course this is an easy matter when u is smooth but it is far from being trivialwhen u has shock discontinuities Whether or not this discrete adjoint system as ∆rarr 0allows reconstructing the complete adjoint system with the inner Dirichlet condition alongthe shock (417)(418)(419)(420)(421) and (422) constitutes an interesting problemfor future research We refer to [14] and [18] for preliminary work on this direction inone-dimension

62 The alternating descent method Now we describe the implementation of thealternating descent method

The main idea is to approximate a minimizer of J alternating between two directions ofdescent First we perturb the initial datum u0 using the direction (52) which principallychanges the profile of u0 Second we move the shock curve without altering the profile ofu0 at both sides of Σ using the direction (53)

More precisely for a given initialization u0∆ and target function ud∆ we compute Σ0

∆ thejump-point of u0

∆u∆ and p0∆ as the solutions of (62) (63) respectively

As numerical approximation of the adjoint state q(0) we take the value of p0∆ over the

point Σ0∆ ie

q0∆ = p0

∆(Σ0∆) = p0

i Σ0∆ isin (ximinus12 xi+12)

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 16: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

16 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

The main advantage of this method introduced in [8] is that for an initial datum u0

with a single discontinuity the descent directions are generalized tangent vectors ie theyintroduce Lipschitz continuous variations of u0 at both sides of the discontinuity and adisplacement of the shock position In this way the new datum obtained modifying theold one in the direction of this generalized tangent vector has again a single discontinuityThe method can be applied in a much more general context in which for instance thesolution has various shocks since the method is able both to generate shocks and to destroythem if any of these facts contributes to the decrease of the functional This method isin some sense close to those employed in shape design in elasticity in which topologicalderivatives (that would correspond to controlling the location of the shock in our method)are combined with classical shape deformations (that would correspond to simply shapingthe solution away from the shock in the present setting) [11]

As mentioned above in the context of the tracking problem we are considering thesolutions of the adjoint system do not increase the number of shocks Thus a directapplication of the continuous method without employing this alternating variant couldmake sense Note that the purely discrete method is a limited version of the continuous onesince one simply employs the descent direction indicated by p0 without paying attention tothe value q0 corresponding to the shock sensitivity However the numerical experimentswe have done with the full continuous method do not improve the results obtained by thediscrete one since the definition of the location of the shocks is lost along the iterativeprocess

7 Numerical experiments

In this section we present some numerical experiments which illustrate the results ob-tained in an optimization model problem with each one of the numerical methods describedin the previous section

We have chosen as computational domain the interval (minus4 4) and we have taken asboundary conditions in (61) at each time step t = tn the value of the initial data at theboundary This can be justified if we assume that the initial datum u0 is constant in asufficiently large inner neighborhood of the boundary x = plusmn4 (which depends on the sizeof the Linfin-norm of the data under consideration and the time horizon T ) due to the finitespeed of propagation A similar procedure is employed for the adjoint equation

We underline once more that the solutions obtained with each method may correspondto global minima or local ones since the gradient algorithm does not distinguish them

In the experiments we consider the Burgersrsquo equation ie f(z) = z22

parttu+ partx

(u2

2

)= 0 u(x 0) = u0(x) (71)

The weight function ρ under consideration in the experiments is given by

ρ(x t) =

1 t isin (T2 T )0 otherwise

And the time horizon T = 1

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 17: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 17

To compare the efficiency of the different methods we consider a fixed ∆x = 120 λ =∆t∆x = 23 (which satisfies the CFL condition) We then analyze the number of itera-tions that each method needs to attain a prescribed value of the functional

71 Experiment 1 We first consider a piecewise constant target profile ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

07 x isin [minus2 1]0 otherwise

(72)

Note that in this case (72) yields a particular solution of the optimization problemand the minimum value of J vanishes

We solve the optimization problem (13) with the above described different methodsstarting from the following initialization for u0

u0(x) =

05 x le 0minus01 x gt 0

(73)

which also has a discontinuity but located on a different pointIn Figure 2 we plot log(J) with respect to the number of iterations for both the purely

discrete method and the alternating descent one We see that the latter stabilizes in feweriterations

Figure 2 Experiment 1 log(J) versus the number of iterations in thedescent algorithm for the discrete and the alternating descent methods

In Figures 3 and 4 we present the minimizers obtained by the methods above and theassociated solutions Figures 5 and 6

The initial datum u0 obtained by the alternating descent method (Figures 4) is a goodapproximation of (72) The solution given by the discrete approach (Figures 3) presentsadded spurious oscillations Furthermore the discrete method is much slower and does notachieve the same level of accuracy since the functional J∆ does not decrease so much

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 18: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

18 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

Figure 3 Experiment 1 u0discrete method iteration k =

999

Figure 4 Experiment 1 u0alternating descent method it-eration k = 38

Figure 5 Experiment 1 So-lution u(x t) discrete methoditeration k = 999

Figure 6 Experiment 1 So-lution u(x t) alternating de-scent method iteration k = 38

72 Experiment 2 The previous experiment indicates that the alternating descent methodperforms significantly better In order to show that this is a systematic fact which arisesindependently of the initialization of the method we consider the target ud given by thesolution of (71) with the initial condition (ud)0 given by

(ud)0(x) =

05 x le 050 otherwise

(74)

but this time we compare the performance of both methods starting from different initial-izations

The obtained numerical results are presented in Figure 7We see that regardless the initialization considered the alternating descent method

performs significantly better

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 19: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 19

u0 obtained by the discrete

approach

u0 obtained by the alternating

descent method

The value of the functional

Figure 7 Experiment 2 We present a comparison of the results obtainedwith both methods starting out of five different initialization configurationsIn the left column we exhibit the results obtained with the discrete methodIn the second one those achieved by the alternating descent method In thelast one we plot the evolution of the functional with both methods

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 20: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

20 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

We observe that in the five experiments the alternating descent method perfumes betterensuring the descent of the functional in much fewer iterations and yielding smoother lessoscillatory approximation of the minimizer

Note also that the discrete method rather than yielding discontinuous approximationsof the minimizer as the alternating descent method does it produces an initial datum witha Lipschitz front Observe that these are two different configurations that can lead tothe same evolution for the Burgers equation after some time once the front develops thediscontinuity This is in agreement with the fact that the functional to be minimized isonly active in the time-interval T2 le t le T

8 Conclusions and perspectives

In this paper we have adapted and presented the alternating descent method for atracking problem for a 1minus d scalar conservation law the goal being to identify an optimalinitial datum so that the solution gets as close as possible to a given trajectory Wehave exhibited in a number of examples the better performance of the alternating descentmethod with respect to the classical discrete method Thus the paper extends previousworks on other optimization problems such as the inverse design or the optimization of theflux function

The developments in this paper raise a number of interesting problems and questionsthat will deserve further investigation We summarize here some of them

bull The alternating descent method and the discrete one do not seem to yield the sameminimizer This should be further investigated in a more systematic manner inother experiments

The minimizer that the discrete method yields seems to replace the shock of theinitial datum by a Lipschitz function which eventually develops the same dynamicswithin the time interval T2 le t le T in which the functional we have chosen isactive This is an agreement with the behavior of these methods in the context ofthe classical problem of inverse design in which one aims to find the initial datumso that the solution at the final time takes a given value It would be interesting toprove analytically that the two methods may lead to different minimizers in somecircumstancesbull The experiments in this paper concern the case where the target ud is exactly

reachable It would be interesting to explore the performance of both methods inthe case where the target ud is not a solution of the underlying dynamicsbull It would be interesting to compare the performance of the methods presented in the

paper with the direct continuous method without introducing the alternating strat-egy Our preliminary numerical experiments indicate that the continuous methodwithout implementing the alternating strategy does not improve the results thatthe purely discrete strategy yieldsbull In [16] the problem of inverse design has been investigated adapting and extend-

ing the alternating descent method to the multi-dimensional case It would beinteresting to extend the analysis of this paper to the multidimensional case too

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 21: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE PRESENCE OF SHOCKS 21

bull It would worth to compare the results in this paper with those that could beachieved by the nudging method as in [2] [3]

AcknowledgementsThe authors would like to thank C Castro F Palacios and A Pozo for stimulating

discussions

References

[1] A Adimurthi S S Ghoshal and V Gowda Optimal controllability for scalar conservation laws withconvex flux 12 Jan 2012 3

[2] D Auroux and J Blum Back and forth nudging algorithm for data assimilation problems ComptesRendus Mathematique 340(12)873 ndash 878 2005 3 20

[3] D Auroux and M Nodet The back and forth nudging algorithm for data assimilation problems theoretical results on transport equations ESAIM Control Optimisation and Calculus of Variations18(2)318ndash342 7 2012 3 20

[4] C Bardos and O Pironneau A formalism for the differentiation of conservation laws C R MathAcad Sci Paris 335(10)839ndash845 2002 8

[5] F Bouchut and F James One-dimensional transport equations with discontinuous coefficients Non-linear Anal 32(7)891ndash933 1998 8

[6] F Bouchut and F James Differentiability with respect to initial data for a scalar conservation lawIn Hyperbolic problems theory numerics applications Vol I (Zurich 1998) volume 129 of InternatSer Numer Math pages 113ndash118 Birkhauser Basel 1999 8

[7] A Bressan and A Marson A variational calculus for discontinuous solutions of systems of conservationlaws Comm Partial Differential Equations 20(9-10)1491ndash1552 1995 8 9

[8] C Castro F Palacios and E Zuazua An alternating descent method for the optimal control of theinviscid burgers equation in the presence of shocks Mathematical Models and Methods in AppliedSciences 18(03)369ndash416 2008 2 3 4 6 9 10 12 16

[9] C Castro F Palacios and E Zuazua Optimal control and vanishing viscosity for the burgers equa-tion In C Constanda and M Perez editors Integral Methods in Science and Engineering Volume2 pages 65ndash90 Birkhauser Boston 2010 2

[10] C Castro and E Zuazua Flux identification for 1-d scalar conservation laws in the presence of shocksMath Comp 80(276)2025ndash2070 2011 2

[11] S Garreau P Guillaume and M Masmoudi The topological asymptotic for pde systems Theelasticity case SIAM J Control Optim 39(6)1756ndash1778 Dec 2000 16

[12] E Godlewski and P Raviart Hyperbolic systems of conservation laws Mathematiques and Applica-tions Ellipses Paris 1991 6

[13] E Godlewski and P A Raviart The linearized stability of solutions of nonlinear hyperbolic systemsof conservation laws A general numerical approach Math Comput Simulation 50(1-4)77ndash95 1999Modelling rsquo98 (Prague) 8

[14] L Gosse and F James Numerical approximations of one-dimensional linear conservation equationswith discontinuous coefficients Mathematics of Computation 69(231)pp 987ndash1015 2000 15

[15] S N Kruzkov First order quasilinear equations with several independent variables Mat Sb (NS)81 (123)228ndash255 1970 2

[16] R Lecaros and E Zuazua Control of 2D scalar conservation laws in the presence of shocks preprint2014 3 6 20

[17] S Ulbrich Adjoint-based derivative computations for the optimal control of discontinuous solutions ofhyperbolic conservation laws Systems Control Lett 48(3-4)313ndash328 2003 Optimization and controlof distributed systems 8

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References
Page 22: TRACKING CONTROL OF 1D SCALAR CONSERVATION LAWS IN THE

22 RODRIGO LECAROS13 AND ENRIQUE ZUAZUA12

[18] X Wen and S Jin Convergence of an immersed interface upwind scheme for linear advection equationswith piecewise constant coefficients I L1-error estimates J Comput Math 26(1)1ndash22 2008 15

Rodrigo Lecaros13 and Enrique Zuazua12

1BCAM - Basque Center for Applied MathematicsMazarredo 14 E-48009 Bilbao Basque Country Spain

2Ikerbasque - Basque Foundation for ScienceAlameda Urquijo 36-5 Plaza Bizkaia 48011 Bilbao Basque Country Spain

3CMM - Centro de Modelamiento Matematico Universidad de Chile (UMI CNRS 2807)Avenida Blanco Encalada 2120 Casilla 170-3 Correo 3 Santiago Chile

E-mail address rlecarosdimuchilecl zuazuabcamathorg

URL wwwbcamathorgzuazua

  • 1 Introduction
  • 2 Existence of Minimizers
  • 3 The Discrete Minimization Problem
  • 4 Sensitivity analysis the continuous approach
    • 41 Sensitivity without shocks
    • 42 Sensitivity of the state in the presence of shocks
    • 43 Sensitivity of the cost in the presence of shocks
      • 5 The alternating descent method
      • 6 Numerical approximation of the descent direction
        • 61 The discrete approach
        • 62 The alternating descent method
          • 7 Numerical experiments
            • 71 Experiment 1
            • 72 Experiment 2
              • 8 Conclusions and perspectives
              • References