
Lecture 15

Nonlinear Problems

Newton’s Method

Syllabus

Lecture 01  Describing Inverse Problems
Lecture 02  Probability and Measurement Error, Part 1
Lecture 03  Probability and Measurement Error, Part 2
Lecture 04  The L2 Norm and Simple Least Squares
Lecture 05  A Priori Information and Weighted Least Squares
Lecture 06  Resolution and Generalized Inverses
Lecture 07  Backus-Gilbert Inverse and the Trade-off of Resolution and Variance
Lecture 08  The Principle of Maximum Likelihood
Lecture 09  Inexact Theories
Lecture 10  Nonuniqueness and Localized Averages
Lecture 11  Vector Spaces and Singular Value Decomposition
Lecture 12  Equality and Inequality Constraints
Lecture 13  L1, L∞ Norm Problems and Linear Programming
Lecture 14  Nonlinear Problems: Grid and Monte Carlo Searches
Lecture 15  Nonlinear Problems: Newton's Method
Lecture 16  Nonlinear Problems: Simulated Annealing and Bootstrap Confidence Intervals
Lecture 17  Factor Analysis
Lecture 18  Varimax Factors, Empirical Orthogonal Functions
Lecture 19  Backus-Gilbert Theory for Continuous Problems; Radon's Problem
Lecture 20  Linear Operators and Their Adjoints
Lecture 21  Fréchet Derivatives
Lecture 22  Exemplary Inverse Problems, incl. Filter Design
Lecture 23  Exemplary Inverse Problems, incl. Earthquake Location
Lecture 24  Exemplary Inverse Problems, incl. Vibrational Problems

Purpose of the Lecture

Introduce Newton’s Method

Generalize it to an Implicit Theory

Introduce the Gradient Method

Part 1

Newton’s Method

the grid search and the Monte Carlo method
are completely undirected

alternative: take directions from the
local properties of the error function E(m)

Newton’s Method

start with a guess m(p)

near m(p), approximate E(m) as a parabola and find its minimum

set the new guess to this value and iterate

[Figure: the error function E(m) versus m, with the parabolic approximation about the current estimate mnest; the minimum of the parabola gives the next estimate mn+1est, near the global minimum mGM.]

Taylor series approximation for E(m): expand E around a point m(p)

differentiate and set the result to zero to find the minimum

relate b and B to g(m)

linearized data kernel

formula for the approximate solution

very reminiscent of least squares
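The equations on these slides were lost in extraction. A sketch of the standard development, assuming the least squares error E(m) = ||d − g(m)||²:

\[
E(m) \approx E(m^{(p)}) + b^{T}\,\Delta m + \tfrac{1}{2}\,\Delta m^{T} B\,\Delta m,
\qquad \Delta m = m - m^{(p)}
\]

with b_i = ∂E/∂m_i and B_ij = ∂²E/∂m_i∂m_j evaluated at m^(p). Differentiating and setting the result to zero gives Δm = −B⁻¹ b. With the linearized data kernel G^(p)_ij = ∂g_i/∂m_j evaluated at m^(p),

\[
b = -2\, G^{(p)T}\left[\, d - g(m^{(p)}) \,\right],
\qquad
B \approx 2\, G^{(p)T} G^{(p)}
\]

so the approximate solution is

\[
m^{(p+1)} = m^{(p)} + \left[\, G^{(p)T} G^{(p)} \,\right]^{-1} G^{(p)T}\left[\, d - g(m^{(p)}) \,\right]
\]

which has the same form as the simple least squares solution.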

what do you do if you can't analytically differentiate g(m)?

use finite differences to numerically differentiate g(m) or E(m)

first derivative: perturb one model parameter at a time with the vector Δm [0, ..., 0, 1, 0, ..., 0]T

need to evaluate E(m) M+1 times

second derivative

need to evaluate E(m) about ½M² times
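A sketch of the finite-difference formulas these counts refer to, assuming a single step size Δm for every parameter:

\[
\frac{\partial E}{\partial m_i} \approx \frac{E(m + \Delta m\, e^{(i)}) - E(m)}{\Delta m},
\qquad e^{(i)} = [0, \dots, 0, 1, 0, \dots, 0]^{T}
\]

one evaluation per parameter plus one at m itself, hence M + 1 evaluations, and

\[
\frac{\partial^{2} E}{\partial m_i\, \partial m_j} \approx
\frac{E(m + \Delta m\, e^{(i)} + \Delta m\, e^{(j)}) - E(m + \Delta m\, e^{(i)}) - E(m + \Delta m\, e^{(j)}) + E(m)}{\Delta m^{2}}
\]

where, because the matrix of second derivatives is symmetric, only about ½M² of the (i, j) pairs need to be evaluated.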

what can go wrong?

convergence to a local minimum

[Figure: an error function E(m) with both a local minimum and the global minimum mGM; starting from mnest near the local minimum, the iterates mn+1est converge to the local minimum rather than the global one.]

analytically differentiate the sample inverse problem

di(xi) = sin(ω0 m1 xi) + m1 m2
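Differentiating this formula directly gives the two columns of the linearized data kernel used in the code below:

\[
\frac{\partial d_i}{\partial m_1} = \omega_0 x_i \cos(\omega_0 m_1 x_i) + m_2,
\qquad
\frac{\partial d_i}{\partial m_2} = m_1
\]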

[Figure: the sample problem solved with Newton's method — (A) the observed data d(x); (B) the error surface E(m1, m2) with the trajectory of the iterates; (C) the convergence of the error E and of m1 and m2 versus iteration.]

often, the convergence is very rapid

but

sometimes the solution converges to a local minimum

and

sometimes it even diverges

% Newton's (Gauss-Newton) method for the sample problem
mg = [1, 1]';                    % initial guess
for k = [1:Niter]
    % predicted data and error at the current guess
    dg = sin( w0*mg(1)*x ) + mg(1)*mg(2);
    dd = dobs - dg;
    Eg = dd'*dd;
    % linearized data kernel G(i,j) = d(d_i)/d(m_j)
    G = zeros(N,2);
    G(:,1) = w0*x.*cos(w0*mg(1)*x) + mg(2);
    G(:,2) = mg(1)*ones(N,1);
    % least squares solution for the correction
    dm = (G'*G)\(G'*dd);
    % update
    mg = mg + dm;
end

Part 2

Newton's Method for an Implicit Theory

implicit theory f(d,m) = 0 with Gaussian prediction error and a priori information about m

to simplify the algebra, group d and m into a single vector x
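One way to write the grouping; treating the measurement error and the a priori information as uncorrelated (an assumption) makes the combined covariance block diagonal:

\[
x = \begin{bmatrix} d \\ m \end{bmatrix},
\qquad
\langle x \rangle = \begin{bmatrix} d^{obs} \\ \langle m \rangle \end{bmatrix},
\qquad
[\mathrm{cov}\, x] = \begin{bmatrix} [\mathrm{cov}\, d] & 0 \\ 0 & [\mathrm{cov}\, m] \end{bmatrix}
\]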

[Figure: the probability density p(x) in the (x1, x2) plane, centered on the a priori values <x1> and <x2>.]

represent data and a priori model parameters as a Gaussian p(x)

f(x)=0 defines a surface in the space of x

maximize p(x) on this surface

maximum likelihood point is xest

[Figure: (A) the Gaussian p(x) in the (x1, x2) plane together with the surface f(x) = 0; the maximum likelihood point on the surface is (x1est, x2est). (B) the probability density along the surface, P(s), and the corresponding p(x1), with its maximum at x1ML.]

can get local maxima if f(x) is very non-linear

[Figure: the same construction for a very non-linear f(x) = 0; the probability density restricted to the surface now has several peaks, so the maximum likelihood point (x1est, x2est) may correspond to only a local maximum.]

mathematical statement of the problem

its solution (using Lagrange multipliers)

with Fij = ∂fi/∂xj

reminiscent of the minimum length solution

oops! x appears in 3 places

solution: iterate!

new value for x is x(p+1)

old value for x is x(p)
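The equations themselves did not survive extraction; a sketch of the standard statement and its Lagrange multiplier solution, in the notation introduced above:

\[
\text{minimize}\;\; [\, x - \langle x \rangle \,]^{T} [\mathrm{cov}\, x]^{-1} [\, x - \langle x \rangle \,]
\quad\text{subject to}\quad f(x) = 0
\]

\[
x = \langle x \rangle + [\mathrm{cov}\, x]\, F^{T} \left[\, F\, [\mathrm{cov}\, x]\, F^{T} \,\right]^{-1}
\left\{\, F\,[\, x - \langle x \rangle \,] - f(x) \,\right\},
\qquad F_{ij} = \frac{\partial f_i}{\partial x_j}
\]

Since x appears in three places on the right-hand side (in F, in f(x), and in [x − ⟨x⟩]), iterate:

\[
x^{(p+1)} = \langle x \rangle + [\mathrm{cov}\, x]\, F_{p}^{T} \left[\, F_{p}\, [\mathrm{cov}\, x]\, F_{p}^{T} \,\right]^{-1}
\left\{\, F_{p}\,[\, x^{(p)} - \langle x \rangle \,] - f(x^{(p)}) \,\right\}
\]

where F_p and f are evaluated at the old value x(p).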

special case of an explicit theory: f(x) = d − g(m)

equivalent to solving, using simple least squares,
the weighted least squares generalized inverse with a linearized data kernel

Newton's Method, but making E + L small,
not just E small
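A sketch of what the iteration reduces to in this special case, writing Gp for the linearized data kernel ∂gi/∂mj evaluated at m(p) and using the block-diagonal covariance above:

\[
m^{(p+1)} = \langle m \rangle +
\left[\, G_{p}^{T} [\mathrm{cov}\, d]^{-1} G_{p} + [\mathrm{cov}\, m]^{-1} \,\right]^{-1}
G_{p}^{T} [\mathrm{cov}\, d]^{-1}
\left\{\, d - g(m^{(p)}) + G_{p}\,[\, m^{(p)} - \langle m \rangle \,] \,\right\}
\]

i.e. a weighted least squares generalized inverse applied to the linearized problem, which minimizes the total objective E + L (prediction error plus weighted distance of m from ⟨m⟩), not E alone.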

Part 3

The Gradient Method

What if you can compute E(m) and ∂E/∂mp

but you can't compute ∂g/∂mp or ∂²E/∂mp∂mq ?

[Figure: E(m) with the current estimate mnest and the global minimum mGM; the local slope of E at mnest points toward the minimum.]

you know the direction towards the minimum, but not how far away it is

unit vector pointing towards the minimum

so the improved solution would be

if we knew how big to make α
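The missing formulas, sketched: a unit vector ν pointing downhill and the corresponding update, with the step size α still to be chosen:

\[
\nu = -\,\frac{\nabla E(m^{(p)})}{\left|\, \nabla E(m^{(p)}) \,\right|},
\qquad
m^{(p+1)} = m^{(p)} + \alpha\, \nu
\]

(this is the v = -dEdm / sqrt(dEdm'*dEdm) of the code below).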

Armijo's rule provides an acceptance criterion for α

with c ≈ 10⁻⁴

simple strategy: start with a largish α

and divide it by 2 whenever it fails Armijo's rule
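A sketch of the rule as it appears in the code below, where c is the constant c1 = 0.0001 and the halving corresponds to tau = 0.5:

\[
\text{accept } \alpha \;\text{ if }\;
E(m^{(p)} + \alpha\, \nu) \;\le\; E(m^{(p)}) + c\, \alpha\, \nu^{T} \nabla E(m^{(p)})
\]

Because ν is a downhill direction, νᵀ∇E < 0, so the test demands a sufficient decrease in E; when it fails, α is halved and the test is repeated.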

[Figure: the gradient-method solution of the sample problem — (A) the observed data d(x); (B) the error surface E(m1, m2) with the trajectory of the iterates; (C) the convergence of the error E and of m1 and m2 versus iteration.]

% error and its gradient at the trial solution
mgo = [1,1]';
ygo = sin( w0*mgo(1)*x ) + mgo(1)*mgo(2);
Ego = (ygo-y)'*(ygo-y);
dydmo = zeros(N,2);
dydmo(:,1) = w0*x.*cos(w0*mgo(1)*x) + mgo(2);
dydmo(:,2) = mgo(1)*ones(N,1);
dEdmo = 2*dydmo'*(ygo-y);

% initial step size, Armijo constant, and backstep factor
alpha = 0.05;
c1 = 0.0001;
tau = 0.5;

Niter = 500;
for k = [1:Niter]
    % unit vector in the downhill direction
    v = -dEdmo / sqrt(dEdmo'*dEdmo);
    % backstep: shrink alpha until Armijo's rule is satisfied
    for kk = [1:10]
        mg = mgo + alpha*v;
        yg = sin(w0*mg(1)*x) + mg(1)*mg(2);
        Eg = (yg-y)'*(yg-y);
        dydm = zeros(N,2);
        dydm(:,1) = w0*x.*cos(w0*mg(1)*x) + mg(2);
        dydm(:,2) = mg(1)*ones(N,1);
        dEdm = 2*dydm'*(yg-y);
        if( Eg <= (Ego + c1*alpha*v'*dEdmo) )
            break;
        end
        alpha = tau*alpha;
    end
    % change in solution
    Dmg = sqrt( (mg-mgo)'*(mg-mgo) );
    % update
    mgo = mg; ygo = yg; Ego = Eg; dydmo = dydm; dEdmo = dEdm;
    % stop when the solution has converged
    if( Dmg < 1.0e-6 )
        break;
    end
end

often, the convergence is reasonably rapid

exception: when the minimum lies along a long, shallow valley