Data Assimilation

Tristan Quaife, Philip Lewis

What is Data Assimilation?

A working definition:

The set techniques the combine data with some underlying process model to provide optimal estimates of the true state and/or parameters of that model.

What is Data Assimilation?

It is not just model inversion.

But could be seen as a process constraint on inversion (e.g. a temporal constraint)

e.g. Use EO data to constrain estimates of terrestrial C fluxes

Terrestrial EO data: no direct constraint on C fluxes

Combine with model

Data Assimilation is Bayesian

• Bayes’ theorem:

P(A|B) =P(B|A) P(A)

What does DA aim to do?

Use all available information about The underlying model The observations The observation operator

Including estimates of uncertainty and the current state of the system

To provide a best estimate of the true state of the system with quantified uncertainty

Kalman Filter DA: MODIS LAI product

Data assimilation into DALEC ecological model

Lower-level product DA

Ensure consistency between model and observations

Assimilate low-level products (surface reflectance)

Uncertainty better quantified

Need to build observation operator relating model state (e.g. LAI) to reflectance

Example of Oregon (MODIS DA)

Quaife et al. 2008, RSE

Modelled vs. observed reflectance

Red NIRNote BRF shape in red: can’t simulate with 1-D canopy (GORT here)

NEP results

No assimilation

Assimilating MODIS

(red/NIR)

DALEC model calibrated from flux measurements at tower site 1

Integrated flux predictions

Flux (gC.m-2)

Assimilated data

3yr totalStandardDeviation

No assimilation 240.2 212.2

MODIS B1 & B2 373.0 151.3

Williams et al. (2005)

406.0 27.8

No assimilation 1646.4 834.5

MODIS B1 & B2 2620.3 96.8

2170.3 18.1

Flux (gC.m-2)

Assimilated data

TotalStandardDeviation

Assimilation exc. snow

373.0 151.3

Assimilation inc. snow

404.8 129.6

406.0 27.8

Assimilation exc. snow

2620.3 96.8

Assimilation inc. snow

2525.6 42.7

2170.3 18.1

Mean NEP for 2000-2002

gC/m2/year

4.5 km

Flux Tower

Spatial average = 50.9

Std. dev. = 9.7

(gC/m2/year)

NEP – Site2 (intermediate) parameters, with/without DA

Model running at Site 2, Oregon

Site 1 model EO-calibrated at site 2NEP observations from Site 2

Shows ability to spatialise

Data assimilation

Low-level DA can be effective

‘easier’ data uncertainties

Can be applied to multiple observation types

Requires Observation operator(s)

RT models

Requires other uncertainties

Ecosystem Model

Driver (climate)

Observation operator

Specific issues in land EOLDAS

No spatial transfer of information Require full spatial coverage Atmosphere dealt with by an instantaneous retrieval

(i.e. no transport model) All state vector members influence observations

We are not interested in other variables!

Sequential Smoothers Variational

Nominal classification of DA

Kalman Filter Variants - EKF

Ensemble Kalman Filter Variants – Unscented EnKF

Particle filters Lots of different types true MCMC technique

Sequential methods

• Propagation step:

x = Mx-

P = MP-MT + Q• Analysis step:

x* = x + K( y – Hx )

K = PHT( HPHT+R )-1

The Kalman filter

State vectorModel

Covariancematrix

Stochastic forcingKalman

gainObservation

vectorObservation covariance

matrix

Observation operator

The Kalman Filter

• Linear process model

• Linear observation operator

• Assumes normally distributed errors

x = m(x-)

P = M'P-M'T + Q• Analysis step:

x* = x + K( y – h(x) )

K = PH'T( H'PH'T+R )-1

The Extended Kalman filter

Jacobian matrix

The Extended Kalman Filter

• Linear process model

• Linear observation operator

• Problem with divergence

X = m(X-) + Q

no explicit error propagation• Analysis step:

X*= X + K( D – HX )

K = PHT( HPHT+R )-1

The Ensemble Kalman filter

State vector ensemble

Perturbed observations

The Ensemble Kalman Filter

• P estimated from X

• Non linear observations using augmentation:

Xa = h(X) X

No assimilation

Assimilating MODIS surface

reflectance bands 1 and 2

• Avoids use of Jacobian matrices

– Some degree of relaxation of this assumption

• Augmentation assumes local linearisation

Particle Filters

X = m(X-) + Q

• Analysis step:

e.g. Metropolis-Hastings

Particle Filters

No available observations

Particle Filters

• Fully Bayesian

– No underlying assumptions about distributions

• Theoretically the most appealing choice of sequential technique, but…

• Our analysis show little difference with EnKF

• Potentially requires larger ensemble

– But comparing 1:1 is faster than EnKF

Sequential techniques

• General considerations:

– Designed for real time systems

– Only consider historical observations

– Only assimilates observations in single time step

– Can lead to artificial high frequency components

• Extension of sequential techniques

• All observations effect every time step

• Analogous to weighting on observations

– [ smoothing-convolution / regularisation ]

• Difficult to apply in rapid change areas

Smoothers

Smoothers - regularisation

x = (HTR-1H + λ2BTB)-1HTR-1y

B is the required constraint. It imposes:

Bf = 0and the scalar λ is a weighting on the constraint.

Constraint matrix

Lagrange multiplier

Regularisation

Quaife and Lewis (2010) Temporal constraints on linear

BRDF model parameters. IEEE TGRS, in press.

Regularisation

• Lots of literature on the selection of λ

– Cross validation etc

• Permits insight into the form of Q

Variational techniques

• Expressed as a cost function

• Uses numerical minimisation

• Gradient descent requires differential

• Traditionally used for initial conditions

– But parameters may also be adjusted

J(x) = ( x-x- ) P-1 ( x-x- )T +

( y-h(x) ) R-1 ( y-h(x) )T

Background

Observations

• No temporal propagation of state vector

– OK for zero order approximations

– Unable to deal with phenology

J(x) = ( x-x- ) P-1 ( x-x- )T +

( y-h(xi) ) R-1 ( y-h(xi) )TΣi

Time varying state vector

Variational techniques

• Parameters constant over time window

• Non smooth transitions

• Assumes normal error distribution

• Size of time window?

• For zero-order case 3DVAR = 4DVAR

– 4DVAR for use with phenology model

• Absence of Q - propagation of P?

Building an EOLDAS

• Lewis et al. (RSE submitted)

• Sentinel-2

EOLDAS

Assimilation

Assume model

Uncertainty known

EOLDAS

Base level noise

Cross validation

EOLDAS

Cross validation

Double noise

EOLDAS

Double noise

Conclusions - technique

• DA is optimal way to combine observations and model

• Range of options available for DA

• Sequential

• Smoothers

• Variational

• Require understanding of relative uncertainties of model and observations

• Require way of linking observations and model state

• Observation operator (e.g. RT)

References

• P. Lewis et al. (2010 submitted) RSE, An EOLDAS

• T. Quaife, P. Lewis, M. DE Kauwe, M. Williams, B. Law, M. Disney, P. Bowyer (2008), Assimilating Canopy Reflectance data into an Ecosystem Model with an Ensemble Kalman Filter, Remote Sensing of Environment, 112(4),1347-1364.

• T. Quaife and P. Lewis (2010) Temporal constraints on linear BRF model parameters IEEE Transactions on Geoscience and Remote Sensing doi: 10.1109/TGRS.2009.2038901

• http://www.ecmwf.int/newsevents/training/rcourse_notes/DATA_ASSIMILATION/ASSIM_CONCEPTS/Assim_concepts11.html

• http://www.cs.unc.edu/~welch/kalman/

Data Assimilation

Documents

Transcript of Data Assimilation

Radiation Belt Data Assimilation: Overview and …gem.epss.ucla.edu/mediawiki/tutorial/2011/2011_Koller...Data Assimilation Basics Data assimilation is combining data with model using

Data Assimilation vs Data Mining

Data Assimilation

Surface data assimilation

Hydrologic Data Assimilation - Dr. Paul R. Houser · hydrologic data assimilation. Data Assimilation combines observations into a dynamical model, using the model’s equations to

Ensemble-Based Data Assimilation · at data assimilation time (cont’d) • Knowing statistics of Qcrucial • Mitchell and Houtekamer: estimate parameters of Q from data assimilation

Data Assimilation Theory CTCD Data Assimilation Workshop Nov 2005

Hydrologic Data Assimilation

Challenges and practical applications of data assimilation in numerical weather prediction Data Assimilation Education Forum Part I: Overview of Data Assimilation.

Data Assimilation Systems

Coupled Data Assimilation

What is Data Assimilation ? Data Assimilation: Data assimilation seeks to characterize the true state of an environmental system by combining information.

Geogg124: Data assimilation

About data assimilation

Radar Data Assimilation

Lecture 1 on data assimilation: Elementary principles of ...cerea.enpc.fr/HomePages/bocquet/docs/ihp2019/lect1.pdf · Data assimilation: principles Introduction Data assimilation:

Data Assimilation - European Space Agency · 2013. 10. 23. · Data Assimilation Alan O’Neill Data Assimilation Research Centre University of Reading. Contents • Motivation •

Data assimilation concepts and methods March 1999 - · PDF file16.1 Theorem: use of Hessian ... ocean circulation models, ... Data assimilation concepts and methods, = Data assimilation

PERSPECTIVES FROM THE GLOBAL OCEAN DATA ASSIMILATION ... · Keywords : Ocean, data assimilation, observations, prediction. 1. Introduction The concept of a Global Ocean Data Assimilation

Introduction to Data Assimilation NCEO Data-assimilation training days 11-12 July 2011