Spatio-chromatic image content descriptors and their analysis using Extreme Value theory
An Introduction to Extreme Value Theory
Transcript of An Introduction to Extreme Value Theory
An Introduction to Extreme Value Theory
Petra Friederichs
Meteorological InstituteUniversity of Bonn
COPS Summer School, July/August, 2007
Applications of EVT
Finance• distribution of income has so called fat tails• value-at-risk: maximal daily lost• re-assurance
Hydrology• protection against flood• Q100: maximal flow that is expected once every 100 years
Meteorology• extreme winds • risk assessment (e.g. ICE, power plants)• heavy precipitation events• heat waves, hurricanes, droughts• extremes in a changing climate
In classical statistics: focus on AVERAGE behavior of stochastic process
central limit theorem
In extreme value theory:focus on extreme and rare events
Fisher-Tippett theorem
What is an extreme?
from Ulrike Schneider
Application
Extreme Value Theory
Block Maximum
for
follows a Generalized Extreme Value (GEV) distribution
Peak over Threshold (POT)
very large threshold u
follow a Generalized Pareto Distribution (GPD)
Poisson-Point GPD Process
combines POT with Poisson point process
M n=max {X 1 , , X n}
M n
n∞
{X i−u∣X iu}
Extreme Value Theory
Block Maximum
Example: station precipitation (DWD) at Dresden
1948 – 2004: 57 seasonal (Nov-March and May-Sept) (maxima over approximately 150 days)
M n=max {X 1 , , X n}
DresdenSummer
Winter
GEV – Fisher-Tippett Theorem
G y=exp −[1y−]−1 /
The distribution of
converges to ( )
which is called the Generalized Extreme Value (GEV) distribution.
It has three parameters
location parameter
scale parameter
shape parameter
M n=max {X 1 , , X n}
n∞
G y=exp −exp −y−
≠0
=0
GEV – Types of Distributions
GEV has 3 types depending on shape parameter
Gumbel
Fréchet
Weibull
=0
G x =exp−exp[−x ]
x=y−
=1 /0
G x =exp−[1x]−
=−1 /0
G x =exp−[1−x]
GEV – Types of Distributions
GEV has 3 types depending on shape parameter
Gumbel
exponential tail
Fréchet
so called „fat tail“
Weibull
upper finite endpoint
=0
x=y−
=1 /0
=−1 /0
GEV – Types of Distribution
Conditions to the sample from which the maxima are drawn
must be independently identically distributed (i.i.d.)
Let be the distribution of
is in the domain of attraction of a Gumbel type GEV iff
for all
Exponential decay in the tail of
{X 1 , , X n}
X i
X iF x
F x
limx∞
1−F xtb x 1−F x
=e−t
t0F x
F(x)p=0.9p=0.99
x0.9 x0.99
GEV – Types of Distribution
Conditions to the sample from which the maxima are drawn
must be independently identically distributed (i.i.d.)
Let be the distribution of
is in the domain of attraction of a Frechet type GEV iff
for all
polynomial decay in the tail of
{X 1 , , X n}
X i
X iF x
F x
limx∞
1−F x 1−F x
=−1/
0F x
GEV – Types of Distribution
Conditions to the sample from which the maxima are drawn
must be independently identically distributed (i.i.d.)
Let be the distribution of
is in the domain of attraction of a Weibull type GEV iff
there exists with
and
for all
has a finite upper end point
{X 1 , , X n}
X i
X iF x
F x
limx∞1−F F−
1 x1−F F−
1x−1
=1/
0F x
F F =1F
F
GEV – return level
zm= − log −log 1−1/m
Of interest often is return level
value expected every m observation (block maxima)
calculated using invers distribution function (quantile function)
can be estimated empirically as the
zm
Prob yzm=1−G yzm=1m
zm= G−11−1m = inf y ∣F y
1m
zm=G−11−
1m = −
1−−log 1−1/m−
DresdenSummer
Winter
Maximum daily precipitation for Nov-March (Winter) and May-Sept (Summer)
Block Maxima
Block Maxima
Block Maxima
Peak over Threshold
Peak over Threshold (POT)
very large threshold u
follow a Generalized Pareto Distribution (GPD)
{X i−u∣X iu}
Daily precipitation for Nov-March (green) and May-Sept (red) Dresden
Peak over Threshold
Peak over Threshold (POT)
The distribution of
exceedances over large threshold u
are asymptotically distributed following a
Generalized Pareto Distribution (GPD)
two parameters
scale parameter
shape parameter
advantage: more efficient use of data
disadvantage: how to choose threshold not evident
Y i :=X i−u∣X iu
H y∣X iu=1−1yu−1/
POT – Types of Distribution
GDP has same 3 types as GEV depending on shape parameter
Gumbel
exponential tail
Pareto (Fréchet)
polynomial tail behavior
Weibull
has upper end point
=0
H y =1−exp−yu
0
1−H y ~c y−1/
0
F=u∣∣
Peak over Threshold
POT: threshold u = 15mm
Poisson Point – GPD Process
A= t 2−t1 [1y−u u
]−1/
Poisson point – GPD process with intensity
on
A= t1, t2 × y ,∞
Estimation and Uncertainty
General concept of estimating parameters from a sample
Maximum Likelihood (ML) Method
Assume are draw from a GEV (GPD,...) with unknown parameters
and PDF
{ yi} , i=1, , n
yi~F y∣ , ,
{ yi}
f y∣ , ,=F ' y∣ , ,
Maximum Likelihood Method
L , ,=∏i=1
nf yi∣ , ,
The likelihood L of the sample is then
It is easier to minimize the negative logarithm of the likelihood
in general there is no analytical solution for the minimum with respect to the parameters
Minimize using numerical algorithms.
The estimates maximize the likelihood of the data.
l , ,=−∑i=1
nlog f yi∣ , ,
, ,
Profile log Likelihood
To Take Home
There exists a well elaborated statistical theory for extreme values.
It applies to (almost) all (univariate) extremal problems.
EVT: extremes from a very large domain of stochastic processes follow one of the three types: Gumbel, Frechet/Pareto, or Weibull
Only those three types characterize the behavior of extremes!
Note: Data need to be in the asymptotic limit of a EVD!
References
Coles, S (2001): An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics. Springer Verlag London. 208p
Beirlant, J; Y. Goegebeur; J. Segers; J. Teugels (2005): Statistics of Extremes. Theory and Applications. John Wiley & Sons Ltd. 490p
Embrechts, Küppelberg, Mikosch (1997): Modelling Extremal Events for Insurance and Finance. Springer Verlag Heidelberg.648p
Gumbel, E.J. (1958): Statistics of Extremes. (Dover Publication, New York 2004)
R Development Core Team (2003): R: A language and environment for statistical computing, available at http://www.R-project.org
The evd and ismev Packages by Alec Stephenson and Stuard Coles