Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a...

54
Robust Statistics for Signal Processing Abdelhak M Zoubir Signal Processing Group Technische Universit¨ at Darmstadt email: [email protected] URL: www.spg.tu-darmstadt.de The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 1 SPG

Transcript of Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a...

Page 1: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust Statistics for Signal Processing

Abdelhak M Zoubir

S i g n a l P r o c e s s i n g G r o u p

Signal Processing Group

Technische Universitat Darmstadt

email: [email protected]

URL: www.spg.tu-darmstadt.de

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 1 SPG

Page 2: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Motivation for Robust Statistics

Common assumptions such as Gaussianity, linearity and independence are

only approximations to reality.

Robust statistics is a body of knowledge, partly formalised into

’theories of robustness’ relating to deviations from idealised

assumptions in statistics [Hampel et al. (1986)].

Robust statistics may be separated into two distinct but related areas

Robust Estimation - A robustification of classical estimation theory (point

estimation) Robust Testing - A robustification of the classical theory of statistical

hypothesis testing (interval estimation)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 2 SPG

Page 3: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Motivation for Robust Statistics (Cont’d)

Often engineering systems, such as in communication, are based on a

parametric model where the observations are assumed to be Gaussian

System performance is dependent on the accuracy of this model.

Classical parametric statistics are optimal under exact parametric models. Performance becomes more uncertain the further we are from the assumed

model. Incorrect model leads to a performance decrease to an uncertain level.

Systems designed using parametric models are very sensitive to deviations

from the assumed model [Hampel et al. (1986), Huber (1981)].

Solution: Robust estimation of parametric models

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 3 SPG

Page 4: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Motivation for Robust Statistics (Cont’d)

In general, the aims of robust statistics are to:

describe (or fit a model to) the majority of the data

identify (and deal with) outliers or influential points

How far away from Gaussianity are we in reality? Experience shows [Hampelet al. 1986] that

high quality data sets may contain up to 1% outliers,

low quality data sets may contain more than 10% outliers,

1 − 10% outliers is common.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 4 SPG

Page 5: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Example: Electricity consumption data

0 500 1000 15003

3.5

4

4.5

5

5.5

6x 10

4

Time (half−hours)

Ele

ctric

al C

onsu

mpt

ion

(MW

)

Non−working days Normal week

Half-hourly daily French electricity

consumption on April 27th to June 1st ,

2007

0 50 100 150 200 250 300 350 400−3

−2

−1

0

1

2

3x 10

4

Ele

ctric

al C

onsu

mpt

ion

Time (days)

One-week differenced French load

seasonal time series at 10:00, 2009

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 5 SPG

Page 6: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Motivation for Robust Statistics (Cont’d)

Nonparametric Robust Parametric

Description Model specified in

terms of general

properties

Parametric mo-

del allowing for

deviations

Model completely

specified by several

parametersIdeal Performance Mediocre/Satisfactory Good Very Good/Excellent

Range of Validity Large Medium Small

Nonparametric Robust Parametric

Robust is the most appropriate approach for real-life applications

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 6 SPG

Page 7: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust Statistics for Signal Processing

Robust Estimation

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 7 SPG

Page 8: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Overview

Measures of Robustness: Quantitative and Qualitative robustness

Location Estimation

Linear Regression Models

Correlated Data

Signal Processing Applications

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 8 SPG

Page 9: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Quantitative Robustness

Define the ’neighborhood’ using e.g. the ε contaminated mixture model (or

’gross error model’)

F = F |F = (1 − ε)F0 + εH,

where F0 is the nominal distribution and H is the contaminating distribution.

Consider

1. maximum bias

b(ε) = supF∈F

|Θ(F ) − Θ(F0)|

2. maximum variance

v(ε) = supF∈F

AV (F , Θ)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 9 SPG

Page 10: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Quantitative Robustness (Cont’d)

Then the asymptotic breakdown point ε∗ of an estimator at F0 is

ε∗(F0, Θ) = supε|b(ε) <∞.

Loosely speaking, it gives the limiting fraction of gross errors (outliers) the

estimator can cope with (for details, see [Huber (1981) Section 1.4], [Hampel

(1986) Section 2.2]). In many cases ε

∗ does not depend on F0 and is often the same for all the

usual choices for F .

Maximum bias curve plots the maximum bias (b(ε)) of an estimator with

respect to the fraction of contamination ε

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 10 SPG

Page 11: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Quantitative Robutsness: An Example

0 0.1 0.2 0.3 0.4 0.50

0.5

1

1.5

2

2.5

3

fraction of outliers (ε)

max

imum

bia

s M

B(ε

)

median

BP BP

25 % trimmed mean

Maximum bias curves at F0 = Φ. Beyond the BP (ε∗), the maximum bias is

infiniteThe Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 11 SPG

Page 12: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Qualitative Robustness

The Influence function, introduced as influence curve [Hampel (1968,1974)],

describes the effect (on an estimator Θ) of adding an observation of value x

to a large sample; asymptotically, it is defined by

IF (x , F , Θ) = lim∆→0

Θ((1 − ∆)F − ∆δ(x)) − Θ(F )

where δ(x) denotes the point mass 1 at x .

Roughly speaking, it is the first derivative of a statistic at F , where x plays

the role of the contamination position.

It measures the normalized asymptotic bias caused by an infinitesimal

contamination at point x in the observations [Hampel (1986)].

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 12 SPG

Page 13: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Qualitative Robustness: An Example

−3 −2 −1 0 1 2 3−3

−2

−1

0

1

2

3

√π2

−√π2

IF(z

,Φ,Θ

)

z

mean

median

M-estimator

Influence functions of three estimators for the standard normal distribution Φ

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 13 SPG

Page 14: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Trade-off Robustness vs. Efficiency

Robustness: resistance of the estimator towards contamination quantified by:

Breakdown Point ε∗, Maximum Bias Curve b(ε), and Influence Function

IF (x , F , Θ).

Efficiency: the asymptotic behavior and variance of the estimator under the

nominal model (clean data) quantified by AV (F , Θ) or IF (x , F , Θ).

AV (F , Θ) =

∫IF (x , F , Θ)2dF (x)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 14 SPG

Page 15: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Location Estimation

Consider the model

Xt = µ+ εt

under a distribution F such that X ∼ F (x − µ). We wish to estimate µ, given i.i.d

Xt , t = 1, ... , n. The Maximum likelihood estimate is

µML = arg maxµ

n∑

t=1

log f (xt − µ)

⇒ ∑ni=1 ψ(xi − µML) = 0 where ψ = f ′/f

F standard Gaussian: µ is the sample mean

F double Exponential: µ is the sample median

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 15 SPG

Page 16: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Example of the Effect of Outliers

unbounded bias

unbounded bias

sample mean

sample median

sample median

sample median

true value

BP=0

BP=0.5

Effect of outliers on the bias of the sample mean and sample median

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 16 SPG

Page 17: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Example: Theoretical Robustness and Efficiency

of the Sample Median

IF(x,Φ,Med) bounded ⇒ sample median robust. Its BP ε∗= 50 %

0 500 1000 1500 20000.55

0.6

0.65

0.7

0.75

0.8

Monte Carlo efficiencyTheoretical efficiency

sample size (n) of a Gaussian sample

Effi

cien

cyofth

em

edia

n

Median minimizes the maximum bias [Huber (1981)]. When F0 = Φ,

efficiency of the Median is 2/π = 0.64 ⇒ suggests M-estimatorsThe Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 17 SPG

Page 18: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

M-Estimator

The MLE is asymptotically optimal (unbiased, consistent, achieves CRB) only

if model is correct. If model is incorrect, performance uncertain → possibly

not robust.

Huber’s approach generalises the MLE of a parameter θ of interest for

independent observations by replacing fXt(xt |θ) by an arbitrary function

ρ(xt , θ).

arg maxθ

N∑

t=1

log fXt(xt |θ) → arg max

θ

N∑

t=1

ρ(xt , θ)

This estimator is called M-estimator (ML-type estimator).

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 18 SPG

Page 19: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

M-Estimator (Cont’d)

Let ψ be the derivative of ρ, then

arg maxθ

N∑

t=1

log fXt(xt |θ) → arg max

θ

N∑

t=1

ρ(xt , θ)

leads to

N∑

t=1

d log fXt(xt |θ)

dθ= 0 →

N∑

t=1

ψ(xt , θ) = 0.

ψ is called the score function since it ’scores’ each observation xt .

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 19 SPG

Page 20: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust M-estimators

The class of M-estimators contains in particular

The sample mean The sample median All maximum likelihood estimators

Huber asked the following question

How does one make an M-estimator robust?

Implies determining fX , or equivalently ψ, so that the M-estimator is robust

An M-estimator is qualitatively robust if and only if ψ is bounded and

continuous

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 20 SPG

Page 21: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust M-estimators

−3 −2 −1 0 1 2 30

1

2

3

4

5

6

7

8

9

10

x

ρ(x)

ρ(x)

MeanHuber type M−estimatorMedian

(a) ρ-function

−3 −2 −1 0 1 2 3−3

−2

−1

0

1

2

3

x

ψ(x

)

ψ(x)

MeanHuber type M−estimatorMedian

(b) score function ψ

Huber M-estimator is qualitatively robust (ψ or IF bounded and continuous).

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 21 SPG

Page 22: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Linear Regression

Assume the model

Y = Xβ + e

where X and e are independently distributed random variables. Let Y(t) = yt and

x′t be the tth row of the matrix X.

Least-Squares Estimator (LSE)

β = argminβ

n∑

t=1

rt(β)2

rt(β) = yt − x′tβ, being the tth residual. The LSE solves for

n∑

t=1

rt(β)xt = 0

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 22 SPG

Page 23: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Effect of Outliers

The regression line is tilted by the outliers

x-outlier

y-outlier

bias

xt

yt

LSE

LSE

without outliers

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 23 SPG

Page 24: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

M-Estimation for Regression

M-Estimator:

β = argminβ

n∑

t=1

ρ

(rt(β)

σr

)⇒

n∑

t=1

ψ

(rt(β)

σr

)xt = 0

A leverage point (‖xt‖ large) will dominate the equation. Here, σr is a robust

scale of the residuals rt .

Solution:

Redescending function ψ: MM estimator

Down-weight leverage points: GM estimator

Other robust estimators are: Residual Autocovariance (RA-), Least Median

Squares (LMS), Least Trimmed Squares (LTS), S-, τ -Estimators.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 24 SPG

Page 25: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Leverage Points in Regression

LSE and M-Estimator are not qualitatively robust

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 25 SPG

Page 26: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust Methods for Correlated Data

Difficulty introduced by the correlation

Several types of outliers: Additive, Innovative, patchy, isolated, ...

Different definitions of robustness measures

Estimators need to be adapted to deal with correlation

⇓Only a few methods exist

2 Definitions of influence function in the correlated data case ([Kunsch

(1981)] and [Martin (1981)])

Breakdown Point (ε∗) still not clearly defined [Genton (2010)]

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 26 SPG

Page 27: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Some Robust Estimators for ARMA Models

CML: Cleaned Maximum Likelihood after ’3-σ’ rejection (practical

engineering method) ⇒ it neglects the correlation, bad performance.

Generalized-M (GM): used in Power Systems [Maronna et al. (2006)], not

robust for ARMA; for AR(p), BP (ε∗) decreases with increasing p.

Residual Autocovariance (RA) and truncated RA estimators (TRA) [Bustos

and Yohai (1986)]. RA is not robust and TRA lacks efficiency for ARMA.

Filtered-M: M-estimator combined with robust filter [Maronna (2006)].

Robust but lacks efficiency.

Filtered-τ [Maronna (2006)], Ratios of Medians (RME), Medians of Ratios

(MRE) and Filtered Hellinger based estimator [Chakhchoukh (2010)].

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 27 SPG

Page 28: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Importance of Robust Filtering

For an AR(p), a residual rt is evaluated by regressing yt on the p variables

yt−1, ... , yt−p. An observation yt is used in computing p + 1 residuals: rt , ... , rt+p

⇒ BP (ε∗) decreases significantly.

For an ARMA model, one observation will affect all the residuals ⇒ BP ε∗ = 0.

Problem: Propagation of outliers in the data used in the estimation

Solution: use robust residuals computed by a robust filter cleaner [Masreliez

(2010)]:

rt = yt − φ1yt−1|t−1 − ... − φp yt−p|t−1

Cleaning the data with a robust filter improves efficiency

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 28 SPG

Page 29: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust Filtering of an AR(1)

For an AR(1): Xt = φXt−1 + εt , we use the following robust filtering algorithm:

estimate robustly φ, e.g.: φ = Cr (1)/Cr (0) and run the recursions of the

filter-cleaner; X1|1 = X1, P1|1 = MADN(Xt),

Prediction :

X2|1 = φX1|1;

ε2 = Y2 − φX1|1

P2|1 = φ2P1|1 + σε2

Correction :

X2|2 = X2|1 + 1√P2|1

P2|1ψ

(bε2√P2|1

)

P2|2 = P2|1 − 1P2|1

P22|1w

(bε2√P2|1

)

Go to the next step of the recursion.

Apply the ML to the filtered series Xt|t or:

Test to remove the outliers, e.g.: If ε2 > 3√

P2|1 then X2 is outlying

Apply ML estimator that handles missing data.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 29 SPG

Page 30: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Applications

Wireless Communications: Robust Geolocation, eg: [Hammes (2009)]

Array Processing: Robust Direction of Arrival Estimation, eg: [Tsakalides

(1995)]

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 30 SPG

Page 31: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Wireless Communications: Robust geolocation

Geolocation refers to identifying the position of a mobile terminal using a

network of sensors. Applications for Geolocation arise e.g. in emergency call services, yellow page

services and intelligent transport systems [Caffery (1999)]. We consider wireless positioning of a stationary terminal based on TOA

estimates. At least three sensors/BSs are needed to solve ambiguities

line-of-sight (LOS) non-line-of-sight (NLOS)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 31 SPG

Page 32: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Problem Statement

Time of arrival (TOA) estimates are multiplied by the speed of light to obtain

the measured distances

rm =√

(xm − x)2 + (y − ym)2︸ ︷︷ ︸=hm

+vm, m = 1, ... , M ,

where xm, ym are the known coordinates of the BS and x , y describe the

unknown location of the MT. The i.i.d. random variables vm have pdf

fV (v) = (1 − ε)N (v ; 0,σG ) + εH,

describing sensor noise and errors due to NLOS propagation (H = fG ∗ fη)

where fη may be any pdf with positive mean such that EH > 0.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 32 SPG

Page 33: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Linearization

Squaring the nonlinear equation yields

r2m = h2

m + 2hmvm + v2m︸ ︷︷ ︸

=vm(hm)

m = 1, ... , M

For M BSs we have

r = Sθ + v,

where θ = [x y R2]T with R2 = x2 + y2.

Since fV (v) is non-Gaussian and contains outliers due to NLOS, least-squares

estimation suffers from a performance loss.

⇓Robust methods

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 33 SPG

Page 34: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Iterative Robust Algorithm

1. Initialisation: Set i = 0. Obtain an initial estimate of θ, θ0.

2. Determine residuals: v = r − Sθi.

3. Estimate λ, perform transformation KDE.

4. Estimate score function: ϕ = −f ′V (v)

fV (v).

5. Update: θi+1

= θi+ µ(STS)−1STϕ(v) or (STΩS)−1STΩr, Ω = diag(ω), ω = |ϕ(v)/v|

6. Check for convergence: If‖θ i+1−θ i‖

‖θ i+1‖< ξ stop, otherwise set i → i + 1 and go to step 2.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 34 SPG

Page 35: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Simulation Settings

Consider 10 BSs each of them collecting 10 measurements.

We compare least-squares (’LS’) with Huber’s M-estimator (’Hc ’) where the

clipping point c = 0.6σV , where σV is estimated using the median absolute

deviation.

Semi-parametric estimators using Newton-Raphson algorithm labeled as

’SPMR’ and the one based on weighted least-squares is ’SPWLS’.

We average over MC = 10, 000 Monte-Carlo runs and σG = 150m.

Performance measure is the mean error distance, i.e.,

MED =1

MC

MC∑

i=1

√(x − xi )2 + (y − yi)2

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 35 SPG

Page 36: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Simulation results

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 120

40

60

80

100

120

140

160

LSH

c

SPMRSPWLS

ε

Mea

ner

ror

dista

nce

inm

MED vs. degree of NLOS contamination. fη is an exponential distribution with

ση = 409m

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 36 SPG

Page 37: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Array Processing: Robust Direction of Arrival

Estimation

Direction of Arrival (DOA) estimates are needed in array processing

Smart Antennas Space-Time Adaptive Processing Radar

Classical methods for DOA estimation based on sample covariance matrix arenot robust

Beamformer Capon’s minimum variance ML techniques Subspace methods MUSIC, ESPIRIT

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 37 SPG

Page 38: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Array Processing: Robust Direction of Arrival

Estimation

Robust DOA methods exist based on

M-Estimation Symmetric-Alpha-Stable (SaS) distributions and Fractional Lower Order

Moments (FLOMs) Gaussian mixture distributions and Space Alternating Generalised Expectation

Maximisation (SAGE) Nonparametric statistics using the spatial sign function

Former three robust DOA estimators require knowledge of noise

parameters/setting of thresholds/choice of weighting functions

Last nonparametric estimator is simple and requires no prior knowledge or

settings to be chosen

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 38 SPG

Page 39: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Array Processing: Robust Direction of Arrival

Estimation

Problem: Estimate the direction-of-arrivals of the sources using the observations

of a sensor array in an impulsive noise environment

Sensor Array

Source 1

Source 2

Source 3

θ1 θ2

θ3

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 39 SPG

Page 40: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Array Signal Model

Model

yn = Asn + xn, n = 1, ... , N

yn: p-dim snapshot from p array elements

A = (a(θ1), ... , a(θq)): p × q-dim array steering matrix

a(θ): p-dim array steering vector

θ1, ... , θq: directions to the q sources

sn: q-dim element source signal

xn: p-dim spherically symmetric noiseAssumptions

snapshots yn, n = 1, ... , N, are i.i.d. source signal sn and noise xm are independent for all n,m = 1, ... , N q < p to avoid identifiability problems A is of full rank q

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 40 SPG

Page 41: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Conventional DOA estimation: MUSIC

The spatial covariance matrix has the following structure

R = A RsAH + σ2I (1)

From the eigendecomposition of R

R = UΣV (2)

where U = [u1,u2, ... ,uM ] and Σ = diag[λ1,λ2, ... ,λM ].

Construct the signal and the noise subspace, Us and Un, respectively Search for the peaks in the MUSIC pseudo-Spectrum P(θ)

P(θ) =1

‖UHn a(θ)‖2

(3)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 41 SPG

Page 42: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Conventional DOA Estimation

The estimated covariance matrix is given by

R =1

N

N∑

t=1

xxH

⇓Robust Estimation:

Normalize each of the snapshots, called spatial sign function

Trim the corrupted observations, requires hypothesis testing

Robust estimation of spatial covariance matrix, e.g. FLOM, maximum

likelihood estimation. Some possible methods for robust covariance

estimation MCD, MVE, MM-, ...

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 42 SPG

Page 43: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Spatial Sign Function & Robust Covariance

Spatial Sign Function

Spatial sign function (SSF) of a p-variate complex vector x

u(x) =

(

x||x||

x 6= 0

0 x = 0

u is a unit length direction vector Generalises the sign function sgn(x) for 1-D to p-D

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 43 SPG

Page 44: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Spatial Sign Function & Robust Covariance

Sample spatial sign covariance matrix (SCM) of

R1 = E [u(x)uH(x)]

R1 =1

N

N∑

n=1

u(xn)uH(xn)

also known as quadrant correlation Sample spatial tau covariance matrix (TCM) of

R2 = E [u(x − y)uH(x − y)]

R2 =1

N(N − 1)

N∑

n=1

N∑

m=1

u(xn − xm)uH(xn − xm)

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 44 SPG

Page 45: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Simulation results

Setup

Two linear FM signals impinging

on an array of m = 8 sensors in

ULA geometry

DOAs are [−3o 2o]

Total number of snapshots

N = 128

ǫ-contaminated mixture, ǫ = 0.2

and κ = 20

−20 −15 −10 −5 0 5 1010

−2

10−1

100

101

102

SNR (dB)

RM

SE

in d

eg

No Impulsive RemedyRobust TrimingRobust VisuriNon−linear Based MBDMedian Based MBDKnown IF

RMSE DOA Estimation

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 45 SPG

Page 46: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Conclusions

Robust methods are useful tools for estimation in many real world

applications.

Robust statistics for independently and identically distributed data are

well-established.

There exists a need for robust techniques for correlated data: the more

interesting case for a signal processing practitioner.

Optimality has its advantage, but Robustness is the engineer’s interest.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 46 SPG

Page 47: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

Robust Statistics for Signal Processing

References

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 47 SPG

Page 48: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

D.F. Andrews, P.J. Bickel, F.R. Hampel, P.J. Huber, W.H. Rogers and J.W. Tuckey.

Robust Estimates of Location: Survey and Advances.

In Princeton University Press, Princeton, New York, 1972.

F. Hampel, E. Ronchetti, P. Rousseeuw, and W. Stahel.

Robust Statistics, The Approach Based on Influence Functions.

John Wiley, 1986.

P. Huber.

Robust Statistics 1st edition.

Wiley, 1981.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 48 SPG

Page 49: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

F. Hampel, E. Ronchetti, P. Rousseeuw, and W. Stahel.

Robust Statistics, The Approach Based on Influence Functions.

John Wiley, 1986.

P. Huber and E.M. Ronchetti.

Robust Statistics, 2nd edition.

Wiley, 2009.

R.A. Maronna, R.D. Martin and V.J. Yohai.

Robust Statistics Theory and Methods.

John Wiley, 2006.

P. Rousseeuw and A. Leroy.

Robust Regression and Outlier Detection.

John Wiley, 1987.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 49 SPG

Page 50: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

R. Staudte and S. Sheather.

Robust Estimation and Testing.

John Wiley, 1990.

R. Wilcox.

Introduction to Robust Estimation and Hypothesis Testing.

Academic Press, 1997.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 50 SPG

Page 51: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

O.H. Bustos and V.J. Yohai

Robust estimates for ARMA models.

J. Amer. Statist. Assoc., 81:155-168, 1986.

A.J. Fox

Outliers in Time Series.

J. R. Stat. Soc. Ser. B, 34:350-363, 1972.

M.G. Genton and A. Lucas

Comprehensive definitions of breakdown-points for independent and dependent

observations.

J. R. Stat. Soc. Ser. B, 65:81-94, 2003.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 51 SPG

Page 52: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

R.D. Martin and V.J. Yohai

Influence functionals for time series.

Ann. Statist., 14:781-818, 1986.

N. Muler, D-Pena and V.J. Yohai

Robust Estimation for ARMA models.

Ann. Statist., 37(2):816-840, 2009.

H. Kunsch,

Infinitesimal robustness for autoregressive processes.

Ann. Statist., 12:843–863, 1984.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 52 SPG

Page 53: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

C. J. Masreliez, and R. D. Martin.

Robust Bayesian estimation for the linear model and robustifying the Kalman filter.

IEEE Transactions on Automatic Control, 22(3):361–371, 1977.

Y. Chakhchoukh.

Contribution to the robust estimation of SARIMA models. Application to short term load

forecasting of electricity consumption

In Ph.D. Thesis, University of Paris XI, Paris, 2010.

Yin et. al.

Weighted median filters: A tutorial.

IEEE Transactions on Circuits and Systems-II: Analog and Digital Signal Processing,

43(3):157–192, March 1996.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 53 SPG

Page 54: Robust Statistics for Signal Processing DL robust statistics 2011.pdf · Robust statistics is a body of knowledge, partly formalised into ’theories of robustness’ relating to

References

F. Gustafsson and F. Gunnarsson.

Mobile positioning using wireless networks: possibilities and fundamental limitations based

on available wireless network measurements.

IEEE Signal Processing Magazine, 22(4):41–53, 2005.

U. Hammes.

Robust positioning algorithms for wireless networks

In Ph.D. Thesis, TU Darmstadt, Darmstadt, 2009.

P. Tsakalides.

Subspace-based direction finding in alpha-stable noise environments.

In IEEE International Conference on Acoustics, Speech and Signal Processing, pages

2092–5, Detroit, USA, 1995.

The Chinese University of Hong Kong, 21 September 2011 | SPG TUD | A.M. Zoubir | 54 SPG