Sample of slides for Sandsynlighedsregning · 02443–lecture10 2 DTU...

Stochastic Simulation

The Bootstrap method

Bo Friis Nielsen

Institute of Mathematical Modelling

Technical University of Denmark

2800 Kgs. Lyngby – Denmark

Email: [email protected]

02443 – lecture 10, DTU

The Bootstrap method

• A technique for estimating the variance (etc.) of an estimator.

• Based on sampling from the empirical distribution.

• Non-parametric technique.

http://www.dtu.dk


Recall the simple situation

• We have n observations $x_i$, $i = 1, \ldots, n$.

• If we want to estimate the mean value of the underlying distribution, we (typically) just use the estimator $\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i$.

• This estimator has the variance $\frac{1}{n}\mathrm{Var}(X)$. To estimate this, we (typically) just use the sample variance.
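A minimal sketch of the simple situation, assuming Python and its standard `statistics` module (the function name `mean_with_variance` is hypothetical and not from the slides): the variance of the sample mean is estimated by dividing the sample variance by n.

```python
import statistics


def mean_with_variance(xs):
    """Return the sample mean and an estimate of its variance, s^2 / n."""
    n = len(xs)
    mean = statistics.fmean(xs)
    # statistics.variance is the sample variance (divisor n - 1).
    var_of_mean = statistics.variance(xs) / n
    return mean, var_of_mean


# Illustrative data only (not from the slides).
mean, var_of_mean = mean_with_variance([1, 2, 3, 4, 5])
print(mean, var_of_mean)  # 3.0 0.5
```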


A not-so-simple situation

• Assume we want to estimate the median, rather than the mean.

• (This makes much sense w.r.t. robustness)

• The natural estimator for the median is the sample median.

• But what is the variance of the estimator?


The variance of the sample median

If we had access to the “true” underlying distribution, we could

1. Simulate a number of data sets like the one we had.

2. For each simulated data set, compute the median.

3. Finally report the variance among these medians.

We don’t have the true distribution. But we have the empirical distribution!
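The three steps can be sketched as follows for the hypothetical case where the true distribution is known. The choice of N(0, 1), the sample size n = 20, the number of replications and the function name are all assumptions made for illustration.

```python
import random
import statistics


def median_variance_known_distribution(n=20, replications=1000, seed=1):
    """Estimate Var(sample median) by simulating from a known N(0, 1)."""
    rng = random.Random(seed)
    medians = []
    for _ in range(replications):
        # Step 1: simulate a data set like the one we had.
        data = [rng.gauss(0.0, 1.0) for _ in range(n)]
        # Step 2: compute the median of the simulated data set.
        medians.append(statistics.median(data))
    # Step 3: report the variance among the medians.
    return statistics.variance(medians)


print(median_variance_known_distribution())
```

In practice this is not available, because the true distribution is unknown; the sketch only illustrates the idea that the bootstrap imitates.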


Empirical distribution

20 N(0, 1) variates (sorted): -2.20, -1.68, -1.43, -0.77, -0.76, -0.12, 0.30, 0.39, 0.41, 0.44, 0.44, 0.71, 0.85, 0.87, 1.15, 1.37, 1.41, 1.81, 2.65, 3.69



$X_i$ iid random variables with $F(x) = P(X \le x)$.

Each leads to a (simple) random function $F_{e,i}(x) = 1_{\{X_i \le x\}}$, leading to

$$F_e(x) = \frac{1}{n}\sum_{i=1}^{n} F_{e,i}(x) = \frac{1}{n}\sum_{i=1}^{n} 1_{\{X_i \le x\}}$$

$$E\left(F_e(x)\right) = E\left(\frac{1}{n}\sum_{i=1}^{n} 1_{\{X_i \le x\}}\right) = \frac{1}{n}\sum_{i=1}^{n} E\left(1_{\{X_i \le x\}}\right) = F(x)$$

Once we have a sample $x_i$, $i = 1, 2, \ldots, n$, we have a realised version of the empirical distribution function

$$F_e(x) = \frac{1}{n}\sum_{i=1}^{n} F_{e,i}(x) = \frac{1}{n}\sum_{i=1}^{n} \delta_{\{x_i \le x\}}$$

where $\delta$ is Kronecker's delta function, here equal to 1 when $x_i \le x$ and 0 otherwise.
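As a small illustration, the realised empirical distribution function can be evaluated for the 20 variates listed on this slide. This is a sketch in Python; the helper name `ecdf` is an assumption, not something defined in the lecture.

```python
from bisect import bisect_right


def ecdf(sample):
    """Return F_e, where F_e(x) is the fraction of observations <= x."""
    ordered = sorted(sample)
    n = len(ordered)

    def F_e(x):
        # bisect_right counts how many ordered observations are <= x.
        return bisect_right(ordered, x) / n

    return F_e


data = [-2.20, -1.68, -1.43, -0.77, -0.76, -0.12, 0.30, 0.39, 0.41, 0.44,
        0.44, 0.71, 0.85, 0.87, 1.15, 1.37, 1.41, 1.81, 2.65, 3.69]
F_e = ecdf(data)
print(F_e(0.0))   # 6 of the 20 observations are <= 0, so 0.3
print(F_e(3.69))  # all observations are <= the maximum, so 1.0
```

Sampling from $F_e$ amounts to drawing one of the observed values with probability 1/n each.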

The Bootstrap Algorithm for the variance of a parameter estimator

• Given a data set with n observations.

• Simulate r (e.g., r = 100) data sets, each with n “observations” sampled from the empirical distribution $F_e$.

• (To simulate one such data set, simply take n samples from the original data set with replacement)

• For each simulated data set, estimate the parameter of interest (e.g., the median). This is a bootstrap replicate of the estimate.

• Finally report the variance among the bootstrap replicates.
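The algorithm above could be sketched in Python roughly as follows. The function name, its parameters and the use of the standard `random` and `statistics` modules are assumptions made for illustration; this is one possible implementation, not the course's reference solution.

```python
import random
import statistics


def bootstrap_variance(data, estimator=statistics.median, r=100, seed=None):
    """Bootstrap estimate of the variance of estimator(data)."""
    rng = random.Random(seed)
    n = len(data)
    replicates = []
    for _ in range(r):
        # n draws with replacement, i.e. a sample from the empirical distribution.
        resample = rng.choices(data, k=n)
        # One bootstrap replicate of the estimate.
        replicates.append(estimator(resample))
    # Variance among the bootstrap replicates.
    return statistics.variance(replicates)


data = [-2.20, -1.68, -1.43, -0.77, -0.76, -0.12, 0.30, 0.39, 0.41, 0.44,
        0.44, 0.71, 0.85, 0.87, 1.15, 1.37, 1.41, 1.81, 2.65, 3.69]
print(bootstrap_variance(data, statistics.median, r=100, seed=42))
```

Passing the estimator as a function means the same routine can be reused for the mean or any other statistic.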


Advantages of the Bootstrap method

• Does not require the distribution in parametric form.

• Easily implemented.

• Applies also to estimators which cannot easily be analysed.

• Generalizes e.g. to confidence intervals.
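To illustrate the last point, one common way to turn bootstrap replicates into a confidence interval is the percentile method. The slides do not say which interval construction is intended, so the method, the function name and the defaults below are all assumptions.

```python
import random
import statistics


def bootstrap_percentile_interval(data, estimator=statistics.median,
                                  r=1000, level=0.95, seed=None):
    """Percentile bootstrap interval for estimator(data) (a sketch)."""
    rng = random.Random(seed)
    n = len(data)
    # Sorted bootstrap replicates of the estimate.
    replicates = sorted(estimator(rng.choices(data, k=n)) for _ in range(r))
    alpha = (1.0 - level) / 2.0
    # Approximate empirical quantiles of the replicates.
    lower = replicates[int(alpha * (r - 1))]
    upper = replicates[int((1.0 - alpha) * (r - 1))]
    return lower, upper


data = [-2.20, -1.68, -1.43, -0.77, -0.76, -0.12, 0.30, 0.39, 0.41, 0.44,
        0.44, 0.71, 0.85, 0.87, 1.15, 1.37, 1.41, 1.81, 2.65, 3.69]
print(bootstrap_percentile_interval(data, seed=1))
```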

Exercise 8

1. Exercise 13 in Chapter 8 of Ross (P.152).

2. Exercise 15 in Chapter 8 of Ross (P.152).

3. Write a subroutine that takes as input a “data” vector of observed values, and which outputs the median as well as the bootstrap estimate of the variance of the median, based on r = 100 bootstrap replicates. Simulate N = 200 Pareto distributed random variates with β = 1 and k = 1.05.

(a) Compute the mean and the median (of the sample)

(b) Make the bootstrap estimate of the variance of the sample mean.

(c) Make the bootstrap estimate of the variance of the sample median.

(d) Compare the precision of the estimated median with the precision of the estimated mean.
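As a possible starting point for part 3, the Pareto variates could be generated by inverse transform sampling. This sketch assumes the parameterisation $F(x) = 1 - (\beta/x)^k$ for $x \ge \beta$, which the slide does not state explicitly; the function name is hypothetical.

```python
import random


def pareto_variates(n, beta=1.0, k=1.05, seed=None):
    """Simulate n Pareto variates, assuming F(x) = 1 - (beta / x)**k, x >= beta."""
    rng = random.Random(seed)
    # rng.random() lies in [0, 1), so 1 - U lies in (0, 1] and the power is finite.
    return [beta * (1.0 - rng.random()) ** (-1.0 / k) for _ in range(n)]


sample = pareto_variates(200, beta=1.0, k=1.05, seed=2)
print(len(sample), min(sample))
```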
