18.440: Lecture 10 .1in Variance and standard...

18.440: Lecture 10

Variance and standard deviation

Scott Sheffield

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Recall definitions for expectation

I Recall: a random variable X is a function from the state spaceto the real numbers.

I Can interpret X as a quantity whose value depends on theoutcome of an experiment.

I Say X is a discrete random variable if (with probability one)it takes one of a countable set of values.

I For each a in this countable set, write p(a) := P{X = a}.Call p the probability mass function.

I The expectation of X , written E [X ], is defined by

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

E [X ] =∑

x :p(x)>0

xp(x).

I Also,E [g(X )] =

∑x :p(x)>0

g(x)p(x).

18.440 Lecture 10

Defining variance

I Let X be a random variable with mean µ.

I The variance of X , denoted Var(X ), is defined byVar(X ) = E [(X − µ)2].

I Taking g(x) = (x − µ)2, and recalling thatE [g(X )] =

∑x :p(x)>0 g(x)p(x), we find that

Var[X ] =∑

x :p(x)>0

(x − µ)2p(x).

I Variance is one way to measure the amount a random variable“varies” from its mean over successive trials.

18.440 Lecture 10

Defining variance

Var[X ] =∑

x :p(x)>0

(x − µ)2p(x).

18.440 Lecture 10

Defining variance

Var[X ] =∑

x :p(x)>0

(x − µ)2p(x).

18.440 Lecture 10

Defining variance

Var[X ] =∑

x :p(x)>0

(x − µ)2p(x).

18.440 Lecture 10

Very important alternate formula

I We introduced above the formula Var(X ) = E [(X − µ)2].

I This can be written Var[X ] = E [X 2 − 2Xµ+ µ2].

I By additivity of expectation, this is the same asE [X 2]− 2µE [X ] + µ2 = E [X 2]− µ2.

I This gives us our very important alternate formula:Var[X ] = E [X 2]− (E [X ])2.

I Original formula gives intuitive idea of what variance is(expected square of difference from mean). But we will oftenuse this alternate formula when we have to actually computethe variance.

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Variance examples

I If X is number on a standard die roll, what is Var[X ]?

I Var[X ] = E [X 2]− E [X ]2 =1612 + 1

622 + 1632 + 1

642 + 1652 + 1

662 − (7/2)2 = 916 −

494 = 35

I Let Y be number of heads in two fair coin tosses. What isVar[Y ]?

I Recall P{Y = 0} = 1/4 and P{Y = 1} = 1/2 andP{Y = 2} = 1/4.

I Then Var[Y ] = E [Y 2]− E [Y ]2 = 1402 + 1

212 + 1422 − 12 = 1

18.440 Lecture 10

Variance examples

I Var[X ] = E [X 2]− E [X ]2 =1612 + 1

622 + 1632 + 1

642 + 1652 + 1

662 − (7/2)2 = 916 −

494 = 35

I Then Var[Y ] = E [Y 2]− E [Y ]2 = 1402 + 1

212 + 1422 − 12 = 1

18.440 Lecture 10

Variance examples

I Var[X ] = E [X 2]− E [X ]2 =1612 + 1

622 + 1632 + 1

642 + 1652 + 1

662 − (7/2)2 = 916 −

494 = 35

I Then Var[Y ] = E [Y 2]− E [Y ]2 = 1402 + 1

212 + 1422 − 12 = 1

18.440 Lecture 10

Variance examples

I Var[X ] = E [X 2]− E [X ]2 =1612 + 1

622 + 1632 + 1

642 + 1652 + 1

662 − (7/2)2 = 916 −

494 = 35

I Then Var[Y ] = E [Y 2]− E [Y ]2 = 1402 + 1

212 + 1422 − 12 = 1

18.440 Lecture 10

Variance examples

I Var[X ] = E [X 2]− E [X ]2 =1612 + 1

622 + 1632 + 1

642 + 1652 + 1

662 − (7/2)2 = 916 −

494 = 35

I Then Var[Y ] = E [Y 2]− E [Y ]2 = 1402 + 1

212 + 1422 − 12 = 1

18.440 Lecture 10

More variance examples

I You buy a lottery ticket that gives you a one in a millionchance to win a million dollars.

I Let X be the amount you win. What’s the expectation of X?

I How about the variance?

I Variance is more sensitive than expectation to rare “outlier”events.

I At a particular party, there are four five-foot-tall people, fivesix-foot-tall people, and one seven-foot-tall person. You pickone of these people uniformly at random. What is theexpected height of the person you pick?

I Variance?

18.440 Lecture 10

I Variance?

18.440 Lecture 10

I Variance?

18.440 Lecture 10

I Variance?

18.440 Lecture 10

I Variance?

18.440 Lecture 10

I Variance?

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Identity

I If Y = X + b, where b is constant, then does it follow thatVar[Y ] = Var[X ]?

I Yes.

I We showed earlier that E [aX ] = aE [X ]. We claim thatVar[aX ] = a2Var[X ].

I Proof: Var[aX ] = E [a2X 2]− E [aX ]2 = a2E [X 2]− a2E [X ]2 =a2Var[X ].

18.440 Lecture 10

Identity

I Yes.

18.440 Lecture 10

Identity

I Yes.

18.440 Lecture 10

Identity

I Yes.

18.440 Lecture 10

Standard deviation

I Write SD[X ] =√Var[X ].

I Satisfies identity SD[aX ] = aSD[X ].

I Uses the same units as X itself.

I If we switch from feet to inches in our “height of randomlychosen person” example, then X , E [X ], and SD[X ] each getmultiplied by 12, but Var[X ] gets multiplied by 144.

18.440 Lecture 10

Standard deviation

18.440 Lecture 10

Standard deviation

18.440 Lecture 10

Standard deviation

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Outline

Defining variance

Examples

Properties

Decomposition trick

18.440 Lecture 10

Number of aces

I Choose five cards from a standard deck of 52 cards. Let A bethe number of aces you see.

I Compute E [A] and Var[A].

I How many five card hands total?

I Answer:(525

I How many such hands have k aces?

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

k=0 kP{A = k},I and Var[A] =

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

k=0 kP{A = k},

I and Var[A] =∑4

k=0 k2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces

I Answer:(525

I Answer:(4k

)( 485−k

I So P{A = k} =(4k)( 48

5−k)(525 )

I So E [A] =∑4

∑4k=0 k

2P{A = k} − E [A]2.

18.440 Lecture 10

Number of aces revisited

I Choose five cards in order, and let Ai be 1 if the ith cardchosen is an ace and zero otherwise.

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

I Now A2 = (A1 + A2 + . . .+ A5)2 can be expanded into 25terms: A2 =

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I Five terms of form E [AiAj ] with i = j five with i 6= j . Firstfive contribute 1/13 each. How about other twenty?

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

I Then A =∑5

i=1 Ai . And E [A] =∑5

i=1 E [Ai ] = 5/13.

∑5i=1

∑5j=1 AiAj .

I So E [A2] =∑5

∑5j=1 E [AiAj ].

I E [AiAj ] = (1/13)(3/51) = (1/13)(1/17). SoE [A2] = 5

13 + 2013×17 = 105

13×17 .

I Var[A] = E [A2]− E [A]2 = 10513×17 −

2513×13 .

18.440 Lecture 10

Hat problem variance

I In the n-hat shuffle problem, let X be the number of peoplewho get their own hat. What is Var[X ]?

I We showed earlier that E [X ] = 1. So Var[X ] = E [X 2]− 1.

I But how do we compute E [X 2]?

I Decomposition trick: write variable as sum of simple variables.

I Let Xi be one if ith person gets own hat and zero otherwise.Then X = X1 + X2 + . . .+ Xn =

∑ni=1 Xi .

I We want to compute E [(X1 + X2 + . . .+ Xn)2].

I Expand this out and using linearity of expectation:

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

∑ni=1 Xi .

E [n∑

n∑j=1

Xj ] =n∑

n∑j=1

E [XiXj ] = n·1n

+n(n−1)1

n(n − 1)= 2.

I So Var[X ] = E [X 2]− (E [X ])2 = 2− 1 = 1.

18.440 Lecture 10

18.440: Lecture 10 .1in Variance and standard...

Documents

Transcript of 18.440: Lecture 10 .1in Variance and standard...

Lecture 9: Kernel (Variance Component) Tests and Omnibus ...

1 Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION.

M.A.N. 18.440 TGX XLX INTARDER

18.440: Lecture 20 .1in Exponential random variablesmath.mit.edu/~sheffield/440/Lecture20.pdfOutline Exponential random variables Minimum of independent exponentials Memoryless property

Lecture 8: Supervised Burden Tests and Variance Component ...faculty.washington.edu/tathornt/SISG2015/lectures/...Lecture 8: Supervised Burden Tests and Variance Component Test for

Lecture 10: Discount Factor, Beta, Mean-variance Frontier

18.440: Lecture 15 .1in Continuous random variablesmath.mit.edu/~sheffield/440/Lecture15.pdf · Outline Continuous random variables Expectation and variance of continuous random variables

Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.

18.440: Lecture 6 .1in Conditional probabilitymath.mit.edu/~sheffield/440/Lecture6.pdf · 18.440 Lecture 6. Conditional probability I Suppose I have a sample space S with n equally

Risk Analysis & Modelling Lecture 2: The Mean-Variance Model.

Expected Utility, Mean-Variance and Risk Aversion Lecture VII.

Lecture 6: Distance-based multivariate analysis of variance

18.440: Lecture 9 of discrete random variables · 18.440: Lecture 9 Expectations of discrete random variables Scott Sheﬃeld. MIT. 18.440. Lecture 9 1. Outline. Deﬁning expectation.

Lecture Slides-mean Variance

STAT 430/510 Probability Lecture 8: Variance, Bernoulli ...stat.wharton.upenn.edu/~pengyuan/stat430/notes/Lecturenote8.pdf · Lecture 8: Variance, Bernoulli Distribution, Binomial

18.440: Lecture 34 .1in Entropy - MIT Mathematicsmath.mit.edu/~sheffield/440/Lecture34.pdf · 18.440 Lecture 34. What is entropy? I Entropy is an important notion in thermodynamics,

Lecture Lecture 07: 07: MeanMean--Variance Analysis …markus/teaching/Fin501/07Lecture.pdf · Lecture Lecture 07: 07: MeanMean--Variance Analysis &Variance Analysis & Capital Asset

Lecture 1: Course logistics, introduction, bias-variance ...

LECTURE 6: Variance; Cond-tioning on an event; Multiple ......LECTURE 6: Variance; Cond-tioning on an event; Multiple random variables • Variance and its properties - Variance of

18.440: Lecture 12 .1in Poisson random variablesmath.mit.edu/~sheffield/440/Lecture12.pdf · 18.440 Lecture 12. Poisson random variables: motivating questions I How many raindrops