Introduction to Probability and Statistics

30
Introduction to Introduction to Probability Probability and Statistics and Statistics Chapter 6 The Normal Probability Distribution

description

Introduction to Probability and Statistics. Chapter 6 The Normal Probability Distribution. Continuous Random Variables. Continuous random variables can assume infinitely many values corresponding to points on a line interval. Examples: Heights, Weights Lifetime of a particular product - PowerPoint PPT Presentation

Transcript of Introduction to Probability and Statistics

Page 1: Introduction to Probability  and Statistics

Introduction to Probability Introduction to Probability and Statisticsand Statistics

Chapter 6

The Normal Probability Distribution

Page 2: Introduction to Probability  and Statistics

Continuous Random VariablesContinuous Random Variables• Continuous random variables can assume

infinitely many values corresponding to points on a line interval.

• Examples:Examples:

– Heights, Weights

– Lifetime of a particular product

– Experimental laboratory error

Page 3: Introduction to Probability  and Statistics

Continuous Random VariablesContinuous Random Variables• A smooth curvesmooth curve describes the probability

distribution of a continuous random variable.

• The depth or density of the probability, which varies with x, may be described by a mathematical formula f (x ), called the probability distributionprobability distribution or probability density functionprobability density function for the random variable x.

Page 4: Introduction to Probability  and Statistics

Properties of ContinuousProperties of ContinuousRandom Variable Random Variable xx

• The area under the curve is equal to 1.1.• P(a < x < b) = area under the curvearea under the curve

between a and b.

• There is no probability attached to any single value of x. That is, P(x = a) = 0.

Page 5: Introduction to Probability  and Statistics

Properties of ContinuousProperties of ContinuousRandom Variable Random Variable xx

• Total probability is 1• P( x = a) = 0• P( x a) = P( x < a)

(not true when x is discrete)• P( a < x < b) is the area between a and b

under the density curve• P( x < a) is the area to the left of a• P( x > a) is the area to the right of a

Page 6: Introduction to Probability  and Statistics

Continuous Probability Continuous Probability DistributionsDistributions

• There are many different types of continuous random variablesa. Uniform;b. Exponential;c. Normal.

• We try to pick a model that– Fits the data well– Allows us to make the best possible

inferences using the data.

Page 7: Introduction to Probability  and Statistics

Normal DistributionNormal Distribution

deviation. standard is mean; s

7183.21416.3

for 2

1)(

2

2

1

i

e

xexfx

deviation. standard is mean; s

7183.21416.3

for 2

1)(

2

2

1

i

e

xexfx

• The formula that generates the normal probability density is:

• Standard Normal: 0, = 1.2

2

2

1)(

x

exf

2

2

2

1)(

x

exf

Page 8: Introduction to Probability  and Statistics

Normal DistributionNormal Distribution

• The shape and location of the normal curve changes as the mean and standard deviation change.

• Mean locates the center of the curve;

• Standard deviation determines the shape:

1. Large values of standard deviation reduce height and increase spread.

2. Small values increase height and reduce spread.

Page 9: Introduction to Probability  and Statistics

The Standard Normal The Standard Normal DistributionDistribution

• To find P(a < x < b), we need to find the area under the appropriate normal curve.

• To simplify the tabulation of these areas, we standardize standardize each value of x by expressing it as a z-score, the number of standard deviations it lies from the mean .

x

z

x

z

Page 10: Introduction to Probability  and Statistics

Standard Standard Normal (z) Normal (z)

DistributionDistribution

• z has Mean = 0; Standard deviation = 1

• Symmetric about z = 0

• Total area under curve is 1;

• Area to the right of 0 is 0.5;

• Area to the left of 0 is 0.5. P ( z < 0 ) = .5

P ( z > 0 ) = .5

Total probability

Page 11: Introduction to Probability  and Statistics

P(z < 1.36) ?P(z < 1.36) ?

Using Table 3Using Table 3Use Table 3 to calculate the probability:

P ( z < 1.36) = .9131P ( z < 1.36) = .9131

Area to the left of 1.36

Area to the left of 1.36

=.9131

Page 12: Introduction to Probability  and Statistics

P( z > 1.36) ?P( z > 1.36) ?

Using Table 3Using Table 3Use Table 3 to find the probability:

P ( z > 1.36) = 1- P ( z 1.36) = 1 - .9131 = .0869

P ( z > 1.36) = 1- P ( z 1.36) = 1 - .9131 = .0869

Area to the right of 1.36

Page 13: Introduction to Probability  and Statistics

P(-1.20 < z <1.36)?P(-1.20 < z <1.36)?

Using Table 3Using Table 3Use Table 3 to calculate the probability:(Area between)

P(-1.20 < z <1.36)= P ( z < 1.36) - P ( z <-1.2) = .9131 - .1151 = .7980

P(-1.20 < z <1.36)= P ( z < 1.36) - P ( z <-1.2) = .9131 - .1151 = .7980

Area between -1.2 and 1.36

Page 14: Introduction to Probability  and Statistics

Check Empirical RuleCheck Empirical Rule• within 3 standard deviations

P(-3 < z <3)= .9987 - .0013=.9974

P(-3 < z <3)= .9987 - .0013=.9974

Remember the Empirical Rule: Approximately 99.7% of the measurements lie within 3 standard deviations of the mean.

P(-1 < z <1) = .8413 - .1587 = .6826

P(-1 < z <1) = .8413 - .1587 = .6826

P(-2 < z < 2)= .9772 - .0228 = .9544

P(-2 < z < 2)= .9772 - .0228 = .9544

Page 15: Introduction to Probability  and Statistics

1. 1.645 is halfway of 1.64 and 1.65

2. Look for areas of 1.64 and 1.65 in Table 3.

1. 1.645 is halfway of 1.64 and 1.65

2. Look for areas of 1.64 and 1.65 in Table 3.

z value with more z value with more than two decimalsthan two decimals

3. Since the value 1.645 is halfway between 1.64 and 1.65, we average areas .9495 and .9505.

4. P( z 1.645) = (.9495+.9505)/2 = .9500

P( z < 1.643) = P( z < 1.64)= .9495

P( z < 1.643) = P( z < 1.64)= .9495

P( z < 1.6474) = P( z < 1.65)= .9505

P( z < 1.6474) = P( z < 1.65)= .9505 P( z < 1.645) ?P( z < 1.645) ?

Page 16: Introduction to Probability  and Statistics

Extreme z valuesExtreme z values

Using Table 3, calculate P(z<-5) = ?P(z>4) = ?

• P(z<-5) = 0• P(z>4) = 0

Page 17: Introduction to Probability  and Statistics

General Normal & General Normal & Standard NormalStandard Normal

• x is normal with mean and standard deviation .

• Question: P(a < x < b) ? • i.e. area under the normal curve from a to b.• To simplify the tabulation of these areas, we

standardize standardize each value of x by expressing it as a z-score, the number of standard deviations it lies from the mean .

x

z

x

z z is standard normal

Page 18: Introduction to Probability  and Statistics

ExampleExample• x is normal with mean 0

and standard deviation 2.

2

6.0

xz

2

6.0

xz

z is standard normal

• x is normal with mean 10.2 and standard deviation 5.

5

2.10

xz

5

2.10

xz

z is standard normal

Page 19: Introduction to Probability  and Statistics

Probabilities for General Normal Probabilities for General Normal Random VariableRandom Variable

To find an area for a normal random variable x with mean and standard deviation standardize or rescale the interval in terms of z. Find the appropriate area using Table 3.

To find an area for a normal random variable x with mean and standard deviation standardize or rescale the interval in terms of z. Find the appropriate area using Table 3.

Example: Example: x has a normal distribution with = 5 and = 2. Find P(x > 7).

1587.8413.1

)1(1)1(

)2

57

2

5()7(

zPzP

xPxP

1 z

Page 20: Introduction to Probability  and Statistics

ExampleExample

The weights of packages of ground beef are normally distributed with mean 1 pound and standard deviation .10. What is the probability that a randomly selected package weighs between 0.80 and 0.85 pounds?

)85.80(. xP

)1.

185.

1.

1

1.

18.(

xP

0440.0228.0668. )5.12( zP

Page 21: Introduction to Probability  and Statistics

Area under General Area under General Normal CurveNormal Curve

Studies show that gasoline use for compact cars sold in U.S. is normally distributed, with a mean of 25.5 mpg and a standard deviation of 4.5 mpg.What is the percentage of compacts get 30 mpg or more?

)30( xP

)5.4

5.2530

5.4

5.25(

xP

1587.

)1()1( zPzP using Table 3

15.87% of compacts get 30 mpg or more

Page 22: Introduction to Probability  and Statistics

To find an area to the left of a z-value, find the area directly from the table.

e.g. P( z < 1.36)To find an area to the right of a z-value, find the area in Table 3 and subtract from 1. or find the area with respect to the negative of the z-value;

e.g. P( z >1.36) = 1-P( z <1.36), P( z>1.36) = P( z< -1.36) To find the area between two values of z, find the two areas in Table 3, and subtract.

e.g. P( -1.20 < z < 1.36 )

To find an area to the left of a z-value, find the area directly from the table.

e.g. P( z < 1.36)To find an area to the right of a z-value, find the area in Table 3 and subtract from 1. or find the area with respect to the negative of the z-value;

e.g. P( z >1.36) = 1-P( z <1.36), P( z>1.36) = P( z< -1.36) To find the area between two values of z, find the two areas in Table 3, and subtract.

e.g. P( -1.20 < z < 1.36 )

Using Table 3Using Table 3

To find an area for a normal random variable x with mean and standard deviation standardize or rescale the interval in terms of z. Find the appropriate area using Table 3.

To find an area for a normal random variable x with mean and standard deviation standardize or rescale the interval in terms of z. Find the appropriate area using Table 3.

x

z

x

z

Page 23: Introduction to Probability  and Statistics

1. Look for the four digit area closest to .2500 in Table 3.

2. What row and column does this value correspond to?

1. Look for the four digit area closest to .2500 in Table 3.

2. What row and column does this value correspond to?

Working BackwardsWorking Backwards

Find the value of z that has area .25 to its left.

4. What percentile does this value represent?

4. What percentile does this value represent? 25th percentile,

or 1st quartile (Q1)

3. z = -.67

Page 24: Introduction to Probability  and Statistics

1. The area to its left will be 1 - .05 = .95

2. Look for the four digit area closest to .9500 in Table 3.

1. The area to its left will be 1 - .05 = .95

2. Look for the four digit area closest to .9500 in Table 3.

Working BackwardsWorking Backwards

Find the value of z that has area .05 to its right.

3. Since the value .9500 is halfway between .9495 and .9505, we choose z halfway between 1.64 and 1.65.

4. z = 1.645

Page 25: Introduction to Probability  and Statistics

Example 1Example 1Find the value of z, say z0 , such that .01 of the area is to its right. (tail area of .01)

01.)( 0 zzP 01.)( 0 zzP

33.20 z

99.)( 0 zzPusing Table 3

Page 26: Introduction to Probability  and Statistics

Example 2Example 2Find the value of z, say z0 , such that .95 of the area is within z0 standard deviations of the mean.

95.)( 00 zzzP 95.)( 00 zzzP

296.10 z

975.)( 0 zzPusing Table 3

Page 27: Introduction to Probability  and Statistics

The weights of packages of ground beef are normally distributed with mean 1 pound and standard deviation .10. What is the weight of a package such that only 1% of all packages exceed this weight?

01.)( 0 xxP

01.)( 0 xxP

99th percentile

99.)( 0 xxP

33.21.

10 x

233.11)1(.33.20 x

using Table 3

Example 3Example 3

99.)1.

1( 0

xzP

Page 28: Introduction to Probability  and Statistics

ExerciseExercise

Page 29: Introduction to Probability  and Statistics

Key ConceptsKey ConceptsI. Continuous Probability DistributionsI. Continuous Probability Distributions

1. Continuous random variables

2. Probability distributions or probability density functions

a. Curves are smooth.

b. The area under the curve between a and b represents

the probability that x falls between a and b.

c. P (x a) 0 for continuous random variables.

II. The Normal Probability DistributionII. The Normal Probability Distribution

1. Symmetric about its mean .

2. Shape determined by its standard deviation .

Page 30: Introduction to Probability  and Statistics

Key ConceptsKey ConceptsIII. The Standard Normal DistributionIII. The Standard Normal Distribution

1. The normal random variable z has mean 0 and standard deviation 1.2. Any normal random variable x can be transformed to a standard normal random variable using

3. Convert necessary values of x to z.4. Use Table 3 in Appendix I to compute standard normal probabilities.5. Several important z-values have tail areas as follows:

Tail Area: .005 .01 .025 .05 .10

z-Value: 2.58 2.33 1.96 1.645 1.28

x

z

x

z