Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

71

Transcript of Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Page 1: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.
Page 2: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

• Your 4 th homework is assigned. It is due on 12th of Feb, 11:59 pm.

Page 3: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

a) Adult not working during summer vacation.

b)

•The experiment consist of 10 identical trials. A trial for this experiment is an individual.

•There are only two possible outcomes: work or do not work

•The probability remains same for each individual (trial)

•Individuals are independent

c) 0.35

d) 0.2522

e) 0.2616

Page 4: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

By using TI-84:

d) P(X = 3) = = P(X ≤ 3) – P(X ≤ 2)

binomcdf(10,.35,3) - binomcdf(10,.35,2)

e) P(X ≤ 2) binomcdf(10,.35,2)

Page 5: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Thinking Challenge

• The communications monitoring company Postini has reported that 91% of e-mail messages are spam. Suppose your inbox contains 25 messages.

• What are the mean and standard deviation of the number of real messages you should expect to find in your inbox?

• What is the probability that you will find only 1 or 2 real messages?

Page 6: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Thinking Challenge

• What are the mean and standard deviation of the number of real messages you should expect to find in your inbox? (2.25, 1.43)

• What is the probability that you will find only 1 or 2 real messages?

(P(X=1)+P(X=2)= 0.5117)

Page 7: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Content

1. Two Types of Random Variables

2. Probability Distributions for Discrete Random Variables

3. The Binomial Distribution

4. Hypergeometric Distributions

5. Probability Distributions for Continuous Random Variables

6. The Normal Distribution

7. Uniform Distribution

Page 8: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

4.4

Hypergeometric Distribution

Page 9: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Characteristics of a Hypergeometric

Random Variable

1. The experiment consists of randomly drawing n elements without replacement from a set of N elements, r of which are S’s (for success) and (N – r) of which are F’s (for failure).

2. The hypergeometric random variable x is the number of S’s in the draw of n elements.

Page 10: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Hypergeometric Probability Distribution Function

where . . .

[x = Maximum [0, n – (N – r), …, Minimum (r, n)]

p x

r

x

N r

n x

N

n

µ nr

N 2

r N r n N n N 2 N 1

Page 11: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Hypergeometric Probability Distribution Function

N = Total number of elements

r = Number of S’s in the N elements

n = Number of elements drawn

x = Number of S’s drawn in the n elements

Page 12: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

a. Hypergeometric

b. Binomial

Page 13: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Thinking Challenge

A carton of 12 eggs has 4 rotten eggs and 8 good eggs. Three eggs are chosen at random from the carton to make a three-egg omelet.

Let X = the number of rotten eggs chosen. What is the probability that the sample will consist of one rotten egg and two good eggs, that is, what is P(X = 1)?

(a) 81/220 (b) 192/220 (c) 112/220 (d) 56/220

Page 14: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Thinking Challenge

• 5 cards are picked from a deck of 52 cards without replacement. What is the probability that 2 of the selected cards will be Ace?

a) b) c) d)

• • • •

Page 15: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

4.5

Probability Distributions for Continuous Random Variables

Page 16: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Continuous Probability Density Function

The graphical form of the probability distribution for a continuous random variable x is a smooth curve that might appear as below:

Page 17: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Continuous Probability Density Function

This curve, a function of x, is denoted by the symbol f(x) and is variously called a probability density function (pdf), a frequency function, or a probability distribution.•The areas under a probability distribution correspond to probabilities for x. The area A beneath the curve between two points a and b is the probability that x assumes a value between a and b.

Page 18: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Continuous Probability Density Function

P(a<x<b)=P(a≤x≤b) sinceP(x=a)=P(x=b)=0

Page 19: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

4.6

The Normal Distribution

Page 20: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Importance of Normal Distribution

1. Describes many random processes or continuous phenomena

2. Can be used to approximate discrete probability distributions

• Example: binomial

3. Basis for classical statistical inference

Page 21: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Normal Distribution

1. ‘Bell-shaped’ & symmetrical

2. Mean, median, mode are equal •x

•f•(•x•)

Mean Median Mode

Page 22: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Probability Density Function

whereµ = Mean of the normal random variable x = Standard deviationπ = 3.1415 . . .e = 2.71828 . . . P(x < a) is obtained from a table of normal

probabilities

f (x) 1

2e

1

2

x

2

Page 23: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Effect of Varying Parameters ( & )

Page 24: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Normal Distribution Probability

c dx

f(x)

•Probability is area under curve!

P(c x d) f (x)c

d

dx?

Page 25: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Standard Normal Distribution •The standard normal distribution is a normal distribution with µ = 0 and = 1. A random variable with a standard normal distribution, denoted by the symbol z, is called a standard normal random variable.

Page 26: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

The Standard Normal Table:P(0 < z < 1.96)

z = 0

= 1

1.96

Z .04 .05

1.8 .4671 .4678 .4686

.4738 .4744

2.0 .4793 .4798 .4803

2.1 .4838 .4842 .4846

.06

1.9 .4750

Standard Normal Probability Table (Portion)

Probabilities

.4750

Page 27: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

The Standard Normal Table:P(–1.26 z 1.26)

z = 0

= 1

–1.26

Standard Normal Distribution

.3962

1.26

.3962 P(–1.26 ≤ z ≤ 1.26)

= .3962 + .3962

= .7924

Page 28: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

The Standard Normal Table:P(z > 1.26)

z

Standard Normal Distribution

1.26

P(z > 1.26)

= .5000 – .3962

= .1038.3962

.500

= 1

= 0

.500

Page 29: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

The Standard Normal Table:P(–2.78 z –2.00)

–2.78 z–2.00

.4973

.4772

Standard Normal Distribution

P(–2.78 ≤ z ≤ –2.00)

= .4973 – .4772

= .0201

= 1

= 0

Page 30: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

The Standard Normal Table:P(z > –2.13)

z–2.13

Standard Normal Distribution

P(z > –2.13)

= .4834 + .5000

= .9834

.5000.4834

= 1

= 0

Page 31: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

•The rest of the quizzes will be held in the lecture on Wednesdays.

•The updated dates for quizzes are as below.

•I will take attendance and will give extra credit to the students who have attended at least 8 of the lectures till the end of the semester.

Page 32: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal Distribution

x

f(x)

Normal distributions differ by mean & standard deviation.

Each distribution would require its own table.

That’s an infinite number of tables!

Page 33: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Property of Normal Distribution

•If x is a normal random variable with mean μ and standard deviation , then the random variable z, defined by the formula

has a standard normal distribution. The value z describes the number of standard deviations between x and µ.

z x µ

Page 34: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Standardize theNormal Distribution

Normal Distribution

x

One table!

= 1

z

Standard Normal Distribution

z x

= 0

Page 35: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Finding a Probability Corresponding to a Normal

Random Variable1. Sketch normal distribution, indicate mean, and shade

the area corresponding to the probability you want.

2. Convert the boundaries of the shaded area from x values to standard normal random variable z

z x µ

Show the z values under corresponding x values.

3. Use Table in Appendix D to find the areas corresponding to the z values. Use symmetry when necessary.

Page 36: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

• Of course, the TI does it all: normalcdf(a,b,μ,σ) returns the probability P(a < x < b) with x distributed N(μ, σ).

Page 37: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal μ = 5, σ = 10: P(5 < x < 6.2)

z

= 1

.12

Standard Normal Distribution

.0478

Normal Distribution

x = 5

= 10

6.2 = 0

z

x

6.2 5

10.12

Page 38: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal μ = 5, σ = 10: P(3.8 x 5)

z = 0-.12

Standardized Normal Distribution

Normal Distribution

x = 53.8

.0478

z

x

3.8 5

10 .12

= 10 = 1

Page 39: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal μ = 5, σ = 10: P(2.9 x 7.1)

0-.21 z.21

Standard Normal Distribution

52.9 7.1 x

Normal Distribution

.1664

.0832.0832

z

x

2.9 5

10 .21

z

x

7.1 5

10.21

= 10 = 1

Page 40: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal μ = 5, σ = 10: P(x 8)

x = 5

= 10

8

Normal Distribution

= 1

z = 0 .30

Standard Normal Distribution

.3821.5000

.1179

z

x

8 5

10.30

Page 41: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Non-standard Normal μ = 5, σ = 10: P(7.1 X 8)

= 1

.30 z.21

Standard Normal Distribution

= 10

87.1 x

Normal Distribution

.1179 .0347.0832

z

x

7.1 5

10.21

z

x

8 5

10.30

= 5 = 0

Page 42: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Normal Distribution Thinking Challenge

You work in Quality Control for GE. Light bulb life has a normal distribution with = 2000 hours and = 200 hours. What’s the probability that a bulb will last

A. between 2000 and 2400 hours?

B. less than 1470 hours?

Page 43: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Solution* P(2000 x 2400)

Standard Normal Distribution

z = 0 2.0

Normal Distribution

x= 2000

= 200

2400

= 1

.4772

z

x

2400 2000

2002.0

Page 44: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Solution* P(x 1470)

z= 0

= 1

–2.65

Standard Normal Distribution

x = 2000

= 200

1470

Normal Distribution

.0040 .4960

.5000

z

x

1470 2000

200 2.65

Page 45: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Finding z-Values for Known Probabilities

What is z, given P(z) = .1217?

z= 0

= 1

•?

.1217

Standard Normal Probability Table (Portion)

Z .00 0.2

0.0 •.0000 .0040 .0080

0.1 .0398 .0438 .0478

0.2 .0793 .0832 .0871

.1179 .1255

.01

0.3 .1217

.31

Page 46: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Finding x Values for Known Probabilities

Normal Distribution

•x= 5

= 10

•?

.1217

Standard Normal Distribution

z= 0

= 1

.31

.1217

8.1

x z 5 .31 10

Page 47: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Using a TI: invNorm(p, μ, σ) returns the 100pth percentile of N(μ, σ).

Page 48: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Normal Distribution Thinking Challenge

At one university, the students are given z-scores at the end of each semester rather than the traditional GPAs. The mean and the standard deviation of all students’ cumulative GPAs, on which the z-scores are based, are 2.7 and .5, respectively.

a) Translate each of the following z-scores to the corresponding GPA: z=2.0, z=-1, z=0.5, z=-2.5.

b) Students with z-scores below -1.6 are put on probation. What is the corresponding probationary GPA?

c) The president of the university wishes to graduate the top 20% of the students with cum laude honors and the top 2.5% with summa cum laude honors. Under the assumption that the distribution is exactly normal, by using the Table Z in the appendix, determine the limits be set in terms of original GPAs.

• 

Page 49: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Solution

a) 3.7, 2.2, 2.95, 1.45

b) 1.9

c)P(z > 0.84)=0.20 P(z > 1.96)=0.025

So,

for cum laude: 0.84=(x-2.7)/0.5x= 3.12;

for summa cum laude: 1.96=(x-2.7)/0.5

x= 3.68

Page 50: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

4.7

Descriptive Methods forAssessing Normality

Page 51: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Determining Whether the Data Are from an Approximately

Normal Distribution1. Construct either a histogram or stem-and-leaf

display for the data and note the shape of the graph. If the data are approximately normal, the shape of the histogram or stem-and-leaf display will be similar to the normal curve.

Page 52: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Determining Whether the Data Are from an Approximately

Normal Distribution

2. Compute the intervals and determine the percentage of measurements falling in each. If the data are approximately normal, the percentages will be approximately equal to 68%, 95%, and 100%, respectively; from the Empirical Rule (68%, 95%, 99.7%).

x s, x 2s, and x 3s,

Page 53: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Determining Whether the Data Are from an Approximately

Normal Distribution

3. Find the interquartile range, IQR, and standard deviation, s, for the sample, then calculate the ratio IQR/s. If the data are approximately normal, then IQR/s ≈ 1.3.

3 1Q QIQR

s s

Page 54: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Determining Whether the Data Are from an Approximately

Normal Distribution

4. Examine a normal probability plot for the data. If the data are approximately normal, the points will fall (approximately) on a straight line.

Observed value

Exp

ecte

d z

–sco

re

Page 55: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Normal Probability Plot

A normal probability plot for a data set is a scatterplot with the ranked data values on one axis and their corresponding expected z-scores from a standard normal distribution on the other axis. [Note: Computation of the expected standard normal z-scores are beyond the scope of this lecture. Therefore, we will rely on available statistical software packages to generate a normal probability plot.]

Page 56: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Using Stat Cranch for Normal probability plot

Page 57: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Using Stat Cranch for Normal probability plot

Page 58: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.
Page 59: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

4.8

Other Continuous Distributions:

Uniform

Page 60: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Uniform Distribution•Continuous random variables that appear to have equally likely outcomes over their range of possible values possess a uniform probability distribution.•Suppose the random variable x can assume values only in an interval c ≤ x ≤ d. Then the uniform frequency function has a rectangular shape.

Page 61: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Probability Distribution for a Uniform Random Variable x

Mean:

f (x)

1

d cc x dProbability density function:

d c

12

c d

2Standard Deviation:

P a x b b a d c , c a b d

Page 62: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Uniform Distribution Example

You’re production manager of a soft drink bottling company. You believe that when a machine is set to dispense 12 oz., it really dispenses between 11.5 and 12.5 oz. inclusive. Suppose the amount dispensed has a uniform distribution. What is the probability that less than 11.8 oz. is dispensed?

SODA

Page 63: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Uniform Distribution Solution

P(11.5 x 11.8) = (Base)/(Height)

= (11.8 – 11.5)/(1) = .30

11.5 12.5

f(x)

•x11.8

1 1

12.5 11.51

1.01

d c

1.0

Page 64: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

• Consider a powder specimen that has exactly 10 anthrax spores. Suppose that the number of anthrax spores in the sample detected by the new method follows an approximate uniform distribution between 0 and 10.

a. Find the probability that 8 or fewer anthrax spores are detected in the powder specimen.

b. Find the probability that between 2 and 5 anthrax spores are detected in the powder specimen.

c. Find the expected number of anthrax spores.

d. Find the standard deviation of anthrax spores.

Review Q1

Page 65: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Review Q2

• The output from a statistical computer program indicates that the mean and standard deviation of a data set consisting of 200 measurements are $1500 and $300, respectively. Suppose the frequency distribution of the data set has normal distribution. What of percentage of data will be between $750 and $1200?

Page 66: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Review Q3• Almost all companies utilize some type of year-end performance review

for their employees. Human Resources(HR) at the university of Texas health Science Center provides guidelines for supervisors rating their subordinates. For example raters are advised to examine their ratings for a tendency to be too lenient or too harsh. According to HR, “if you have this tendency, consider using a normal distribution-10% of employees (rated) exemplary, 20% distinguished, 40% competent, 20% marginal, and 10% unacceptable”.

• Suppose you are rating an employee’s performance on a scale of 1(lowest) to 100(highest). Also assume the ratings follow normal distribution with a mean of 50 and a standard deviation of 15.

a) What is the lowest rating you should give to an “exemplary” employee if you follow the Univ. of Texas HR guidelines?

b) What is the lowest rating you should give to an “competent” employee if you follow the Univ. of Texas HR guidelines?

Page 67: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Review Q4

• Suppose that 5 out of 13 liver transplants done at a hospital will fail within a year. Consider a random sample of 4 of these 13 patients. What is the probability that 1 of these patients will result in failed transplants?

Page 68: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Key IdeasProperties of Probability Distributions

Discrete Distributions

1. p(x) ≥ 0

2.

Continuous Distributions

1. P(x = a) = 0

2. P(a < x < b) = area under curve between a and b

p x 1all x

Page 69: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Key Ideas

Methods for Assessing Normality

1. Histogram

Page 70: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Key Ideas

Methods for Assessing Normality

2. Stem-and-leaf display

1 7

2 3389

3 245677

4 19

5 2

Page 71: Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.

Key Ideas

Methods for Assessing Normality

3. (IQR)/S ≈ 1.3

4. Normal probability plot