1 ES9 Chapter 5 ~ Probability Distributions (Discrete Variables) Express Checkout Number of Items...

38
1 ES9 Chapter 5 ~ Probability Distributions (Discrete Variables) 0 0.05 0.1 0.15 0.2 0.25 0.3 0 1 2 3 4 5 6 7 8 Px () x Express Checkout Number of Items Purchased

Transcript of 1 ES9 Chapter 5 ~ Probability Distributions (Discrete Variables) Express Checkout Number of Items...

1

ES9 Chapter 5 ~ Probability Distributions

(Discrete Variables)

0

0.05

0.1

0.15

0.2

0.25

0.3

0 1 2 3 4 5 6 7 8

P x( )

x

Express Checkout

Number of Items Purchased

2

ES9

Chapter Goals

• Combine the ideas of frequency distributions and probability to form probability distributions

• Investigate discrete probability distributions and study measures of central tendency and dispersion

• Study the binomial random variable

3

ES9

5.1 ~ Random Variables

• Bridge between experimental outcomes and statistical analysis

• Each outcome in an experiment is assigned to a number

• This suggests the idea of a function

4

ES9

Random Variable

Random Variable: A variable that assumes a unique numerical value for each of the outcomes in the sample space of a probability experiment

Notes: Used to denote the outcomes of a probability experiment Each outcome in a probability experiment is assigned to a unique

value Illustration: S

Outcomes

0 1 2 1 2

Random Variable

5

ES9

Examples of Random Variable

1. Let the number of computers sold per day by a local merchant be a random variable. Integer values ranging from zero to about 50 are possible values.

2. Let the number of pages in a mystery novel at a bookstore be a random variable. The smallest number of pages is 125 while the largest number of pages is 547.

3. Let the time it takes an employee to get to work be a random variable. Possible values are 15 minutes to over 2 hours.

4. Let the volume of water used by a household during a month be a random variable. Amounts range up to several thousand gallons.

5. Let the number of defective components in a shipment of 1000 be a random variable. Values range from 0 to 1000.

6

ES9

Discrete & Random Variables

Discrete Random Variable: A quantitative random variable that can assume a countable number of values

• Intuitively, a discrete random variable can assume values corresponding to isolated points along a line interval. That is, there is a gap between any two values.

Note: Usually associated with counting

Continuous Random Variable: A quantitative random variable that can assume an uncountable number of values

• Intuitively, a continuous random variable can assume any value along a line interval, including every possible value between any two values

Note: Usually associated with a measurement

7

ES9

Example

Example: Determine whether the following random variables are discrete or continuous

1. The barometric pressure at 12:00 PM

2. The length of time it takes to complete a statistics exam

3. The number of items in the shopping cart of the person in front of you at the checkout line

4. The weight of a home grown zucchini

5. The number of tickets issued by the PA State Police during a 24 hour period

6. The number of cans of soda pop dispensed by a machine placed in the Mathematics building on campus

7. The number of cavities the dentist discovers during your next visit

8

ES9 5.2 ~ Probability Distributions

of a Discrete Random Variable

• Need a complete description of a discrete random variable

• This includes all the values the random variable may assume and all of the associated probabilities

• This information may be presented in a variety of ways

9

ES9

Probability Distribution & Function

Probability Distribution: A distribution of the probabilities associated with each of the values of a random variable. The probability distribution is a theoretical distribution; it is used to represent populations.

Notes: The probability distribution tells you everything you need to know about the random

variable. The probability distribution may be presented in the form of a table, chart, function, etc.

Probability Function: A rule that assigns probabilities to the values of the random variable

10

ES9

x 0 1 2 3 4

P (x ) 2/15 4/15 5/15 3/15 1/15

Example

Example: The number of people staying in a randomly selected room at a local hotel is a random variable ranging in value from 0 to 4. The probability distribution is

known and is given in various forms below:

P x( )the random variable equals 2 P( ) 25

15

Notes: This chart implies the only values x takes on are 0, 1, 2, 3, and 4

11

ES9

0 1 2 3 40.0

0.1

0.2

0.3

0.4

x

P x( ) Hotel Room Probability Distribution

A Line Representation

12

ES9

0 1 2 3 40.0

0.1

0.2

0.3

0.4

x

P x( ) Hotel Room Probability Distribution

Histogram

A histogram may be used to present a probability distribution:

13

ES9

Notes

The histogram of a probability distribution uses the physical area of each bar to represent its assigned probability

In the Hotel Room probability distribution: the width of each bar is 1, so the height of each bar is equal to the assigned probability, which is the area of each bar.

The idea of area representing probability is important in the study of continuous random variables

14

ES9

Reminder!

Every probability function must satisfy the two basic properties of probability:

1)(0 xP

1. The probability assigned to each value of the random variable must be between 0 and 1, inclusive:

x

xP all

1)(

2. The sum of the probabilities assigned to all the values of the random variable must equal 1:

15

ES9 5.3 ~ Mean and Variance of a

Discrete Probability Distribution

• Describe the center and spread of a population

• s, s2 : population parameters

• Population parameters are usually unknown values (we would like to estimate)

16

ES9

Important Notes

2. s2 and s are the variance and standard deviation of the sample

x3. , s2, and s are called sample statistics

4. (lowercase Greek letter “mu”) is the mean of the population

5. 2 (“sigma squared”) is the variance of the population

6. (lowercase Greek letter “sigma”) is the standard deviation of the population

7. , 2, and are called population parameters. (A parameter is a constant. , 2, and are typically unknown values.)

x1. is the mean of the sample

17

ES9

[ ( )]xP x

Mean of a Discrete Random Variable

• The mean, , of a discrete random variable x is found by multiplying each possible value of x by its own probability and then adding all the products together:

Notes: The mean is the average value of the random variable, what happens on average

The mean is not necessarily a value of the random variable

18

ES9

2 2

2 2

2 2

[( ) ( )]

[ ( )] [ ( )]

[ ( )]

x P x

x P x xP x

x P x

2

Discrete Random Variables

Variance of a Discrete Random Variable: Variance, 2, of a discrete random variable x is found by multiplying each possible value of the squared deviation from the mean, (x )2, by its own probability and then adding all the products together:

Standard Deviation of a Discrete Random Variable: The positive square root of the variance:

19

ES9

x P x( ) xP x( ) x2 x P x2 ( )0 0.30 0.00 0 0.001 0.25 0.25 1 0.252 0.20 0.40 4 0.803 0.15 0.45 9 1.354 0.05 0.20 16 0.805 0.05 0.25 25 1.25

Totals 1.00 1.55 4.45

P x( )(check)

[ ( )]xP x [ ( )]x P x2

Example

Example: The number of standby passengers who get seats on a daily commuter flight from Boston to New York is a

random variable, x, with probability distribution given below (in an extensions table). Find the mean,

variance, and standard deviation:

20

ES9

[ ( )] .xP x 155

Solution

• Using the formulas for mean, variance, and standard deviation:

2 2 2

24 45 155 4 45 2 4025 2 0475

[ ( )] [ ( )]

. ( . ) . . .

x P x xP x

2 2 0475 143. .

Note: 1.55 is not a value of the random variable (in this case). It is only what happens on average.

21

ES9

P xx

x( ) 815

for 3, 4, 5, 6, 7

P( )48 415

415

P( )58 515

315

P( )68 615

215

P( )78 715

115

Example

Example: The probability distribution for a random variable x is given by the probability function:

Find the mean, variance, and standard deviation

P( )38 315

515

Solutions: Find the probability associated with each value by using the probability function:

22

ES9

3 5/15 15/15 9 45/154 4/15 16/15 16 64/155 3/15 15/15 25 75/156 2/15 12/15 36 72/157 1/15 7/15 49 49/15

Totals 15/15 65/15 305/15

x P x( ) xP x( ) x2 x P x2 ( )

[ ( )]xP x [ ( )]x P x2

[ ( )] .xP x6515

4 33

2 2 22

30515

6515

156

[ ( )] [ ( )] .x P x xP x

2 156 1 25. .

Finding the Population Parameters

• Use an extensions table to find the population parameters:

23

ES9

5.4 ~ The Binomial Probability Distribution

• One of the most important discrete distributions

• Based on a series of repeated trials whose outcomes can be classified in one of two categories: success or failure

• Distribution based on a binomial probability experiment

24

ES9

Binomial Probability Experiment

Binomial Probability Experiment: An experiment that is made up of repeated trials that possess the following properties:

1. There are n repeated independent trials

2. Each trial has two possible outcomes (success, failure)

3. P(success) = p, P(failure) = q, and p + q = 1

4. The binomial random variable x is the count of the number of successful trials that occur; x may take on any integer value from zero to n

25

ES9

Notes Properties 1 and 2 are the two basic properties of any binomial experiment

Property 3 concerns the algebraic notation for each trial

Property 4 concerns the algebraic notation for the complete experiment

Both x and p must be associated with “success” Independent trials mean that the result of one trial does not affect the probability of success of any other

trial in the experiment. The probability of “success” remains constant throughout the entire experiment.

26

ES9

ExampleExample: It is known that 40% of all graduating seniors on the campus of a

very large university have taken a statistics class. Five seniors are selected at random and asked if they have taken a statistics class. This approximates a binomial experiment:

1. A trial is asking one student, repeated 5 times. The trials are independent since the probability of taking a statistics class for any one student is not affected by the results from any other student.

2. Two outcomes on each trial: taken a statistics class (success), not taken a statistics class (failure)

3. p = P(taken a statistics class) = 0.40q = P(not taken a statistics class) = 0.60

4. x = number of students who have taken a statistics class

27

ES9

Binomial Probability Function

• For a binomial experiment, let p represent the probability of a “success” and q represent the probability of a “failure” on a single trial; then P(x), the probability that there will be exactly x successes on n trials is:

Notes: The number of ways that exactly x successes can occur in n trials:

The probability of exactly x successes: px

The probability that failure will occur on the remaining(n - x) trials: qn - x

n

x

P xn

xp q x nx n x( ) ( )( ),

for 0, 1, 2, ... , or

28

ES9

The number of ways that exactly x successes can occur in a set of n trials is represented by the symbol:

Notes

n

x

n n n n! ( )( ) ( )( )( )

!

1 2 3 2 1

6 6 5 4 3 2 1 720

n! is an abbreviation for n factorial:

1. Must always be a positive integer

2. Called the binomial coefficient

3. Found by using the formula:

n

xn

x n x

!

!( )!

29

ES9

Example

Example: According to a recent study, 65% of all homes in a certain county have high levels of radon gas leaking into their basements. Four homes are selected at

random and tested for radon. The random variable x is the number of homes with high levels of radon (out of the four).

Properties:

1. There are 4 repeated trials: n = 4. The trials are independent.

2. Each test for radon is a trial, and each test has two outcomes: radon or no radon

3. p = P(radon) = 0.65, q = P(no radon) = 0.35

p + q = 1

4. x is the number of homes with high levels of radon, possible values:0, 1, 2, 3, 4

30

ES9

P xx

xx x( ) (0. ) (0. ) ,

465 35 4 for 0, 1, 2, 3, 4

P ( ) (0. ) (0. ) ( )( )( . ) .04

065 35 1 1 0 0150 0 01500 4

P ( ) (0. ) (0. ) ( )( . )( . ) .14

165 35 4 0 65 0 0429 0 11151 3

P ( ) (0. ) (0. ) ( )( . )( . ) .24

265 35 6 0 4225 0 1225 0 31052 2

P ( ) (0. ) (0. ) ( )( . )( . ) .34

365 35 4 0 2746 0 35 0 38453 1

P ( ) (0. ) (0. ) ( )( . )( ) .44

465 35 1 0 1785 1 0 17854 0

Binomial Probability Function

31

ES9

Example Example: In a certain automobile dealership, 70% of all customers purchase an

extended warranty with their new car. For 15 customers selected at random:

1) Find the probability that exactly 12 will purchase an extended warranty

2) Find the probability at most 13 will purchase an extended warranty

P xx

xx x( ) (0. ) (0. ) ,

157 3 15 for 0, 1, 2, ... ,15

Solutions:• Let x be the number of customers who purchase an extended warranty. x is a binomial random variable.

• The probability function associated with this experiment:

32

ES9

Solutions Continued

96480.03520.1

]00470.0305[0.1

)3(0.)7(0.15

15)3(0.)7(0.

14

151

)15()14(1

)13()1()0()13(

015114

...

PP

PPPxP

2) Probability at most 13 purchase an extended warranty:

P( ) (0. ) (0. ) 0.1215

127 3 170012 3

1) Probability exactly 12 purchase an extended warranty:

33

ES9

Notes The value of many binomial probabilities (n < 15 and common values

of p) are found in Table 2, Appendix B

Minitab has special commands for computing binomial probabilities or cumulative probabilities:

PDF 12;Binomial 15 .7.Probability Density FunctionBinomial with n = 15 and p = 0.70x P( X = x)

12.00 0.1700

Many graphing calculators also have built-in functions for computing binomial probabilities and cumulative probabilities

34

ES9 5.5 ~ Mean & Standard Deviation

of the Binomial Distribution

• Population parameters of the binomial distribution help to describe the distribution

• Mean and standard deviation indicate where the distribution is centered and the spread of the distribution

35

ES9

np

npq

Mean & Standard Deviation

• The mean and standard deviation of a theoretical binomial distribution can be found by using the following two formulas:

Notes:

Mean is intuitive: number of trials multiplied by the probability of a success

The variance of a binomial probability distribution is:

2 2 npq npq

36

ES9

Solutions:

1) n = 18, p = 0.75, q = 1 - 0.75 = 0.25

np ( )(0. ) .18 75 13 5

npq ( )(0. )(0. ) . .18 75 25 3 375 18371

Example

Example: Find the mean and standard deviation of the binomial distribution when n = 18 and p = 0.75

P xx

xx x( ) (0. ) (0. )

1875 25 18 for 0, 1, 2, ... , 18

2) The probability function is:

37

ES9

Table of Values & Probabilities x P(x)

0.00 0.0000 1.00 0.0000 2.00 0.0000 3.00 0.0000 4.00 0.0000 5.00 0.0000 6.00 0.0002 7.00 0.0010 8.00 0.0042 9.00 0.0139 10.00 0.0376 11.00 0.0820 12.00 0.1436 13.00 0.1988 14.00 0.2130 15.00 0.1704 16.00 0.0958 17.00 0.0338 18.00 0.0056

38

ES9

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 180.00

0.02

0.04

0.06

0.08

0.10

0.12

0.14

0.16

0.18

0.20

0.22

x

P x( )

Histogram