Week 5 Sep 29 – Oct 3

52
Week 5 Sep 29 – Oct 3 Four Mini-Lectures QMM 510 Fall 2014

description

Week 5 Sep 29 – Oct 3. Four Mini-Lectures QMM 510 Fall 2014. Chapter Contents 7.1 Describing a Continuous Distribution 7.2 Uniform Continuous Distribution 7.3 Normal Distribution 7.4 Standard Normal Distribution 7.5 Normal Approximations 7.6 Exponential Distribution - PowerPoint PPT Presentation

Transcript of Week 5 Sep 29 – Oct 3

Page 1: Week  5  Sep  29  – Oct  3

Week 5 Sep 29 – Oct 3

Four Mini-Lectures QMM 510Fall 2014

Page 2: Week  5  Sep  29  – Oct  3

7-2

Continuous Probability Distributions ML 5.1

Chapter Contents7.1 Describing a Continuous Distribution

7.2 Uniform Continuous Distribution

7.3 Normal Distribution

7.4 Standard Normal Distribution

7.5 Normal Approximations

7.6 Exponential Distribution

7.7 Triangular Distribution (Optional)

Ch

apter 7

So many topics, so little time …

Page 3: Week  5  Sep  29  – Oct  3

7-3

• Discrete Variable – each value of X has its own probability P(X).

• Continuous Variable – events are intervals and probabilities are areas under continuous curves. A single point has no probability.

Events as Intervals

Ch

apter 7

Continuous Distributions

Page 4: Week  5  Sep  29  – Oct  3

7-4

Continuous PDF:• Denoted f(x)• Must be nonnegative• Total area under

curve = 1• Mean, variance, and

shape depend onthe PDF parameters

• PDF reveals the shape of the distribution

PDF – Probability Density Function

Ch

apter 7

Continuous Distributions

Page 5: Week  5  Sep  29  – Oct  3

7-5

Continuous CDF:• Denoted F(x)• Shows P(X ≤ x), the

cumulative proportion below x.

• Shows the area to the left of any given point on the PDF.

• There are Excel functions for either the PDF or CDF.

CDF – Cumulative Distribution Function

Ch

apter 7

Continuous Distributions

Page 6: Week  5  Sep  29  – Oct  3

7-6

• Continuous probability functions:

• Unlike discrete distributions, the probability at any singlepoint is 0.

• The entire area under any PDF, by definition, is 1.

• Mean is the balancepoint of the distribution.

Probabilities as Areas

Ch

apter 7

Continuous Distributions

Page 7: Week  5  Sep  29  – Oct  3

7-7

Expected Value and Variance

Ch

apter 7

The mean and variance of a continuous random variable are analogous to E(X) and Var(X ) for a discrete random variable. Here the integral sign replaces the summation sign. Calculus is required to compute the integrals.

Continuous Distributions

Page 8: Week  5  Sep  29  – Oct  3

7-8

Characteristics of the Normal Distribution

• Normal or Gaussian (or bell-shaped) distribution was named for German mathematician Karl Gauss (1777 – 1855).

• Defined by two parameters, µ and .• Denoted N(µ, ).• Domain is – < X < + (continuous scale).• Almost all (99.7%) of the area under the normal curve is included in the range µ

– 3 < X < µ + 3.• Symmetric and unimodal about the mean.

Ch

apter 7

Normal Distribution

Page 9: Week  5  Sep  29  – Oct  3

7-9

Characteristics of the Normal Distribution

Ch

apter 7

Normal Distribution

Page 10: Week  5  Sep  29  – Oct  3

7-10

• Normal PDF f(x) reaches a maximum at µ and has points of inflection at µ ±

Bell-shaped curve

Note: All normal distributions have the same shape but differ in the axis scales.

Ch

apter 7 Characteristics of the Normal Distribution

Normal Distribution

• Excel function for PDF (height of the function at x) is =NORM.DIST(x, µ, , 0)

0 for PDF, 1 for CDF

Page 11: Week  5  Sep  29  – Oct  3

7-11

• Normal CDF has a “lazy-S” shape

Ch

apter 7 Characteristics of the Normal Distribution

Normal Distribution

• Excel function for CDF (area to left of x) is =NORM.DIST(x, µ, , 1)

0 for PDF, 1 for CDF

Page 12: Week  5  Sep  29  – Oct  3

7-12

Characteristics of the Standard Normal Distribution

Ch

apter 7

Standard Normal Distribution

• Since for every value of µ and , there is a different normal distribution, we transform a normal random variable to a standard normal distribution with µ = 0 and = 1 using the formula z = (x - µ)/.

Page 13: Week  5  Sep  29  – Oct  3

7-13

Characteristics of the Standard Normal• Standard normal PDF f(x) reaches a maximum at z = 0 and has points

of inflection at +1.

• Shape is unaffected by the transformation. It is still a bell-shaped curve.

• Standard normal tables or Excel functions can be used to find the desired probabilities.

Figure 7.11

Ch

apter 7

Standard Normal Distribution

Excel function for CDF (area to left of z) is =NORM.DIST(z, 1)

Page 14: Week  5  Sep  29  – Oct  3

7-14

Characteristics of the Standard Normal• Standard normal CDF

Ch

apter 7

• A common scale from 3 to +3 is used.

• Entire area under the curve is unity.

• The probability of an event P(z1 < Z < z2) is a definite integral of f(z).

• However, standard normal tables or Excel functions can be used to find the desired probabilities.

Standard Normal Distribution

Page 15: Week  5  Sep  29  – Oct  3

7-15

Normal Areas from Appendix C-1• Appendix C-1 allows you to find the area under the curve from 0 to

z.• For example, find P(0 < Z < 1.96):

Ch

apter 7

Standard Normal Distribution

Page 16: Week  5  Sep  29  – Oct  3

7-16

Normal Areas from Appendix C-1• Now find P(1.96 < Z < 1.96).• Due to symmetry, P(1.96 < Z) is the same as P(Z < 1.96).

• So, P(1.96 < Z < 1.96) = .4750 + .4750 = .9500, or 95% of the area under the curve.

Ch

apter 7

Standard Normal Distribution

Page 17: Week  5  Sep  29  – Oct  3

7-17

Basis for the Empirical Rule

• Approximately 68% of the area under the curve is between + 1 • Approximately 95% of the area under the curve is between + 2 • Approximately 99.7% of the area under the curve is between + 3

Ch

apter 7

Standard Normal Distribution

Page 18: Week  5  Sep  29  – Oct  3

7-18

Normal Areas from Appendix C-2• Appendix C-2 allows you to find the area under the curve from the left of z

(similar to Excel).

• This table is the CDF (not the PDF). For example,

P(Z < 1.96)P(Z < 1.96) P(1.96 < Z < 1.96)

Ch

apter 7

Standard Normal Distribution

=NORM.S.DIST(1.96,1) =NORM.S.DIST(-1.96,1) =NORM.S.DIST(1.96,1)- NORM.S.DIST(-1.96,1)

Page 19: Week  5  Sep  29  – Oct  3

7-19

Normal Areas from Appendices C-1 and C-2• Appendices C-1 and C-2 yield identical results.• Use whichever table is easiest.

Finding z for a Given Area• Appendices C-1 and C-2 can be used to

find the z-value corresponding to a given probability.

• For example, what z-value defines the top 1% of a normal distribution?

• This implies that 49% of the area lies between 0 and z, which gives z = 2.33 by looking for an area of 0.4900 in Appendix C-1.

Ch

apter 7

Standard Normal Distribution

Page 20: Week  5  Sep  29  – Oct  3

7-20

Finding Areas Using Standardized Variables

• John’s score is 1.57 standard deviations above the mean.

Ch

apter 7

• P(X < 86) = P(Z < 1.57) = .9418 (from Appendix C-2)

• John is approximately in the 94th percentile.

• John took an economics exam and scored 86 points. The class mean was 75 with a standard deviation of 7. What percentile is John in? That is, what is P(X < 86) where X represents the exam scores?

Standard Normal Distribution

Appendix C-2: Cumulative Standard Normal Distribution (continued)

This table shows the normal area less than z .

z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09

0.0 0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 0.5279 0.5319 0.53590.1 0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 0.5675 0.5714 0.57530.2 0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 0.6064 0.6103 0.61410.3 0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 0.6443 0.6480 0.65170.4 0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 0.6808 0.6844 0.68790.5 0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 0.7157 0.7190 0.72240.6 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.75490.7 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.78520.8 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.81330.9 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.83891.0 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.86211.1 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.88301.2 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.90151.3 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.91771.4 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.93191.5 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.94411.6 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.95451.7 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.96331.8 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.97061.9 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.97672.0 0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 0.9803 0.9808 0.9812 0.9817

Page 21: Week  5  Sep  29  – Oct  3

7-21

• Finding Areas by Using Standardized VariablesYou can use Excel, Minitab, TI83/84, etc. to compute these probabilities directly. The Excel functions are shown:

Ch

apter 7

Standard Normal Distribution

Without standardizing:=NORM.DIST(x, µ, , 1)=NORM.DIST(86, 75, 7, 1)=.9420

With standardizing:=NORM.S.DIST(z, 1)=NORM.S.DIST(1.57, 1)=.9418

Slight difference is due to rounding z to 1.57

Page 22: Week  5  Sep  29  – Oct  3

7-22

Inverse Normal• How can we find the various normal percentiles (5th, 10th, 25th, 75th, 90th, 95th, etc.) known as the inverse normal? That is, how can we find X for a given area?

Ch

apter 7

Solving for x in z = (x − μ)/ gives x = μ + zσ

Inverse Normal ML 5.2

• We simply turn the standardizing transformation around:

Page 23: Week  5  Sep  29  – Oct  3

7-23

Inverse Normal: Excel

Ch

apter 7

Inverse Normal Distribution

Finding x: Finding z:

Page 24: Week  5  Sep  29  – Oct  3

7-24

Inverse Normal: Example

Ch

apter 7

• John’s economics professor decides that any student who scores below the 10th percentile must retake the exam.

• The exam scores are normal with μ = 75 and σ = 7.

• What is the score that would require a student to retake the exam?

• We need to find the value of x that satisfies P(X < x) = .10.

• The z-score for with the 10th percentile is z = −1.28.

Inverse Normal Distribution

Page 25: Week  5  Sep  29  – Oct  3

7-25

Inverse Normal

Ch

apter 7

The logical steps to solve the problem are:• Use Appendix C to find z = −1.28 to satisfy P(Z < −1.28) = .10.• Substitute z = −1.28 into z = (x − μ)/σ to get −1.28 = (x − 75)/7• Solve for x to get x = 75 − (1.28)(7) = 66.03 (or 66 after rounding)• Students who score below 66 points on the economics exam will be

required to retake the exam.

Inverse Normal Distribution

or use Excel to solve in one step:=NORM.INV(0.1,75,7) = 66.03

or use Excel to obtain z:=NORM.S.INV(0.1) = 1.282

Page 26: Week  5  Sep  29  – Oct  3

7-26

Normal Approximation to the Binomial• Binomial probabilities are difficult to calculate when n is large.• Use a normal approximation to the binomial distribution.• As n becomes large, the binomial bars become smaller and the PDF approaches a

continuous distribution.

Ch

apter 7

Normal Approximations

Page 27: Week  5  Sep  29  – Oct  3

7-27

Normal Approximation to the Binomial

• Rule of thumb: when n ≥ 10 and n(1 ) ≥ 10, then it is appropriate to use the normal approximation to the binomial distribution.

• Set the mean and standard deviation for the binomial distribution equal to the normal µ and , respectively.

Ch

apter 7

Normal Approximations

Page 28: Week  5  Sep  29  – Oct  3

7-28

Example: Coin Flips

• Yes, because:

n = 32 x .50 = 16 (at least 10 “successes”) n(1 ) = 32 x (1 .50) = 16 (at least 10 “failures”)

Ch

apter 7

• When translating a discrete scale into a continuous scale, care must be taken about individual points.

• For example, find the probability of more than 17 heads in 32 flips of a fair coin. This can be written as P(X 18).

• However, “more than 17” actually falls between 17 and 18 on a discrete scale.

Normal Approximations

• If we flip a coin n = 32 times and = .50, are the requirements for a normal approximation to the binomial distribution met?

Page 29: Week  5  Sep  29  – Oct  3

7-29

Example: Coin Flips• Since the cutoff point for “more than 17” is halfway between 17 and 18, we

add 0.5 to the lower limit and find P(X > 17.5).• This addition to X is called the Continuity Correction.• At this point, the problem can be completed as any normal distribution

problem.

Ch

apter 7

Normal Approximations

Page 30: Week  5  Sep  29  – Oct  3

7-30

Ch

apter 7

Example: Coin Flips

P(X > 17) = P(X ≥ 18) P(X ≥ 17.5) = P(Z > 0.53) = 0.2981

Normal Approximations

Page 31: Week  5  Sep  29  – Oct  3

7-31

Normal Approximation to the Poisson• The normal approximation to the Poisson distribution works best

when is large (e.g., when exceeds the values in Appendix B).• Set the normal µ and equal to the mean and standard deviation

for the Poisson distribution.

Ch

apter 7

Example: Utility Bills• On Wednesday between 10 a.m. and noon customer billing inquiries

arrive at a mean rate of 42 inquiries per hour at Consumers Energy. What is the probability of receiving more than 50 calls in an hour?

• = 42, which is too big to use the Poisson table.• Use the normal approximation with = 42 and = 6.48074.

Normal Approximations

Page 32: Week  5  Sep  29  – Oct  3

7-32

Example: Utility Bills• To find P(X > 50) calls, use the continuity-corrected cutoff point halfway

between 50 and 51 (i.e., X = 50.5).• At this point, the problem can be completed as any normal distribution

problem.

Ch

apter 7

Normal Approximations

Page 33: Week  5  Sep  29  – Oct  3

7-33

Bottom Line:

Ch

apter 7

Normal Approximations

• With Excel, we do not need these approximations for calculations.

• They are still useful when Excel is not available.

• They are taught to show the logical connection between discrete and continuous distributions.

Page 34: Week  5  Sep  29  – Oct  3

7-34

Characteristics of the Exponential Distribution• If events per unit of time follow a Poisson distribution (e.g., customer arrivals),

the waiting time until the next event (e.g., customer arrival) follows the exponential distribution.

• The time until the next event is a continuous variable.

Ch

apter 7

Exponential Distribution ML 5.3

Note: We seek tail probabilities such as P(X x) or P(X ≤ x).

Page 35: Week  5  Sep  29  – Oct  3

7-35

Characteristics of the Exponential Distribution

Probability of waiting more than x

Probability of waiting less than or equal to x

Ch

apter 7

Exponential Distribution

Note: A point has no area so P(X ≤ x) is the same as P( X < x) and similarly P(X > x) is the same as P( X x).

Page 36: Week  5  Sep  29  – Oct  3

7-36

Example: Customer Waiting Time

• Between 2 p.m. and 4 p.m. on Wednesday, patient insurance inquiries arrive at Blue Choice insurance at a mean rate of 2.2 calls per minute.

• What is the probability of waiting more than 30 seconds (i.e., 0.50 minutes) for the next call?

• Set = 2.2 events/min and x = 0.50 min

• P(X > 0.50) = e–x = e–(2.2)(0.5) = .3329 or a 33.29% chance of waiting more than 30 seconds for the next call.

Ch

apter 7

Exponential Distribution

Page 37: Week  5  Sep  29  – Oct  3

7-37

Example: Customer Waiting Time

P(X > 0.50) = e–x = e–(2.2)(0.5) = .3329 P(X ≤ 0.50) = 1-.3329 = .6671

Ch

apter 7

Exponential Distribution

Given λ = 2.2 inquiries per minute, what is the probability of waiting more than 30 seconds (i.e., 0.50 minutes) for the next call?

Page 38: Week  5  Sep  29  – Oct  3

7-38

• If the mean arrival rate is 2.2 calls per minute, what is the 90th percentile for waiting time (the top 10% of waiting time)?

• Find the x-value that defines the upper 10%.

Ch

apter 7

Inverse Exponential Distribution Inverse Exponential

Page 39: Week  5  Sep  29  – Oct  3

7-39

Inverse Exponential

Ch

apter 7

Inverse Exponential Distribution

If the mean arrival rate is 2.2 calls per minute, what is the 90th percentile for waiting time (the top 10% of waiting time)? Find the x-value that defines the upper 10%.

Page 40: Week  5  Sep  29  – Oct  3

7-40

Ch

apter 7 Mean Time Between Events

Exponential Distribution

Page 41: Week  5  Sep  29  – Oct  3

7-41

Ch

apter 7 Bottom Line:

Exponential Distribution

You may encounter the exponential model in any situation that involves customer arrivals, waiting lines, and queueing (e.g., retail business, call center, concert, theme park, bank, grocery store, airline check-in, traffic planning).

Such applications are not rare in our crowded world.

Study simulation (Chapter 18) to learn more about how such situations can be modeled to plan facility capacity, predict waiting times, and study system throughput.

Page 42: Week  5  Sep  29  – Oct  3

7-42

Characteristics of the Triangular Distribution

Ch

apter 7

Other Continuous Distributions ML 5.4

Page 43: Week  5  Sep  29  – Oct  3

7-43

Characteristics of the Triangular Distribution

Ch

apter 7

• The triangular distribution is a way of thinking about variation that corresponds rather well to what-if analysis in business.

• It is not surprising that business analysts are attracted to the triangular model.

• Its finite range and simple form are more understandable than a normal distribution.

Other Continuous Distributions

Page 44: Week  5  Sep  29  – Oct  3

7-44

Characteristics of the Triangular Distribution

Ch

apter 7

• It is more versatile than a normal because it can be skewed in either direction.

• Yet it has some of the nice properties of a normal, such as a distinct mode.

• The triangular model is especially handy for what-if analysis when the business case depends on predicting a stochastic variable (e.g., the price of a raw material, an interest rate, a sales volume).

• If the analyst can anticipate the range (a to c) and most likely value (b), it will be possible to calculate probabilities of various outcomes.

• Many times, distributions will be skewed, so a normal wouldn’t be much help.

Other Continuous Distributions

Page 45: Week  5  Sep  29  – Oct  3

7-45

Triangular Distribution: Example T(15, 20, 30)

Ch

apter 7

Other Continuous Distributions

Page 46: Week  5  Sep  29  – Oct  3

7-46

Triangular Distribution: Example T(15, 20, 30)

Ch

apter 7

Other Continuous Distributions

Page 47: Week  5  Sep  29  – Oct  3

7-47

Characteristics of the Uniform Distribution

If X is a random variable that is uniformly distributed between a and b, its PDF has constant height.

• Denoted U(a, b)• Area =

base x height =(b a) x 1/(b a) = 1

Ch

apter 7

Uniform Continuous Distribution

Page 48: Week  5  Sep  29  – Oct  3

7-48

Characteristics of the Uniform Distribution

Ch

apter 7

Uniform Continuous Distribution

Page 49: Week  5  Sep  29  – Oct  3

7-49

Example: Anesthesia Effectiveness

• An oral surgeon injects a painkiller prior to extracting a tooth. Given the varying characteristics of patients, the dentist views the time for anesthesia effectiveness as a uniform random variable that takes between 15 minutes and 30 minutes.

• X is U(15, 30)

• a = 15, b = 30, find the mean and standard deviation.

Ch

apter 7

• Find the probability that the effectiveness of the anaesthetic takes between 20 and 25 minutes.

Uniform Continuous Distribution

Page 50: Week  5  Sep  29  – Oct  3

7-50

P(20 < X < 25) = (25 – 20)/(30 – 15) = 5/15 = 0.3333 = 33.33%

Ch

apter 7 Example: Anesthesia Effectiveness

Uniform Continuous Distribution

Page 51: Week  5  Sep  29  – Oct  3

7-51

• Can be a conservative “what-if” baseline model.

• Excel’s =RAND() function follows this model:

μ = (a + b)/2 = (0 + 1)/2 = .5000

σ = [(b - a)2/12]1/2 = [(1 - 0)2/12]1/2 = [1/12]1/2 = .2887

Ch

apter 7Uses of Uniform Distribution

Uniform Continuous Distribution

Try it yourself! Calculate a bunch of =RAND() values in Excel, and look at the mean and standard deviation. They should be close to the above predictions (if sample is large).

Mean = 0.494637 St Dev = 0.2718940.84328 0.68397 0.69134 0.09071 0.72185 0.73706 0.786450.33170 0.56953 0.04807 0.65731 0.34992 0.79984 0.971430.45351 0.70129 0.15553 0.36416 0.33627 0.71570 0.806460.53490 0.96473 0.62752 0.78566 0.82808 0.34901 0.142200.46443 0.98558 0.25002 0.05013 0.61517 0.09537 0.500000.43802 0.37406 0.08978 0.29142 0.47772 0.25935 0.365040.00549 0.32222 0.63328 0.28581 0.27208 0.81790 0.59686

Page 52: Week  5  Sep  29  – Oct  3

7-52

Comparison of Models

Ch

apter 7

• The normal distribution is the used most often.• The exponential is useful in modeling waiting lines (queues).• The triangular distribution is a way of thinking about variation that

corresponds well to what-if analysis in business.• The uniform distribution is a useful baseline model or for random

sampling (randomizing a list).

Continuous Distributions