MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1....

21
MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1

Transcript of MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1....

Page 1: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

MATH& 146

Lesson 23

Section 3.2

Difference of Two Proportions

1

Page 2: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Difference of Proportions

Consider the following questions:

• Are college graduates more likely than non-

college graduates to support a particular

candidate?

• Is there a difference between the heart attack

rate of those who took an experimental drug and

the rate of those who received the placebo?

2

Page 3: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

The Hypotheses

When the conditions are met, we are ready to test

whether the difference of two proportions is

significantly different from zero. We test the

hypotheses

H0: p1 – p2 = 0

HA: p1 – p2 ≠ 0 (or >, or <)

The differences are almost always compared to

zero, but they can be compared to any number.

3

Page 4: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Point Estimate

To answer questions like these, we would like to

make conclusions about the difference in two

population proportions (p1 – p2) using the normal

model.

To start with, we first identify a reasonable point

estimate of p1 – p2 based on the sample. You may

have already guessed its form:

4

1 2ˆ ˆp p Point estimate

for p1 – p2.

Page 5: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Conditions

To use this test, we need to verify that the point

estimate follows the normal model by checking

certain conditions:

• Each proportion separately follows a normal

model (all sample observations are independent

and there are at least 10 successes and 10

failures).

• The two samples are independent of each other.

5

Page 6: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Standard Error

Finally, we compute the estimate's standard error

and apply our inferential framework.

where p1 and p2 represent the population

proportions, and n1 and n2 represent the sample

sizes.

6

1 2 1 2

1 1 2 22 2

ˆ ˆ ˆ ˆ

1 2

1 1p p p p

p p p pSE SE SE

n n

Page 7: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 1

Compute the standard error if

a)

b)

7

1 1 2 20.55, 50, 0.6, 75p n p n

1 1 2 20.95, 300, 0.80, 120p n p n

Page 8: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Question Wording

The way a question is phrased can influence a

person's response. For example, Pew Research

Center conducted a survey with the following question:

As you may know, by 2014 nearly all Americans will

be required to have health insurance. [People who

do not buy insurance will pay a penalty] while

[People who cannot afford it will receive financial

help from the government]. Do you approve or

disapprove of this policy?

8

Page 9: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Question Wording

For each randomly sampled respondent, the

statements in brackets were randomized: either

they were kept in the order given above, or the two

statements were reversed. The table below shows

the results of this experiment.

9

Sample

size (ni)

Approve

law (%)

Disapprove

law (%)

Other

"people who cannot afford it will

receive financial help from the

government" is given second.

771 47 49 3

"people who do not buy it will

pay a penalty" is given second.732 34 63 3

Page 10: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 2

Create and interpret a 90% confidence interval of

the difference in approval.

10

Sample

size (ni)

Approve

law (%)

Disapprove

law (%)

Other

"people who cannot afford it will

receive financial help from the

government" is given second.

771 47 49 3

"people who do not buy it will

pay a penalty" is given second.732 34 63 3

Page 11: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Wheel Gears

A remote control car company is considering a

new manufacturer for wheel gears. The new

manufacturer would be more expensive but their

higher quality gears are more reliable, resulting in

happier customers and fewer warranty claims.

However, management must be convinced that the

more expensive gears are worth the conversion

before they approve the switch.

11

Page 12: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 3

If there is strong evidence of a more than 3%

improvement in the percent of gears that pass

inspection, management says they will switch

suppliers, otherwise they will maintain the current

supplier.

Set up appropriate one-sided hypotheses for the

test.

12

Page 13: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 4

The quality control engineer collects a sample of

gears, examining 1000 gears from each company

and finds that 899 gears pass inspection from the

current supplier and 958 pass inspection from the

prospective supplier.

Using these data, evaluate the hypothesis setup of

the last example using a significance level of 10%.

13

Page 14: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Cancer Risk in Dogs

Let us consider a new example to examine a

special estimate of standard error when

H0: p1 = p2.

We investigate whether there is an increased risk

of cancer in dogs that are exposed to the herbicide

2,4-dichlorophenoxyacetic acid (2,4-D).

14

Page 15: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Cancer Risk in Dogs

A study in 1994 examined 491 dogs that had

developed cancer and 945 dogs as a control

group. Of these two groups, researchers identified

which dogs had been exposed to 2,4-D in their

owner's yard. The results are shown below.

15

cancer no cancer total

2,4-D 191 304 495

no 2,4-D 300 641 941

total 491 945 1436

Page 16: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 5

a) Is this study an experiment or an observational

study?

b) Are the conditions met to use the normal model

and make inference on the results?

16

cancer no cancer total

2,4-D 191 304 495

no 2,4-D 300 641 941

total 491 945 1436

Page 17: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 6

Set up hypotheses to test whether 2,4-D and the

occurrence of cancer in dogs are related. Use a

two-sided test and compare across the cancer and

no cancer groups.

(Note: Don't compute the p-value yet.)

17

cancer no cancer total

2,4-D 191 304 495

no 2,4-D 300 641 941

total 491 945 1436

Page 18: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Pooled Estimate of a

Proportion

In a hypothesis test, the distribution of the test

statistic is always examined as though the null

hypothesis is true, i.e. in this case, pc = pn. The

standard error formula should reflect this equality

in the null hypothesis. We will use p to represent

the common rate of dogs that are exposed to

2,4-D in the two groups:

18

1 1

c n

p p p pSE

n n

Page 19: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Pooled Estimate of a

Proportion

We don't know the exposure rate, p, but we can

obtain a good estimate of it by pooling the results

of both samples:

This is called the pooled estimate of the sample

proportion, and we use it to compare the standard

error when the null hypothesis is that p1 = p2.

19

# of "successes" 191 304ˆ 0.345

# of cases 191 300 304 641p

Page 20: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Pooled Estimate of a

Proportion

When the null hypothesis suggests the proportions

are equal, we used the pooled proportion estimate

( ) to verify the success-failure condition and also

to estimate the standard error:

20

1 2

ˆ ˆ ˆ ˆ1 1p p p pSE

n n

Page 21: MATH& 146 Lesson 23 - Amazon S3 · MATH& 146 Lesson 23 Section 3.2 Difference of Two Proportions 1. Difference of Proportions ... whether the difference of two proportions is significantly

Example 7

Using verify the

estimate for the standard error is SE = 0.026.

Next compute the hypothesis test using a

significance level of 0.05. Be certain to draw a

picture, compute the p-value, and state your

conclusion in both statistical language and plain

language.

21

1 2ˆ 0.345, 491, and 945,p n n