OAD30763 Statistics in Business and Economics

OAD30763Statistics in Business and Economics

Week 5Dr. Jenne Meyer

Article Reviews

Article review

Hypothesis Testing

5-Step Hypothesis Testing Procedure

Step 1: Set up the null and alternative hypotheses.Step 2: Pick the level of significance (value of )

and find the rejection region.Step 3: Calculate the test statistics.Step 4: Decide whether or not to reject the null

hypothesis.Step 5: Interpret the statistical decision in terms of

the stated problem.

Two-tail testH0: = 750HA: 750

Lower-tail test Upper-tail testH0: 700 H0: 800HA: < 700 HA: >800

Hypothesis Testing

Review

One-tailed Two-tailed

/2 /2

The Rejection Region is the range of values of the test statistics that will lead you to reject the null hypothesis.

Review

For a large sample:

n

XZ

For a small sample:

ns

Xt

**At least 30 units

Apply hypothesis testing to different populations and samples in business research situations

Test of single population, small sample size

Test of a single proportion

Test of two populations, large sample size

Test of two populations, small sample size

Test for difference in two population proportions

The 5-Step Hypothesis Testing Procedure is the same for all

these processes.

Hypothesis Testing

Test for Single Population, Small n

For a small sample: Small sample and unknown σ Calculations are identical to those for

z Becomes identical to z for n > 30 Uses degrees of freedom: df = n - 1

ns

Xt

Review t-table Common Values of a and df and the Corresponding t-Values

Upper Tail Area

Degrees of Freedom (df) 0.25 0.10 0.05 0.01 0.005

20 1.185 1.725 2.086 2.845 3.15321 1.183 1.721 2.080 2.831 3.13522 1.182 1.717 2.074 2.819 3.11923 1.180 1.714 2.069 2.807 3.10424 1.179 1.711 2.064 2.797 3.09125 1.178 1.708 2.060 2.787 3.07826 1.177 1.706 2.056 2.779 3.067

1000000 1.150 1.645 1.960 2.576 2.807


ExampleA State Highway Patrol periodically samples vehicle

speeds at various locations on a particular roadway. The sample of vehicle speeds is used to test the hypothesis

H0: m < 65mph

The locations where H0 is rejected (average speed exceeds 65mph) are the best for radar traps.At Location X, a sample of 16 vehicles shows a mean speed of 68.2 mph with a standard deviation of 3.8 mph. Use an level of significance=.05 to test the hypothesis.


Example, cont.

a=.05

d.f.=16-1=15, ta = 1.753

n = 16

= 68.2 mph

s = 3.8 mph

x

37.316/8.3

652.68

/0

ns

xt

Rejection Region

t 1.753

Since 3.37 > 1.753, we reject H0.

Conclusion: We are 95% confident that the mean speed of vehicles at Location X is greater than 65 mph. Location X is a good candidate for a radar trap.

H0: m < 65mphHA: m > 65mph


Use if concerned with a proportion of the population, p, that have a particular characteristic

Can be used with nominal data Use the same 5-Step Hypothesis Testing Procedures Test Statistic calculated

Z = p -

(1- ) / n√

Test for Single Proportion

Test for Single Proportion Example For a Christmas and New Year’s

week, the National Safety Council estimated that 500 people would be killed and 25,000 injured on the nation’s roads. The NSC claimed that 50% of the accidents would be caused by drunk driving.

A sample of 120 accidents showed that 67 were caused by drunk driving. Use these data to test the NSC’s claim with a = 0.05.

Example, cont.H0: p = .5

HA: p .5

(67/120) - .5

= 1.278

.5(1 - .5)

120

a=.05

Rejection Region

Rejection Region

p = 67/120

n = 120

Z = Since –1.96 < 1.278 < 1.96, we do not reject H0.

Conclusion: There is insufficient evidence to suggest that the population proportion of accidents caused by drunk driving is different from 50%

Test for Single Proportion

Example, cont.

Rejection Region

Z 1.645

250

)1036.1(1036.

300

)1036.1(1036.

0880.1167.

-+

-

-=

Since 1.099 < 1.645, we do not reject H0.

Conclusion: There is insufficient evidence to suggest that there is an difference between the proportion of unmarried workers missing more than 5 days of work than the proportion of married ones

= 1.099

21

21

)1()1(

n

pp

n

pp

ppz

cccc

Test of Two Population Proportions

Often we are interested in comparing two different, independent populations

Sample 1

Population 1Population 2

Sample 2

Figure 13.1 Two Populations and Two Samples

Test of Two Populations

When comparing two different, independent populations the Null Hypothesis takes on the form:

H0: s- p = 0 H0: s = p

H0: s ≤ p

Test of Two Populations

When comparing two different, independent populations the with large n, the test statistic looks like

2

22

1

21

21 )(

nn

XXz

If population std. dev. are unknown, use s1 and s2 instead of σ’s

Test of Two Populations, large n

Example

A study was conducted to compare the mean years of service for those retiring in 1979 with those retiring last year at Acme Manufacturing Co. At the .01 significance level can we conclude that the workers retiring last year gave more service based on the following sample data? Note: Let pop #1= “last year”

Population #1 Population #2

"Last Year" 1979

Sample Mean 30.4 25.6

Sample Standard Deviation 3.6 2.9

Sample Size 45 40


Example, cont.

H0: LY < 1979

HA: LY > 1979

Rejection Region

Z 2.326

a=.01


Example, cont.

Population #1 Population #2

"Last Year" 1979

Sample Mean 30.4 25.6

Sample Standard Deviation 3.6 2.9

Sample Size 45 40

Z 2.326

z

30 4 25 6

3 645

2 940

6 802 2

. .

. ..

Since 6.80 > 2.326, we reject H0.

Conclusion: There is sufficient evidence at the 99% confidence level to suggest that the mean years of service of those retiring last year is greater than the mean years of service of those retiring in 1979.


When comparing two different, independent populations with unknown variances that are assumed equal) with small n, the test statistic looks like

sn s n s

n np2 1 1

22 2

2

1 2

1 1

2

( ) ( )

(Pooled Sample Variance)

df = n1 + n2 – 2t( X X

sn np

=-

+æ

èç

ö

ø÷

1 2

2

1 2

1 1

- D)

Test of Two Populations, small n

Example

To determine whether there is a difference in the time involved in using two versions of software, the new version of the software is compared to the original. Samples are taken from two independent groups using the software (data below). At the .01 significance level, is there a difference in the mean amount of time required to use two versions of software?

Version 1 Version 2

6 58 96 89 7

10 76


Example, cont.Version 1 Version 2

6 58 96 89 7

10 76

S12 = variance 1 S2

2 = variance 2

3.2 2.0

X1 bar = mean 1 X2 bar = mean 2

7.8 7.0

H0 : 1- 2 = 0

HA : 1- 2 = 0/

a =.01 Because we have a two tailed test, there is a/2 = .005 in each tail

df = n1 + n2 – 2 = 5 + 6 – 2 = 9

From t-table, critical cutoffs for two-tail, alpha/2=.005, df=9 is 3.25


Example, cont.

53.2265

)0.2)(5()2.3)(4(

2

))(1())(1(

21

222

2112

nn

snsnsp

83.963.

80.

)6/15/1(53.2

0.78.7 ==+

-=

)/1/1(21

22

1+

-=

nnS

XXt

p

Since .83 < 3.25, we do not reject H0.

Conclusion: There is insufficient evidence to suggest that there is a difference between the mean time to use the two versions of software


When comparing two different population proportions, the Null Hypothesis takes on the form:

The test statistic looks like:

H0: p1- p2 = 0 H0: p1 = p2

21

21

)1()1(

n

pp

n

pp

ppz

cccc

where pTotal number of successes

Total number in samples

X X

n nc

1 2

1 2

= (the weighted mean of the two sample proportions)



Are unmarried workers more likely to be absent from work than married workers? A sample of 250 married workers showed 22 missed more than 5 days last year, while a sample of 300 unmarried workers showed 35 missed more than five days. Use a .05 significance level. Note: let pop #1= unmarried workers.

Example, cont.

H0 : pu = pm

HA : pu > pm

a = .05

Rejection Region

Z 1.645

pu = Unmarried Workers = X1/n1 = 35/300 = .1167

pm = Married Workers = X2/n2 = 22/250 = .0880

1036.250300

2235=

+

+=cp


Assignments

Chapter 9: problems 10, 15, 17, 18, 20, 40

Chapter 10: problems 12, 14 (two sample), 32, 33 (proportions)

OAD30763 Statistics in Business and Economics

Documents

Transcript of OAD30763 Statistics in Business and Economics