Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12....
Transcript of Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12....
![Page 1: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/1.jpg)
Chapter 10: Inferences based on two samples
MATH 450
November 28th, 2017
MATH 450 Chapter 10: Inferences based on two samples
![Page 2: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/2.jpg)
Overview
Week 1 · · · · · ·• Chapter 1: Descriptive statistics
Week 2 · · · · · ·• Chapter 6: Statistics and SamplingDistributions
Week 4 · · · · · ·• Chapter 7: Point Estimation
Week 7 · · · · · ·• Chapter 8: Confidence Intervals
Week 10 · · · · · ·• Chapter 9: Tests of Hypotheses
Week 12 · · · · · ·• Chapter 10: Two-sample inference
MATH 450 Chapter 10: Inferences based on two samples
![Page 3: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/3.jpg)
Overview
10.1 Difference between two population means
10.2 The two-sample t test and confidence interval
10.3 Analysis of paired data
MATH 450 Chapter 10: Inferences based on two samples
![Page 4: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/4.jpg)
Two-sample inference
Example
Let µ1 and µ2 denote true average decrease in cholesterol for twodrugs. From two independent samples X1,X2, . . . ,Xm andY1,Y2, . . . ,Yn, we want to test:
H0 : µ1 = µ2
Ha : µ1 6= µ2
MATH 450 Chapter 10: Inferences based on two samples
![Page 5: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/5.jpg)
Settings
Independent samples1 X1,X2, . . . ,Xm is a random sample from a population with
mean µ1 and variance σ21 .
2 Y1,Y2, . . . ,Yn is a random sample from a population withmean µ2 and variance σ2
2 .3 The X and Y samples are independent of each other.
Today: paired samples1 There is only one set of n individuals or experimental objects2 Two observations are made on each individual or object
MATH 450 Chapter 10: Inferences based on two samples
![Page 6: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/6.jpg)
Independent samples
1 X1,X2, . . . ,Xm is a random sample from a population withmean µ1 and variance σ2
1.
2 Y1,Y2, . . . ,Yn is a random sample from a population withmean µ2 and variance σ2
2.
3 The X and Y samples are independent of each other.
MATH 450 Chapter 10: Inferences based on two samples
![Page 7: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/7.jpg)
Testing the difference between two population means
Setting: independent normal random samples X1,X2, . . . ,Xm
and Y1,Y2, . . . ,Yn with known values of σ1 and σ2. Constant∆0.
Null hypothesis:H0 : µ1 − µ2 = ∆0
Alternative hypothesis:
(a) Ha : µ1 − µ2 > ∆0
(b) Ha : µ1 − µ2 < ∆0
(c) Ha : µ1 − µ2 6= ∆0
When ∆ = 0, the test (c) becomes
H0 : µ1 = µ2
Ha : µ1 6= µ2
MATH 450 Chapter 10: Inferences based on two samples
![Page 8: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/8.jpg)
z-test
Proposition
MATH 450 Chapter 10: Inferences based on two samples
![Page 9: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/9.jpg)
2-sample t test: degree of freedom
MATH 450 Chapter 10: Inferences based on two samples
![Page 10: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/10.jpg)
2-sample t procedures
MATH 450 Chapter 10: Inferences based on two samples
![Page 11: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/11.jpg)
Paired samples
1 There is only one set of n individuals or experimental objects
2 Two observations are made on each individual or object
MATH 450 Chapter 10: Inferences based on two samples
![Page 12: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/12.jpg)
Settings
Assumption
1 The data consists of n independently selected pairs(X1,Y1), (X2,Y2), . . . , (Xn,Yn) with E (Xi ) = µ1 andE (Yi ) = µ2.
2 Let
D1 = X1 − Y1, D2 = X2 − Y2, . . . , Dn = Xn − Yn,
so the Di ’s are the differences within pairs.
3 The Di ’s are assumed to be normally distributed with meanvalue µD and variance σ2
D .
MATH 450 Chapter 10: Inferences based on two samples
![Page 13: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/13.jpg)
From the last lecture (independent samples)
Problem
Assume that
X1,X2, . . . ,Xm is a random sample from a population withmean µ1 and variance σ2
1.
Y1,Y2, . . . ,Yn is a random sample from a population withmean µ2 and variance σ2
2.
The X and Y samples are independent of each other.
Compute (in terms of µ1, µ2, σ1, σ2,m, n)
(a) E [X − Y ]
(b) Var [X − Y ] and σX−Y
MATH 450 Chapter 10: Inferences based on two samples
![Page 14: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/14.jpg)
Notes
In the independent case, we construct the statistics by lookingat the distribution of
X − Y
which have
E [X−Y ] = µ1−µ2, Var [X−Y ] = Var(X )+Var(Y ) =σ2
1
m+σ2
2
n
With paired data, the X and Y observations within each pairare not independent, so X and Y are not independent of eachother → the computation of the variance is in valid → couldnot use the old formulas
MATH 450 Chapter 10: Inferences based on two samples
![Page 15: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/15.jpg)
The paired t-test
Because different pairs are independent, the Di ’s areindependent of each other
We also have
E [D] = E [X − Y ] = E [X ]− E [Y ] = µ1 − µ2 = µD
Testing about µ1 − µ2 is just the same as testing about µDIdea: to test hypotheses about µ1 − µ2 when data is paired:
1 form the differences D1,D2, . . . ,Dn
2 carry out a one-sample t-test (based on n − 1 df) on thedifferences.
MATH 450 Chapter 10: Inferences based on two samples
![Page 16: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/16.jpg)
The paired t-test
MATH 450 Chapter 10: Inferences based on two samples
![Page 17: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/17.jpg)
Confidence intervals
A t confidence interval for for µD = µ1 − µ2 can beconstructed based on the fact that
T =D − µDSD/√n
follows the t distribution with degree of freedom n − 1.
The paired t CI for µD is
d ± tα/2,n−1sD√n
A one-sided confidence bound results from retaining therelevant sign and replacing tα/2,n−1 by tα,n−1.
MATH 450 Chapter 10: Inferences based on two samples
![Page 18: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/18.jpg)
Example
Example
Consider two scenarios:
A. Insulin rate is measured on 30 patients before and after amedical treatment.
B. Insulin rate is measured on 30 patients receiving a placeboand 30 other patients receiving a medical treatment.
What type of test should be used in each cased: paired orunpaired?
MATH 450 Chapter 10: Inferences based on two samples
![Page 19: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/19.jpg)
Example
Example
Suppose we have a new synthetic material for making soles forshoes. We’d like to compare the new material with leather – usingsome energetic kids who are willing to wear test shoes and returnthem after a time for our study. Consider two scenarios:
A. Giving 50 kids synthetic sole shoes and 50 kids leather shoesand then collect them back, comparing the average wear ineach group
B. Give each of a random sample of 50 kids one shoe made bythe new synthetic materials and one shoe made with leather
What type of test should be used in each cased: paired orunpaired?
MATH 450 Chapter 10: Inferences based on two samples
![Page 20: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/20.jpg)
Example
Consider an experiment in which each of 13 workers was providedwith both a conventional shovel and a shovel whose blade wasperforated with small holes. The following data on stable energyexpenditure is provided:
Calculate a confidence interval at the 95% confidence level for thetrue average difference between energy expenditure for theconventional shovel and the perforated shovel (assuming that thedifferences follow normal distribution).
MATH 450 Chapter 10: Inferences based on two samples
![Page 21: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/21.jpg)
Example
Consider an experiment in which each of 13 workers was providedwith both a conventional shovel and a shovel whose blade wasperforated with small holes. The following data on stable energyexpenditure is provided:
Carry out a test of hypotheses at significance level .05 to see if trueaverage energy expenditure using the conventional shovel exceedsthat using the perforated shovel; include a P-value in your analysis.
MATH 450 Chapter 10: Inferences based on two samples
![Page 22: Chapter 10: Inferences based on two samplesvucdinh.github.io/Files/lecture24.pdf · 2020. 12. 15. · Overview 10.1Di erence between two population means 10.2The two-sample t test](https://reader036.fdocuments.us/reader036/viewer/2022071408/60ffd76b6df8ea0fca1783e4/html5/thumbnails/22.jpg)
t-table
MATH 450 Chapter 10: Inferences based on two samples