Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the...

21
Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University [email protected] [email protected] Panel a 2013 Joint Mathematics Meetings San Diego, CA

Transcript of Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the...

Page 1: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

RandomizationandBootstrapMethodsintheIntroductory

StatisticsCourse

KariLockMorgan RobinLockDukeUniversity St.LawrenceUniversity

[email protected] [email protected]

Panela2013JointMathematicsMeetingsSanDiego,CA

Page 2: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

HowmighttheIntroStatcurriculumchangeto

accommodate/takeadvantageofbootstrap/randomization

methods?

Page 3: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

IntroStat– TraditionalTopics• DescriptiveStatistics– oneandtwosamples• Normaldistributions• Dataproduction(samples/experiments)

• Samplingdistributions(mean/proportion)

• Confidenceintervals(means/proportions)

• Hypothesistests(means/proportions)

• ANOVAforseveralmeans,Inferenceforregression,Chi-squaretests

Page 4: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

IntroStat– RevisetheTopics• DescriptiveStatistics– oneandtwosamples• Normaldistributions• Dataproduction(samples/experiments)

• Samplingdistributions(mean/proportion)

• Confidenceintervals(means/proportions)

• Hypothesistests(means/proportions)

• ANOVAforseveralmeans,Inferenceforregression,Chi-squaretests

• Dataproduction(samples/experiments)• Bootstrapconfidenceintervals• Randomization-basedhypothesistests• Normaldistributions

• Bootstrapconfidenceintervals• Randomization-basedhypothesistests

• DescriptiveStatistics– oneandtwosamples

Page 5: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

WhystartwithBootstrapCI’s?•Minimalprerequisites:

Populationparametervs.samplestatisticRandomsamplingDotplot (orhistogram)Standarddeviationand/orpercentiles

• SamemethodofrandomizationinmostcasesSamplewithreplacementfromoriginalsample

• NaturalprogressionSampleestimate==>Howaccurateistheestimate?

• Intervalsaremoreuseful?Agooddebateforanothersession…

Page 6: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

Example:MustangPrices

Data:Sampleof25MustangslistedonAutotrader.com

Findaconfidence intervalfortheslope ofaregression linetopredictpricesofusedMustangsbasedontheirmileage.

Page 7: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

“Bootstrap”SamplesKeyidea:• Samplewithreplacementfromtheoriginalsampleusingthesamen.

• Computethesamplestatisticforeachbootstrapsample.

• Collectlotsofsuchbootstrapstatistics

Imaginethe“population”ismany,manycopiesoftheoriginalsample.

Page 8: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

Distributionof3000BootstrapSlopes

Page 9: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

UsingtheBootstrapDistributiontoGetaConfidenceInterval– Version#1

Thestandarddeviationofthebootstrapstatisticsestimatesthestandarderrorofthesamplestatistic.

Quickintervalestimate:

𝑂𝑟𝑖𝑔𝑖𝑛𝑎𝑙𝑆𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 ± 2 / 𝑆𝐸ForthemeanMustangslopetime:

)162.0,278.0(058.022.0029.0222.0 −−=−±−=⋅±−

Page 10: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

UsingtheBootstrapDistributiontoGetaConfidenceInterval– Version#2

Keep95%inmiddle

Chop2.5%ineachtail

Chop2.5%ineachtail

95%CIforslope(-0.279,-0.163)

Page 11: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

3.SimulationTechnology?

Fall2010:FathomFall2011:Fathom&Applets

Tactilesimulationsfirst?Bootstrap– No(withreplacementistough)Testforanexperiment– Yes(1or2)

Page 12: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

DesirableTechnologyFeatures?

ThreeDistributions

OnetoManySamples

Page 13: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

DesirableTechnologyFeatures

Page 14: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

4.OneCrankorTwo?

ConfidenceIntervals– Bootstrap– onecrank

SignificanceTests– Two(ormore)cranks

Rulesforselectingrandomizationsamplesforatest.Beconsistentwith:1. thenullhypothesis2. thesampledata3. thewaydatawerecollected

Page 15: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

RandomizationTestforSlope

Page 16: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

5.Testfora2x2Table

Firstexample:ArandomizedexperimentTeststatistic:CountinonecellRandomize:TreatmentgroupsMargins:FixbothLaterexamplesvary,e.g.usedifferenceinproportionsorrandomizeasindependentsampleswithcommonp.

Page 17: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

6.Whatabout“traditional”methods?

AFTERstudentshaveseenlotsofbootstrapandrandomizationdistributions(andhopefullybeguntounderstandthelogicofinference)…

• Introducethenormaldistribution(andlatert)

• Introduce“shortcuts”forestimatingSEforproportions,means,differences,…

Page 18: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

BacktoMustangPricesThe regression equation isPrice = 30.5 - 0.219 Miles

Predictor Coef SE Coef T PConstant 30.495 2.441 12.49 0.000Miles -0.21880 0.03130 -6.99 0.000

S = 6.42211 R-Sq = 68.0% R-Sq(adj) = 66.6%

Page 19: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

7.Assessment?

Newlearninggoals• Understandhowtogeneratebootstrap

samplesanddistribution.• Understandhowtocreaterandomization

samplesanddistribution.• Beabletouseabootstrap/randomization

distributiontofindaninterval/p-value.

Page 20: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

8.Howdiditgo?• Studentsenjoyedandwereengagedwiththenewapproach• Instructorenjoyedandwasengagedwiththenewapproach.• Betterunderstandingofp-valuereflecting“ifH0 istrue”.• Betterinterpretationsofintervals.• Challenge:Few“experienced”studentstoserveasresources.

Page 21: Randomization and Bootstrap Methods in the …Randomization and Bootstrap Methods in the Introductory Statistics Course Kari Lock Morgan Robin Lock Duke University St. Lawrence University

Goingforward

Continuewithrandomizationapproach?

ABSOLUTELY(3sectionsinFall2011)