9709_y06_sw_23

download 9709_y06_sw_23

of 6

Transcript of 9709_y06_sw_23

  • 8/12/2019 9709_y06_sw_23

    1/6

    Section 2 of Unit 07 (Statist ics 2)

    Sampling, estimates and hypothesis tests

    Recommended Prior KnowledgeStudents must have studied S1 and Section 1 of Statistics 2 before starting this Section.

    Context Sections 1 and 2 must be studied in order, since the ideas in Section 1 are required in Section 2.

    OutlineThis Section introduces the ideas behind sampling and estimation. The idea of a sample mean and how sample means are normally distributed areconsidered. The Section continues with the central limit theorem and moves on to unbiased estimates and confidence intervals. The Section concludes withan introduction to hypothesis testing and the terminology associated with it. The study looks at hypothesis testing with sample means and with a singleobservation taken from the Poisson or binomial distribution. It concludes with a study of Type 1 and Type 2 errors.

    Topic Learning Outcomes Suggested Teaching activi ties Resources On-LineResources

    4 Sampling and estimation

    Understand the distinctionbetween a sample and apopulation, and appreciate thenecessity for randomness inchoosing samples.

    Discuss the distinction between a sample and a population.Discuss, in general terms, the ideas behind sampling andthe reasons for taking samples. Talk about the idea ofrandom samples and, in particular, discuss the idea ofusing random numbers, as produced on a calculator, or inrandom number tables. Discuss the differences between

    large and small samples - of the impracticality of taking alarge sample of measurements when items such as a car ora plane need to be tested to destruction. Discuss how alarge sample, if practical, will lead to greater accuracy inforecasting statistics of the parent population, particularlywith regard to election results.

    www.mathsrevision.netA-Level SectionStatisticsSampling 1Sampling 2

    RandomSamples

    www.Xtrem

    ePapers.com

    http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/
  • 8/12/2019 9709_y06_sw_23

    2/6

    Explain in simple terms why agiven sampling method may beunsatisfactory (knowledge ofparticular sampling methods, suchas quota or stratified sampling, isnot required, but candidatesshould have an elementaryunderstanding of the use ofrandom numbers in producingrandom samples).

    Recognise that a sample meancan be regarded as a randomvariable, and use the fact that

    =)E(X and thatn

    X

    2)(Var = .

    Use the fact that X has a normal

    distribution ifX

    has a normaldistribution.

    Use the central limit theoremwhere appropriate.

    Students are not required to have knowledge of specific

    sampling methods, such as quota, or stratified, but generaldiscussions of these and other methods is worthwhile iftime allows. Similarly students should realise why particularsampling methods (such as using a telephone directory or adata-base of car owners) can lead to bias.

    Students should recognise that the mean of a sample

    (usually referred to as X ) is itself a random variable and

    that =)E(X andn

    X

    2)(Var = . Students should be

    able to prove these results by referring back to the work onlinear combinations of random variables. By

    takingn

    XXXX n

    +++

    ...as 21 , the results should follow.

    www.mathsrevision.netA-Level SectionStatisticsCentral LimitThoerem

    http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/
  • 8/12/2019 9709_y06_sw_23

    3/6

    Calculate unbiased estimates ofthe population mean and variancefrom a sample, using either raw orsummarised data (only a simpleunderstanding of the termunbiased is required).

    Discuss with the students the advantage of taking the mean

    of a sample of readings and using the sample mean as amore accurate statistic in hypothesis testing. This willprepare the ground for future work on hypothesis testing.If time allows, students can take groups of 9 single-digitrandom numbers (generated on a calculator or by randomnumber tables) and evaluate the mean of the group of 9numbers. If a large number of these sample means isavailable, students can represent the results in a frequencytable and hence plot a frequency curve (if sample means

    are recorded to the nearest integer) or a histogram (ifsample means are recorded to 3 significant figures). Thedistribution can now be recognised as being approximatelynormal. This is a good introduction from which to

    test =)E(X and thatn

    X

    2)(Var = . It can also lead to

    the students recognition that the sample meansthemselves are normally distributed and lead to an

    introduction to the application of the central limit theorem.This is a good opportunity to test different distributions(discrete probability distribution, binomial distribution,Poisson distribution, normal distribution, continuousdistributions) with the central limit theorem.

    Introduce the idea of an unbiased estimate (a difficultconcept to fully understand) and calculate unbiasedestimates for the population mean and variance from asample of raw data. In particular students should be awareof the need to use (n1) rather than nin the formula for thepopulation variance.

    www.mathsrevision.netA-Level SectionStatisticsEstimators

    www.mathsrevision

    .netA-Level SectionStatisticsConfidenceIntervals

    http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/http://www.mathsrevision.net/
  • 8/12/2019 9709_y06_sw_23

    4/6

    http://www.mathsrevision.net/http://www.mathsrevision.net/
  • 8/12/2019 9709_y06_sw_23

    5/6

    Formulate hypotheses and carry

    out a hypothesis test concerningthe population mean in caseswhere the population is normallydistributed with known variance orwhere a large sample is used.

    Understand the terms Type 1 errorand Type 2 error in relation tohypothesis tests.

    Calculate the probabilities ofmaking Type 1 and Type 2 errorsin specific situations involving testsbased on a normal distribution ordirect evaluation of binomial orPoisson probabilities.

    The students should recognise the following terms:-

    H0 The machine is functioning as previously with L= 200.H1 The machine is producing bulbs for which the meanvalue ofL< 200.Test statistic: L= 170.Significance level of the test: 5%.Type of Test: One-tail.Rejection (or critical) region: Values of Lthat lie in thebottom 5% of the normal distribution.Calculation: The limit for Lis in fact given by z = 1.64,

    from which the corresponding value of L= 167.2 isobtained. The value of L=170 lies within the acceptableregion. The result is not significant.Therefore H0is accepted and H1is rejected.

    It is worth repeating the above exercise for similarquestions using examples that involve the binomial andPoisson distributions, using either direct evaluation ofprobabilities or a normal approximation as appropriate.

    It is good practice for the students to list the various termsfor each example as in the solution to the question above.

    Repeat the exercise with a population mean in cases wherethe population is normally distributed with known varianceor where a large sample is used.

    Discuss with students the meaning of Type 1 and Type 2

    errors. Students should realise that a Type 1 error occurswhen the null Hypothesis has been rejected when it is infact true. The probability of a Type 1 error is therefore thesame as the probability of a reading lying in the criticalregion i.e. the significance level of the test.

    mentioned in the

    teaching activity.

  • 8/12/2019 9709_y06_sw_23

    6/6

    A Type 2 error is made when the null hypothesis is

    accepted, when in fact an alternative hypothesis is true.The probability of making a Type 2 error is dependent uponthe alternative hypothesis. For example, in the test above,if subsequently the information was given that the lifetime oflight bulbs (L) is a normal distribution with mean 150 hours,then a Type 2 error will be made whenever a value of Lgreater than 167.2 is obtained for N(150,400). Again,students need considerable practice in using Type 1 andType 2 errors.

    Have ready

    workedexamplesillustrating theuse of bothType 1 andType 2 errors.