Central Tendency and Variability The two most essential features of a distribution.
Central Tendency & Variability as 12.4.3
-
Upload
wmathematics -
Category
Documents
-
view
220 -
download
0
Transcript of Central Tendency & Variability as 12.4.3
-
8/4/2019 Central Tendency & Variability as 12.4.3
1/18
Central TendencyCentral Tendency
& Variability& Variability
A S 12.4.3
-
8/4/2019 Central Tendency & Variability as 12.4.3
2/18
Did you hear about thestatistician who put her
head in the oven and herfeet in the refrigerator?
She said, "On average, Ifeel just fine."
-
8/4/2019 Central Tendency & Variability as 12.4.3
3/18
Which average?Which average?
All three averages are useful for summarizing e.g. thedistribution of household incomes.
In 1998, the income common to thegreatest
number of households (mode) was R25000.
Half the households (median) earnedlessthan R38 885.
The mean income was R50 600.Reporting only one measure of centraltendency might be misleading and
perhaps reflect a bias.
-
8/4/2019 Central Tendency & Variability as 12.4.3
4/18
The table shows the heights of 50randomly chosen Grade 12 school
girls.
height (cm)midpoint (x) frequency(f) (f)(x)
150-
-
8/4/2019 Central Tendency & Variability as 12.4.3
5/18
height (cm) midpoint frequency(f)
150-
-
8/4/2019 Central Tendency & Variability as 12.4.3
6/18
It is a single summary figure that
describes the spread of data within adistribution. Range difference between thesmallest
and largest observations. Percentiles where p% of the valuesfalls
below a certain value. Interquartile Range (IQR) -Range ofthe
middle half ofmedian
scores. -
Measures of VariabilityMeasures of Variability
-
8/4/2019 Central Tendency & Variability as 12.4.3
7/18
The results of a survey of thetravelling
time (in minutes) of 200 workersare as
follows.Time(min)
Freq(f)
Cumfreq
0
-
8/4/2019 Central Tendency & Variability as 12.4.3
8/18
0 10 20 30 40 50 60
Time (min)
200
180
160
140120
100
80
60
40
20
0
Cu
m
freq
Use the graphto estimate:
the median Q2= 27
the interquartile rang
Q3=37; Q1= 16
IQR = 37 - 16=21
the no. of workers
with
no. = 200 192
= 8 workers
(100)
Q2 Q3Q1
(150)
(50)
-
8/4/2019 Central Tendency & Variability as 12.4.3
9/18
Unathi sells the following
number ofcomputers in 12 months:
Five-numberFive-number
summarysummaryThe summary consists of the lowest datavalue, first quartile (Q1), median (Q2), third
quartile (Q3) and highest data value.
34, 47, 1, 15, 57, 24, 20, 11, 19, 50, 28, 37
Arrange the data in ascending order.
1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 57
-
8/4/2019 Central Tendency & Variability as 12.4.3
10/18
1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 5
median
Median = Q2 =(24+28) /2 = 26
Q1 Q3
Q1 =(15+19) /2 = 17 Q3 =
(37+47) /2 = 42
5-number summary = 1; 17; 26; 42; 57
Give a five-number summary of the sales.
Minimum = 1 Maximum = 57
-
8/4/2019 Central Tendency & Variability as 12.4.3
11/18
Barry also sells computers during a 12month period.
Below is a 5-number summary for each person.
Unathi Barry
min 1 6
Q1
17 15
Q2 26 32
Q3 42 46
max 57 62
Which personwould you mostlikely want toappoint for yourcompany?
Barrys highest and lowest sales arehigher than Unathis corresponding sales,and Barrys median sales figure is also
higher than Unathis.
Barry
Explain.
-
8/4/2019 Central Tendency & Variability as 12.4.3
12/18
PercentilesPercentiles
A percentile is a score below which acertain percentage of values fall. Thereare 100 percentiles in a sample.
Oscars height is at the 90th
percentile andhis weight is at the 60th percentile for hisage.Describe Oscar's physical build in general
terms.
e.g. If your test score is in the 95thpercentile, it means that if 1000students took the test, at least 950students did worse than you and at
most 49 students did better than you.He is taller than 90% of thepeople but only weighs morethan 60% of the people -
possibly tall and thin.
It does not mean thatyou received 95% forthe test.
-
8/4/2019 Central Tendency & Variability as 12.4.3
13/18
242, 228, 217, 209, 253, 239, 266, 242,251, 240, 223, 219, 246, 260, 258, 225,234, 230, 249, 245, 254, 243, 235, 231,257.
In 2004 the snow depth at Tiffendell wasmeasured (in mm) for 25 days and
recorded.
Complete the frequency table,
cumulative frequency table and
Depth(mm)
200-210
210-220
220-230
230-240
240-250
250-260
260-270
freq
cum freq
cum %
1 2 53 7 5 2
1 3 6 1811 25234 2412 4
472
92 100
-
8/4/2019 Central Tendency & Variability as 12.4.3
14/18
Plot the graph of snow depthagainst the cumulative frequencyand cumulative % using two
different vertical axes.
200 210 220 230 240 250 260 270
25
20
15
10
5
0
100
80
60
40
20
0
Depth (mm)
Cum
freq
Cum
%
percentiles
Estimatethe 80th
percentile.252mm
-
8/4/2019 Central Tendency & Variability as 12.4.3
15/18
200 210 220 230 240 250 260 270
25
20
15
10
5
0
100
80
60
40
20
0
Depth (mm)
Cum
fre
q
Cum
%
For howmanydays wasthe depthat least250mm?
25-18= 7 days
A year later (2005)the depth is shown bythe broken line.Explain which year
had possibly better
2004 - depth greateron more days. E.g.44% below 240mmcompared to 80%
below 240mm in
-
8/4/2019 Central Tendency & Variability as 12.4.3
16/18
Standard deviation is useful when
comparing the spread of two or more datasets that have approximately the samemean.This technique is best used withsymmetric distributions with no outliers.
The smaller the standard deviation thenarrower the spread of measurementsaround the mean, as it has possibly fewhigh or low values.
Standard DeviationStandard Deviation
e.g. If the mean of a data set is 5 and theS.D. is 2, then on average the data lies
between 3 and 7.
-
8/4/2019 Central Tendency & Variability as 12.4.3
17/18
Super Crisps come in 25g bags. Thereare
two machines (A & B) producing thechips.A quality control engineer weighs a
sample
of 10 bags from each machine.
Calculate the mean of eachmachine.
A 25,3 25,6 24,8 25,7 25,5 25 24,9 25,7 25,5 25,6
B 25,3 25,3 25,4 24,9 25,3 25,3 25,4 25,4 25,4 25,3
For A: mean =
253,6
/10= 25,36g
For B: mean = 253 /10
= 25,3g
-
8/4/2019 Central Tendency & Variability as 12.4.3
18/18
Super Crisps will be taken to court if it is
found their bags are less than 25g.Which machine gives the best chance ofavoiding this fate?
Below is the variance and standarddeviation for each machine.
Explain.
Mean Variance S.D.
A 25,36g 1,044 1,02
B 25,3g 0,2 0,45
Machine B
A: Mass of chips are on average from24,34g to 26,38g. B: Mass of chips arefrom 24,85g to 25,75g. B has a narrower
spread than A. Its smallest value is very