Chapter 3 Numerical Descriptive Measures (3.1, 3.2, 3.3 ......
Transcript of Chapter 3 Numerical Descriptive Measures (3.1, 3.2, 3.3 ......
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.
@9A3%&:4?*B35:%&#)&C3*D3459%35
!"#$%&'()
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,
3.1 Central TendencyMost variables show a distinct tendency to group around a central value. When people talk about an “average value” or the “middle value” or the “most frequent value,” they are talking informally about the mean, median, and mode—three measures of central tendency.
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=
! The central tendency is the extent to which all the data values group around a or central value.
! The variation is the amount of dispersion or scattering of values
! The shape is the pattern of the distribution of values from the lowest value to the highest value.
>9AA4%$*B3E&6&)&"65
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F
Measures of Central TendencyThe Mean
" G(3*4%&)(A3)&:*A346*H"E)36*I95)*:4??38*)(3*JA346KL*&5*)(3*A"5)*:"AA"6*A3459%3*"E*:36)%4?*)36836:$
" M"%*4*54A#?3*"E*5&N3*6O
>4A#?3*5&N3
!"""
!
"" !#$
!
$%% +++==
!= !
PQ53%C38*C4?935
G(3*&)( C4?932%"6"96:38*RSQ4%
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*/
" G(3*A"5)*:"AA"6*A3459%3*"E*:36)%4?*)36836:$" D346*T*59A*"E*C4?935*8&C&838*Q$*)(3*69AQ3%*"E*C4?935" UEE3:)38*Q$*3R)%3A3*C4?935*H"9)?&3%5L
**((*+((*)((*,((*-((*.((*/((*0((*1(+2
3(5(*)
**((*+((*)((*,((*-((*.((*/((*0((*1(+2
3(5(*,
31565
55141312111
==++++ 41
570
52041312111
==++++
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*V
D3459%35*"E*!36)%4?*G36836:$OG(3*D38&46
" ;6*46*"%83%38*4%%4$0*)(3*A38&46*&5*)(3*JA&88?3K*69AQ3%*H/-W*4Q"C30*/-W*Q3?"XL
" Y355*5365&)&C3*)(46*)(3*A346*)"*3R)%3A3*C4?935
3&67#4(5(*) 3&67#4(5(*)
**((*+((*)((*,((*-((*.((*/((*0((*1(+2 **((*+((*)((*,((*-((*.((*/((*0((*1(+2
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*[
Y":4)&6'*)(3*D38&46" The location of the median when the values are in numerical
order (smallest to largest):
" If the number of values is odd, the median is the middle number
" If the number of values is even, the median is the average of the two middle numbers
Note that is not the value of the median, only the
position of the median in the ranked data
!"#"$%!&%&!#'&()*$+(#($),-)*$+(#($).&!(") +
=
!"# +
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*\
D3459%35*"E*!36)%4?*G36836:$OG(3*D"83
" Value that occurs most often" Not affected by extreme values" Used for either numerical or categorical (nominal) data" There may may be no mode" There may be several modes
2(((*(((+((()(((,(((-(((.(((/(((0(((1(((*2(((**(((*+(((*)(((*,
386&(5(1
2(((*(((+((()(((,(((-(((.
98(386&
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*1
Measures of Central Tendency:Review Example
House Prices:
$2,000,000$ 500,000$ 300,000$ 100,000$ 100,000
Sum $ 3,000,000
! Mean: ($3,000,000/5) = $600,000
! Median: middle value of ranked data
= $300,000! Mode: most frequent value
= $100,000
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.-
D3459%3*"E*!36)%4?*G36836:$*M"%*G(3*]4)3*PE*!(46'3*PE*U*Z4%&4Q?3*PC3%*G&A3OG(3*^3"A3)%&:*D346*_*G(3*^3"A3)%&:*]4)3*"E*]3)9%6
Geometric mean! Used to measure the rate of change of a variable
over time
! Geometric mean rate of return! Measures the status of an investment over time
! Where Ri is the rate of return in time period i
n/1n21G )XXX(X !!!= !
!"#$!%"$!%"$!&%$ '(!')!* !+""+"+= !
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*..
Example
An investment of $100,000 declined to $50,000 at the end of year one and rebounded to $100,000 at end of year two:
The overall two-year return is zero, since it started and ended at the same level.
!!!"#!!$%!!!"&!$%!!!"#!!$% '(# ===
/-W*83:%3453***************.--W*&6:%3453
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.,
Use the 1-year returns to compute the arithmetic mean and the geometric mean:
%2525.2
)1()5.(====
++!!==X
Arithmetic mean rate of return:
Geometric mean rate of return:
%0111)]2()50[(.1))]1(1())5.(1[(
1)]1()1()1[(
2/12/1
2/1
/121
==!!==!!""==
!!++""!!++==
!!++""""++""++== nnG RRRR !
Misleading result
More
representative
result
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.=
Measures of Central Tendency:Summary
Central Tendency
Arithmetic Mean
Median Mode Geometric Mean
!
""
!
##!
== 1
!"#!$#% &'''(' !!!= !
D&88?3*C4?93*&6*)(3*"%83%38*4%%4$
D"5)*E%3`936)?$*"Q53%C38*C4?93
]4)3*"E*:(46'3*"E4*C4%&4Q?3*"C3%*)&A3
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.F
:#;&(<&4%&'=(67>>&'&4%(?#'7#%784
3.2 Variation and ShapeMeasures of Variation
" D3459%35*"E*C4%&4)&"6*'&C3*&6E"%A4)&"6*"6*)(3*@$'("% ?#'7#A7B7%C "%*67@$&'@784 "E*)(3*84)4*C4?935<
D#'7#%784
:%#46#'6(E&?7#%784
!8&>>7<7&4%(8>(D#'7#%784
F#4G& D#'7#4<&
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*./
The Range
! Simplest measure of variation! Difference between the largest and the smallest values:
Range = Xlargest – Xsmallest
-***.***,***=***F***/***V***[***\***1***.-***..***.,****.=***.F***
F#4G&(5(*)(H *(5(*+
Example:
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.V
Why The Range Can Be Misleading
! Does not account for how the data are distributed
! Sensitive to outliers
/(((((0(((((1(((((*2((((**((((*+Range = 12 - 7 = 5
/(((((0(((((1((((*2(((((**((((*+Range = 12 - 7 = 5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 5 - 1 = 4
Range = 120 - 1 = 119
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.[
" Average (approximately) of squared deviations of values from the mean" >4A#?3 C4%&46:3O
The Sample Variance
!"#
$%&%'
#
!(
)(
)!=
"=
Where = arithmetic mean
n = sample size
Xi = ith value of the variable X
!
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.\
The Sample Standard Deviation
" Most commonly used measure of variation" Shows variation about the mean
" Is the square root of the variance" Has the same units as the original data
" >4A#?3 5)4684%8*83C&4)&"6O
!"#
$%&%'
#
!(
)(!
=
"=
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*.1
Steps for Computing Standard Deviation1. Compute the difference between each value and the
mean.2. Square each difference.
3. Add the squared differences.4. Divide this total by n-1 to get the sample variance.5. Take the square root of the sample variance to get the
sample standard deviation.
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,-
Calculation Example
:#;$B&(E#%#((IJ7K(L(((((*2(((((*+(((((*,(((((*-((((*/((((*0((((*0((((+,
6*T*\************D346*T*a*T*.V
!"#$%&'(#$
()(*+,-!(*+,(!(*+,(-(*+,($
(.+/,-!+/,(!+/,(-+/,($0
----
----
==
!
!++!+!+!=
!
!++!+!+!=
!
!
U*A3459%3*"E*)(3*J4C3%4'3K*5:4))3%*4%"968*)(3*A346
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,.
Comparing Standard Deviations
Mean = 15.5S = 3.338**((((*+((((*)((((*,((((*-((((*.((((*/((((*0((((*1((((+2(((+*
**((((*+((((*)((((*,((((*-((((*.((((*/((((*0((((*1((((+2(((+*
Data B
Data A
Mean = 15.5S = 0.926
**((((*+((((*)((((*,((((*-((((*.((((*/((((*0((((*1((((+2(((+*
Mean = 15.5S = 4.567
Data C
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,,
Smaller standard deviation
Larger standard deviation
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,=
Summary Characteristics
! The more the data are spread out, the greater the range, variance, and standard deviation.
! The more the data are concentrated, the smaller the range, variance, and standard deviation.
! If the values are all the same (no variation), all these measures will be zero.
! None of these measures are ever negative.
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,F
The Coefficient of Variation" D3459%35*%3?4)&C3*C4%&4)&"6" U?X4$5*&6*#3%:36)4'3*HWL" >("X5*C4%&4)&"6*%3?4)&C3*)"*A346" !46*Q3*9538*)"*:"A#4%3*)(3*C4%&4Q&?&)$*"E*)X"*"%*A"%3*53)5*"E*84)4*A3459%38*&6*8&EE3%36)*96&)5*
!""#$%&' !""#
$%%&
'=
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,/
D3459%35*"E*Z4%&4)&"6O!"A#4%&6'*!"3EE&:&36)5*"E*Z4%&4)&"6
" >)":b*UO" UC3%4'3*#%&:3*?45)*$34%*T*c/-" >)4684%8*83C&4)&"6*T*c/
" >)":b*dO" UC3%4'3*#%&:3*?45)*$34%*T*c.--" >)4684%8*83C&4)&"6*T*c/
d")(*5)":b5*(4C3*)(3*54A3*5)4684%8*83C&4)&"60*Q9)*5)":b*d*&5*?355*C4%&4Q?3*%3?4)&C3*)"*&)5*#%&:3
!"#!""#$%"$%!""#
&'()* =!=!""#
$%%&
'=
!"#$$"%#$$%!#$$"
&'()* =!=!""#
$%%&
'=
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,V
Locating Extreme Outliers: Z-Score
where X represents the data valueX is the sample mean
S is the sample standard deviation
SXXZ !
= B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,[
Locating Extreme Outliers: Z-Score
! Suppose the mean math SAT score is 490, with a standard deviation of 100.
! Compute the Z-score for a test score of 620.
3.1100130
100490620
==!
=!
=SXXZ
A score of 620 is 1.3 standard deviations above the mean and would not be considered an outlier.
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,\
Shape of a Distribution
" B35:%&Q35*("X*84)4*4%3*8&5)%&Q9)38" GX"*953E9?*5(4#3*%3?4)38*5)4)&5)&:5*4%3O
" >b3X6355" D3459%35*)(3*3R)36)*)"*X(&:(*84)4*C4?935*4%3*6")*5$AA3)%&:4?
" e9%)"5&5" e9%)"5&5*4EE3:)5*)(3*#34b386355*"E*)(3*:9%C3*"E*)(3*8&5)%&Q9)&"6f)(4)*&50*("X*5(4%#?$*)(3*:9%C3*%&535*4##%"4:(&6'*)(3*:36)3%*"E*)(3*8&5)%&Q9)&"6
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*,1
Shape of a Distribution (Skewness)
" D3459%35*)(3*3R)36)*)"*X(&:(*84)4*&5*6")*5$AA3)%&:4?
3(5 3&67#43(M 3&67#4 3&67#4(M(3F7G"%H:N&O&6P&>%H:N&O&6 :C;;&%'7<
:N&O4&@@:%#%7@%7< g*- - h-*
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=-
>(4#3*"E*4*B&5)%&Q9)&"6**SS e9%)"5&5*A3459%35*("X*5(4%#?$*)(3*:9%C3*%&535*4##%"4:(&6'*)(3*:36)3%*"E*)(3*8&5)%&Q9)&"6L
:"#'$&'(Q&#NR"#4(S&BBH:"#$&6ITU'%8@7@(V(2K
WB#%%&'(R"#4S&BBH:"#$&6ITU'%8@7@(M(2K
S&BBH:"#$&6ITU'%8@7@(5(2K
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=.
3.3 Exploring Numerical DataSections 3.1 and 3.2 discuss measures of central tendency, variation, and shape. You can also visualize the distribution of the values for a numerical variable by computing the quartiles and five-number summary and constructing aboxplot.Quartile
" Quartiles split the ranked data into 4 segments with an equal number of values per segment
" The first quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger
" Q2 is the same as the median (50% of the observations are smaller and 50% are larger)
" Only 25% of the observations are greater than the third quartile
i. i, i=
+-X +-X +-X +-X
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=,
i94%)&?3*D3459%35OY":4)&6'*i94%)&?35
M&68*4*`94%)&?3*Q$*83)3%A&6&6'*)(3*C4?93*&6*)(3*4##%"#%&4)3*#"5&)&"6*&6*)(3*%46b38*84)40*X(3%3
M&%5)*̀ 94%)&?3*#"5&)&"6O Y* 5(I4Z*K[,((((%46b38*C4?93
>3:"68*`94%)&?3*#"5&)&"6O Y+ 5(I4Z*K[+ %46b38*C4?93
G(&%8*̀ 94%)&?3*#"5&)&"6O Y) 5()I4Z*K[,((%46b38*C4?93
X(3%3 4 &5*)(3*69AQ3%*"E*"Q53%C38*C4?935
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*==
i94%)&?3*D3459%35O!4?:9?4)&"6*]9?35
" j(36*:4?:9?4)&6'*)(3*%46b38*#"5&)&"6*953*)(3*E"??"X&6'*%9?35" ;E*)(3*%359?)*&5*4*X("?3*69AQ3%*)(36*&)*&5*)(3*%46b38*#"5&)&"6*)"*953
" ;E*)(3*%359?)*&5*4*E%4:)&"64?*(4?E*H3<'<*,</0*[</0*\</0*3):<L*)(36*4C3%4'3*)(3*)X"*:"%%35#"68&6'*84)4*C4?935<
" ;E*)(3*%359?)*&5*6")*4*X("?3*69AQ3%*"%*4*E%4:)&"64?*(4?E*)(36*%"968*)(3*%359?)*)"*)(3*634%35)*&6)3'3%*)"*E&68*)(3*%46b38*#"5&)&"6<
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=F
H6*T*1Li. &5*&6*)(3 H1k.LlF*T*,</*#"5&)&"6*"E*)(3*%46b38*84)45"*953*)(3*C4?93*(4?E*X4$*Q3)X336*)(3*,68 468*=%8 C4?9350
5"****Y* 5(*+\-
i94%)&?3*D3459%35OY":4)&6'*i94%)&?35
:#;$B&(E#%#(74(]'6&'&6(^''#CL((**(((*+(((*)(((*.(((*.(((*/(((*0(((+*(((++
i. 468*i= 4%3*A3459%35*"E*6"6S:36)%4?*?":4)&"6i, T*A38&460*&5*4*A3459%3*"E*:36)%4?*)36836:$
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=/
H6*T*1Li. &5*&6*)(3 H1k.LlF*T*,</*#"5&)&"6*"E*)(3*%46b38*84)40
5"****Y* 5(I*+Z*)K[+(5(*+\-
i, &5*&6*)(3 H1k.Ll,*T*/)( #"5&)&"6*"E*)(3*%46b38*84)405"****Y+ 5(;&67#4(5(*.
i= &5*&6*)(3 =H1k.LlF*T*[</*#"5&)&"6*"E*)(3*%46b38*84)405"****Y) 5(I*0Z+*K[+(5(*1\-
i94%)&?3*D3459%35!4?:9?4)&6'*G(3*i94%)&?35O**7R4A#?3
:#;$B&(E#%#(74(]'6&'&6(^''#CL((**(((*+(((*)(((*.(((*.(((*/(((*0(((+*(((++
i. 468*i= 4%3*A3459%35*"E*6"6S:36)%4?*?":4)&"6i, T*A38&460*&5*4*A3459%3*"E*:36)%4?*)36836:$
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=V
i94%)&?3*D3459%35OG(3*;6)3%`94%)&?3*]46'3*H;i]L
" G(3*;i]*&5*i= m i. 468*A3459%35*)(3*5#%348*&6*)(3*A&88?3*/-W*"E*)(3*84)4
" G(3*;i]*&5*4?5"*:4??38*)(3*A&85#%348*Q3:4953*&)*:"C3%5*)(3*A&88?3*/-W*"E*)(3*84)4
" G(3*;i]*&5*4*A3459%3*"E*C4%&4Q&?&)$*)(4)*&5*6")*&6E?936:38*Q$*"9)?&3%5*"%*3R)%3A3*C4?935
" D3459%35*?&b3*i.0*i=0*468*;i]*)(4)*4%3*6")*&6E?936:38*Q$*"9)?&3%5*4%3*:4??38*%35&5)46)*A3459%35
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=[
Calculating The Interquartile Range
D38&46Hi,L
aA4R&A9Aa
A&6&A9A i. i=
7R4A#?3O
,/W*****************,/W***************,/W**********,/W
*+((((((((((((((((((((()2(((((((((((((((((,-(((((((((((-/((((((((((((/2
;6)3%`94%)&?3*%46'3*T*/[*m =-*T*,[
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=\
G(3*M&C3*@9AQ3%*>9AA4%$
The five numbers that help describe the center, spread and shape of data are:
! Xsmallest
! First Quartile (Q1)! Median (Q2)! Third Quartile (Q3)! Xlargest
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*=1
]3?4)&"65(*4A"6'*)(3*E&C3S69AQ3%*59AA4%$*468*8&5)%&Q9)&"6*5(4#3
P&>%H:N&O&6 :C;;&%'7< F7G"%H:N&O&6D38&46*m a5A4??35)
h
a?4%'35) m D38&46
D38&46*m a5A4??35)
n
a?4%'35) m D38&46
D38&46*m a5A4??35)
g
a?4%'35) m D38&46
i. m a5A4??35)
h
a?4%'35) m i=
i. m a5A4??35)
n
a?4%'35) m i=
i. m a5A4??35)
g
a?4%'35) m i=D38&46*m i.
h
i= m D38&46
D38&46*m i.
n
i= m D38&46
D38&46*m i.
g
i= m D38&46
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F-
M&C3*@9AQ3%*>9AA4%$*468G(3*d"R#?")
" G(3*d"R#?")O*U*^%4#(&:4?*8&5#?4$*"E*)(3*84)4*Q4538*"6*)(3*E&C3S69AQ3%*59AA4%$O
7R4A#?3O
a5A4??35) SS i. SS D38&46***SS i= SS a?4%'35)
,/W*"E*84)4*****************,/W*************,/W********,/W*"E*84)4"E*84)4**********"E*84)4
a5A4??35) i. D38&46******i= a?4%'35)
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F.
M&C3*@9AQ3%*>9AA4%$O>(4#3*"E*d"R#?")5
" ;E*84)4*4%3*5$AA3)%&:*4%"968*)(3*A38&46*)(36*)(3*Q"R*468*:36)%4?*?&63*4%3*:36)3%38*Q3)X336*)(3*368#"&6)5
" U*d"R#?")*:46*Q3*5("X6*&6*3&)(3%*4*C3%)&:4?*"%*("%&N"6)4?*"%&36)4)&"6
a5A4??35) i. D38&46******i= a?4%'35)
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F,
B&5)%&Q9)&"6*>(4#3*468*G(3*d"R#?")
]&'()S>b3X38Y3E)S>b3X38 >$AA3)%&:
Y* Y+ Y) Y* Y+ Y) Y* Y+ Y)
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F=
d"R#?")*7R4A#?3
" d3?"X*&5*4*d"R#?")*E"%*)(3*E"??"X&6'*84)4O
-****,*****,******,*****=*****=*****F*****/*****/****1****,[
" G(3*84)4*4%3*%&'()*5b3X380*45*)(3*#?")*83#&:)5
!!!!"!!!#!$!!%!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!#&2(((+(()(((-((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((+/
J@;#BB&@% Y* Y+ [(3&67#4(((((((((((((Y)( JB#'G&@%
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*FF
3.4 Numerical Descriptive Measures for a Population
! Descriptive statistics discussed previously described a sample, not the population.
! Summary measures describing a population, called parameters, are denoted with Greek letters.
! Important population parameters are the population mean, variance, and standard deviation.
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F/
@9A3%&:4?*B35:%&#)&C3*D3459%35*E"%*4*2"#9?4)&"6O**G(3*A346*o
" G(3*#"#9?4)&"6*A346 &5*)(3*59A*"E*)(3*C4?935*&6*)(3*#"#9?4)&"6*8&C&838*Q$*)(3*#"#9?4)&"6*5&N30*@
!"""
!
"!#$
!
$%% +++==µ
!= !
p T*#"#9?4)&"6*A346
@*T*#"#9?4)&"6*5&N3
a& T*&)( C4?93*"E*)(3*C4%&4Q?3*a
j(3%3
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*FV
" UC3%4'3*"E*5`94%38*83C&4)&"65*"E*C4?935*E%"A*)(3*A346
" 2"#9?4)&"6 C4%&46:3O
@9A3%&:4?*B35:%&#)&C3*D3459%35*M"%*U*2"#9?4)&"6O**G(3*Z4%&46:3*q,
!
!""##
!
$%
&%
&!=
"=
j(3%3 p T*#"#9?4)&"6*A346
@*T*#"#9?4)&"6*5&N3
a& T*&)( C4?93*"E*)(3*C4%&4Q?3*a
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F[
@9A3%&:4?*B35:%&#)&C3*D3459%35*M"%*U*2"#9?4)&"6O**G(3*>)4684%8*B3C&4)&"6*q
" D"5)*:"AA"6?$*9538*A3459%3*"E*C4%&4)&"6" >("X5*C4%&4)&"6*4Q"9)*)(3*A346" ;5*)(3*5`94%3*%"")*"E*)(3*#"#9?4)&"6*C4%&46:3" r45*)(3*54A3*96&)5*45*)(3*"%&'&64?*84)4
" 2"#9?4)&"6 5)4684%8*83C&4)&"6O
!
!""##
!
$%
&%!
=
"=
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F\
>4A#?3*5)4)&5)&:5*C3%595*#"#9?4)&"6*#4%4A3)3%5
Measure Population Parameter
Sample Statistic
Mean
Variance
Standard Deviation
X
2S
S
µ
2!
!
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*F1
" G(3*3A#&%&:4?*%9?3*4##%"R&A4)35*)(3*C4%&4)&"6*"E*84)4*&6*4*Q3??S5(4#38*8&5)%&Q9)&"6
" U##%"R&A4)3?$*V\W "E*)(3*84)4*&6*4*Q3??*5(4#38*8&5)%&Q9)&"6*&5*X&)(&6*.*5)4684%8*83C&4)&"6*"E*)(3*A346*"%*
G(3*7A#&%&:4?*]9?3
!"!" ±
!
.0X
!!"±
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*/-
" U##%"R&A4)3?$*1/W*"E*)(3*84)4*&6*4*Q3??S5(4#38*8&5)%&Q9)&"6*?&35*X&)(&6*)X"*5)4684%8*83C&4)&"65*"E*)(3*A3460*"%*o*s ,q
" U##%"R&A4)3?$*11<[W*"E*)(3*84)4*&6*4*Q3??S5(4#38*8&5)%&Q9)&"6*?&35*X&)(&6*)(%33*5)4684%8*83C&4)&"65*"E*)(3*A3460*"%*o*s =q
G(3*7A#&%&:4?*]9?3
!!" ±
11\/X1-X
!!" ±
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*/.
t5&6'*)(3*7A#&%&:4?*]9?3
! Suppose that the variable Math SAT scores is bell-shaped with a mean of 500 and a standard deviation of 90. Then,
! 68% of all test takers scored between 410 and 590 (500 ± 90).
! 95% of all test takers scored between 320 and 680 (500 ± 180).
! 99.7% of all test takers scored between 230 and 770 (500 ± 270).
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*/,
" ]3'4%8?355*"E*("X*)(3*84)4*4%3*8&5)%&Q9)380*4)*?345)*H.*S .lb,L*R*.--W "E*)(3*C4?935*X&??*E4??*X&)(&6*b 5)4684%8*83C&4)&"65*"E*)(3*A346*HE"%*b*h*.L
" 7R4A#?35O
H.*S .l,,L*R*.--W*T*[/W*u<<<<<<<<<<<<<<*bT,**Hp s ,qLH.*S .l=,L*R*.--W*T*\\<\1W*uuu<<*bT=**Hp s =qL
!(3Q$5(3C*]9?3
j&)(&6U)*?345)
B!PZU
!"#$%&'()*+*,-./0*,-.,0*,--1*234%5"6*789:4)&"60*;6:< !(4#)3%*=0*>?&83*/=
G(46b*$"9