PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and...
-
date post
20-Dec-2015 -
Category
Documents
-
view
219 -
download
3
Transcript of PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and...
![Page 1: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/1.jpg)
PPA 415 – Research Methods in Public Administration
Lecture 5 – Normal Curve, Sampling, and Estimation
![Page 2: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/2.jpg)
Normal Curve
The normal curve is central to the theory that underlies inferential statistics.
The normal curve is a theoretical model. A frequency polygon that is perfectly symmetrical and
smooth. Bell shaped, unimodal, with infinite tails. Crucial point distances along the horizontal axis,
when measured in standard deviations, always measure the same proportion under the curve.
![Page 3: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/3.jpg)
Normal Curve
![Page 4: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/4.jpg)
Normal Curve
![Page 5: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/5.jpg)
Computing Z-Scores
To find the percentage of the total area (or number of cases) above, below, or between scores in an empirical distribution, the original scores must be expressed in units of the standard deviation or converted into Z scores.
s
XXZ i
![Page 6: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/6.jpg)
Computing Z-Scores – Fair Housing Survey 2000
What was the last grade you completed in school?
20.018.016.014.012.010.08.06.04.0
100
80
60
40
20
0
Std. Dev = 2.46
Mean = 12.9
N = 156.00
![Page 7: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/7.jpg)
Computing Z-Scores: Examples
What percentage of the cases have between six and the mean years of education?
From Appendix A, Table A: Z=-2.81 is 0.0026. From Appendix A, Table A: Z=0 is .5. P6-12.9 = .5-.0026 = .4974. 49.74% of the distribution lies between 6 and 12.9 years of
education
805.246.2
9.6
46.2
9.126
s
XXZ i 0
46.2
0
46.2
9.129.12
s
XXZ i
![Page 8: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/8.jpg)
Computing Z-Scores: Examples
What percentage of the cases are less than eight years of education?
What percentage have more than 13 years?
0233.4767.5.,992.146.2
9.4
46.2
9.128
p
s
XXZ i
.4840.5160.1;04.46.2
1.
46.2
9.1213
p
s
XXZ i
![Page 9: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/9.jpg)
Computing Z-Scores: Examples
What percentage of Birmingham residents have between 10 and 13 years of education?
.1190.,18.146.2
9.2
46.2
9.1210
p
s
XXZ i
3970.1190.5160.
.5160.;04.46.2
1.
46.2
9.1213
1310
p
ps
XXZ i
![Page 10: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/10.jpg)
Computing Z-scores: Rules
If you want the distance between a score and the mean, subtract the probability from .5 if the Z is negative. Subtract .5 from the probability if Z is positive.
If you want the distance beyond a score (less than a score lower than the mean), use the probability in Appendix A, Table A. If the distance is more than a score higher than the mean), subtract the probability in Appendix A, Table A from 1.
![Page 11: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/11.jpg)
Computing Z-scores: Rules
If you want the difference between two scores other than the mean: Calculate Z for each score, identify the
appropriate probability, and subtract the smaller probability from the larger.
![Page 12: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/12.jpg)
Probability
One interpretation of the area under the normal curve is as probabilities.
Probabilities are determined as the number of successful events divided by the total possible number of events.
The probability of selecting a king of hearts from a deck of cards is 1/52 or .0192 (1.92%).
![Page 13: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/13.jpg)
Probability
The proportions under the normal curve can be treated as probabilities that a randomly selected case will fall within the prescribed limits.
Thus, in the Birmingham fair housing survey, the probability of selecting a resident with between 10 and 13 years of education is 39.7%.
![Page 14: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/14.jpg)
Sampling
One of the goals of social science research is to test our theories and hypotheses using many different types of people drawn from a broad cross section of society.
However, the populations we are interested in are usually too large to test.
![Page 15: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/15.jpg)
Sampling
To deal with this problem, researchers select samples or subsets of the population.
The goal is to learn about the populations using the data from the samples.
![Page 16: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/16.jpg)
Sampling
Basic procedures for selecting probability samples, the only kind that allow generalization to the larger population.
Researcher do use nonprobability samples, but generalizing from them is nearly impossible.
The goal of sampling is to select cases in the final sample that are representative of the population from which they are drawn. A sample is representative if it reproduces the
important characteristics of the population.
![Page 17: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/17.jpg)
Sampling
The fundamental principle of probability sampling is that a sample is very likely to be representative if it is selected by the Equal Probability of Selection Method (EPSEM). Every case in the population must have an
equal chance of ending up in the sample.
![Page 18: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/18.jpg)
Sampling
EPSEM and representativeness are not the same thing. EPSEM samples can be unrepresentative, but
the probability of such an event can be calculated unlike nonprobability samples.
![Page 19: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/19.jpg)
EPSEM Sampling Techniques
Simple random sample – list of cases and a system for selection that ensures EPSEM.
Systematic sampling – only the first case is randomly sample, then a skip interval is used.
Stratified sample – random subsamples on the basis of some important characteristic.
Cluster sampling – used when no list exists. Clusters often based on geography.
![Page 20: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/20.jpg)
The Sampling Distribution
Once we have selected a probability sample according to some EPSEM procedure, what do we know? We know a great deal about the sample, but
nothing about the population. Somehow, we have to get from the sample to
the population. The instrument used is the sampling
distribution.
![Page 21: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/21.jpg)
The Sampling Distribution
The theoretical, probabilistic distribution of a descriptive statistic (such as the mean) for all possible samples of certain sample size (N).
Three distributions are involved in every application of inferential statistics. The sample distribution – empirical, shape, central
tendency and distribution. The population distribution – empirical, unknown. The sampling distribution – theoretical, shape, central
tendency, and dispersion can be deduced.
![Page 22: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/22.jpg)
The Sampling Distribution
The sampling distribution allows us to estimate the probability of any sample outcome.
Discuss the identification of a sampling distribution. Generally speaking, a sampling distribution will be symmetrical, approximately normal, and have the mean of the population.
![Page 23: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/23.jpg)
The Sampling Distribution
If repeated random samples of size N are drawn from a normal population with mean μ and standard deviation σ, then the sampling distribution of sample means will be normal with a mean μ and a standard deviation of σ/N (standard error of the mean).
![Page 24: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/24.jpg)
The Sampling Distribution
Central Limit Theorem. If repeated random samples of size N are drawn from
any population, with mean μ and standard deviation σ, then, as N becomes large, the sampling distribution of sample means will approach normality, with mean μ and standard deviation σ/N.
The theorem removes normality constraint in population.
Rule of thumb: N100.
![Page 25: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/25.jpg)
The Sampling Distribution
![Page 26: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/26.jpg)
Estimation Procedures
Bias – does the mean of the sampling distribution equal the mean of the population?
Efficiency – how closely around the mean does the sampling distribution cluster. You can improve efficiency by increasing sample size.
![Page 27: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/27.jpg)
Estimation Procedures
Point estimate – construct a sample, calculate a proportion or mean, and estimate the population will have the same value as the sample. Always some probability of error.
![Page 28: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/28.jpg)
Estimation Procedures
Confidence interval – range around the sample mean. First step: determine a confidence level: how much
error are you willing to tolerate. The common standard is 5% or .05. You are willing to be wrong 5% of the time in estimating populations. This figure is known as alpha or α. If an infinite number of confidence intervals are constructed, 95% will contain the population mean and 5% won’t.
![Page 29: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/29.jpg)
Estimation Procedures
We now work in reverse on the normal curve. Divide the probability of error between the upper
and lower tails of the curve (so that the 95% is in the middle), and estimate the Z-score that will contain 2.5% of the area under the curve on either end. That Z-score is ±1.96.
Similar Z-scores for 90% (alpha=.10), 99% (alpha=.01), and 99.9% (alpha=.001) are ±1.65, ±2.58, and ±3.29.
![Page 30: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/30.jpg)
Estimation Procedures
mean theoferror standard population the
level alpha by the determined as score ZtheZ
mean sample the
interval confidence..
where
..
N
X
ic
NZXic
![Page 31: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/31.jpg)
Estimation Procedures – Sample Mean
mean theoferror standard the1
level alpha by the determined as score ZtheZ
mean sample the
interval confidence..
where
1..
n
s
X
ic
n
sZXic
Only use if sample is 100 or greater
![Page 32: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/32.jpg)
Estimation – Proportions Large Sample
NZPic
N
PPZPic
s
uus
25...
estimates. veconservatimost for the .5P use
)1(..
Use only if sample size is greater than 100
![Page 33: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/33.jpg)
Estimation Procedures
You can control the width of the confidence intervals by adjusting the confidence level or alpha or by adjusting sample size.
![Page 34: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/34.jpg)
Confidence Interval Examples
55.1325.1265.9.12156
46.229.39.12
1..
41.1339.1251.9.12156
46.258.29.12
1..
29.1351.1239.9.12156
46.296.19.12
1..
n
sZXic
n
sZXic
n
sZXic
Birmingham Fair Housing Survey Education with 95%, 99%, and99.9% confidence intervals.
![Page 35: PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.](https://reader038.fdocuments.us/reader038/viewer/2022103022/56649d4e5503460f94a2e3a6/html5/thumbnails/35.jpg)
Confidence Interval Examples
Proportion of sample who believe that discrimination is a major problem in Birmingham.
333.095.119.214.)0361(.29.3214.0013.29.3214.192
25.29.3214.
25...
307.121.093.214.)0361(.58.2214.0013.58.2214.192
25.58.2214.
25...
285.143.071.214.)0361(.96.1214.0013.96.1214.192
25.96.1214.
25...
us
us
us
PN
ZPic
PN
ZPic
PN
ZPic