SAMPLING TECHNIQUES Mrs.S. Valarmathi. M.sc., Mphil., Research Officer, Department of Epidemiology...
-
Upload
kory-heath -
Category
Documents
-
view
218 -
download
5
Transcript of SAMPLING TECHNIQUES Mrs.S. Valarmathi. M.sc., Mphil., Research Officer, Department of Epidemiology...
SAMPLING TECHNIQUES
Mrs.S. Valarmathi. M.sc., Mphil.,Research Officer,Department of Epidemiology The Tamil Nadu Dr. MGR Medical University
Is it possible to taste the whole sambar and add salt
NO
Is it possible to work out what 50 million people think by asking only 1000?
YES
What exactly IS a Population ? The entire group under study as defined by research objectives. Sometimes called the “universe.”
The totality or aggregate of all individuals with the specified characteristics is a population
What exactly IS a “sampling”?
Selecting and studying a small number of subjects from a
specified population in order to draw inferences about the
whole population
Sampling TerminologyWho do you want
to generalize toWhat population can you get access to ?
THEORETICAL POPULATION STUDYPOPULATION
Sampling TerminologyWho do you want
to generalize toWhat population can you get access to ?
How Can you get access to them?
SAMPLING FRAMETHEORETICAL POPULATION STUDYPOPULATIONTHEORETICAL
POPULATION
Sampling TerminologyWho do you
want to generalize to
What population can you get access to ?
How Can you get access to them?
Who is in your study?
THE SAMPLETHEORETICAL POPULATION
STUDYPOPULATION
SAMPLING FRAME
Sampling and representative ness
Sample
Theoretical Population
StudyPopulation
Theoretical Population Study Population Sample
Sample
• Representativeness express the degree to which the sample data precisely characterize the population.
• Sample should reflect the study character of the population .
• Strength of statistical inference also depends on representativeness.
• Confidence level 95%, 99% for population
When you take a sample from a population, you only have a subset of the population - a piece of what you’re trying to understand.
S
S
S
P
Why Sampling Errors ?Sampling error can be reduced simply by increasing the sample size!
The sampling distribution
• The distribution of an infinite number of samples of the same size as the sample in your study is known as the sampling distribution.
Standard Error
• The standard deviation of the sampling distribution.
• It tells us something about how different samples would be distributed
• A measure of samplingvariability
Systematic / Non-sampling Errors
• Occurs whether a census or a sample is being used.
• Results solely from the manner in which the observations are made.
• Cannot be measured.
Types of Non-sampling Errors
• Coverage error
• Non response error
• Measurement error
Excluded from frame.
Follow up on non responses.
Bad Question!
TYPES OF SAMPLING
Cluster
Sampling
Non-Probability Sampling
Convenience
Probability Sampling
Simple Random
Systematic
Stratified
Quota
Judgement Snowball
Convenience Sampling
The sample is identified primarily by convenience.Examples:
“Man on the street”Medical student in the libraryVolunteer samplesPatient coming to OP
Problem : No evidence for representativeness.
HAPZHARD SAMPLE
Judgment Sampling
The sampling procedure in which an experienced research selects the sample
based on some appropriate characteristic of sample members… to serve a purpose (Purposive sampling, Deliberate sampling)
Quota Sampling
Attempt to be representative by selecting sample elements in proportion to their
known incidence in the population
Snowball samplingTypically used in qualitative researchWhen members of a population are difficult to locate, hidden activity groups, non-cooperative groupsRecruit one respondent, who identifies others, who identify others,….Primarily used for exploratory purposes
Respondent Driven Sampling
• Applicable for Hidden, Hard to reach populations – MSM, IDU
• A systematic form of snowball sampling with unique identification procedure.
• Depends social network of target population• Under certain assumptions may be treated as
a Random sample
Steps involved in RDS
• Begin with a small set of identified seeds.• Seeds recruit peers, who recruit their peers,
etc., continued till required sample size is achieved.
• Recruits are linked by coupons with unique identifying numbers.
• Incentives provided for participation and each successful recruit.
RDS: Advantages
No need of sampling frame / mapping
Ease of field operations - Target members recruit samples for you.
Reach less visible segment of population
Non Probability Sampling Methods
Convenience sampling relies upon convenience and access
Judgment sampling relies upon belief that participants fit characteristics
Quota sampling emphasizes representation of specific characteristics
Snowball sampling relies upon respondent referrals of others with like characteristics
Probability samples A sampling that selects subjects with a known, non zero, probability.
Removes possibility of bias in selection of subjects.
Allows application of statistical theory to results.
Important when one wishes to generalize the findings of the sample to the larger population from which samples are selected.
Simple random sampling
Applicable when population is small, homogeneous & readily available
Required number of units are selected randomly.
Each unit of the frame has an equal non zero probability of selection.
Simple random samplingMerits
• Easy to implement if list frame available or small population
• Approximately satisfies the sampling model on which conventional statistics is based, so we can carry out complex analyses
Demerits•Need complete list of units
•Units may be scattered
Simple random sampling
Example: evaluate the prevalence of hypertension among the 1200 children attending schoolin the age group 14 to 17 years.
List of children attending the schoolChildren numerated from 1 to 1200Sample size = 100 childrenRandom sampling of 100 numbers between 1 and 1200
57172 42088 70098 11333 26902 29959 43909 4960733883 87680 28923 15659 09839 45817 89405 7074377950 67344 10609 87119 15859 74577 42791 7588911607 11596 01796 24498 17009 67119 00614 4952956149 55678 38169 47228 49931 94303 67448 3128680719 65101 77729 83949 83358 75230 56624 2754993809 19505 82000 79068 45552 86776 48980 5668440950 86216 48161 17646 24164 35513 94057 5183412182 59744 65695 83710 41125 14291 74773 6639113382 48076 73151 48724 35670 38453 63154 5811638629 94576 48859 75654 17152 66516 78796 7309960728 32063 12431 23898 23683 10853 04038 7524601881 99056 46747 08846 01331 88163 74462 1455123094 29831 95387 23917 07421 97869 88092 7220115243 21100 48125 05243 16181 39641 36970 9952253501 58431 68149 25405 23463 49168 02048 3152207698 24181 01161 01527 17046 31460 91507 1605022921 25930 79579 43488 13211 71120 91715 4988168127 00501 37484 99278 28751 80855 02035 1091055309 10713 36439 65660 72554 77021 46279 2270592034 90892 69853 06175 61221 76825 18239 4768750612 84077 41387 54107 09190 74305 68196 7563481415 98504 32168 17822 49946 37545 47201 8522438461 44528 30953 08633 08049 68698 08759 4561107556 24587 88753 71626 64864 54986 38964 8353460557 50031 75829 05622 30237 77795 41870 26300
Table of random numbers
Generating Random Numbers
•This is a better and perhaps more efficient for selecting a simple random sample. •Computers and even your calculators can be used to generate random digits. The randomly produced digits can be used to pick your samples. •However, a complete listing of the members of the population is needed in this type of random selection.
Systematic random sampling
The defined target population is ordered and the sample is selected according to position using a skip interval
Systematic random sampling
Systematically spreads sample through a list of population membersIn nearly all practical examples, the procedure results in a sample equivalent to SRSINTERVAL SAMPLING
Systematic random samplingN = 1200, and n = 60
sampling interval = 1200/60 = 20
List persons from 1 to 1200Randomly select a number between 1 and 20 (ex : 8)
1st person selected = the 8th on the list 2nd person = 8 + 20 = the 28th
etc .....
1. Careful that there is no systematic rhythm to the flow or list of people.
2. If every Kth person on the list is, say, “rich” or “senior” or some other consistent pattern, avoid this method
Stratified Random Sampling
A method of probability sampling in which the population is divided
into different subgroups and samplesare selected from each
Proportional Allocation Method
Epidemiological profile of tuberculosis under 12 years of age. Sample size is 120 centre 1 - 56% -67centre 2– 24% - 29Centre 3 – 20% - 24
Equal Allocation Method
Epidemiological profile of tuberculosis under 12 years of age. Sample size is 120 centre 1 - 40centre 2– 40Centre 3 – 40
CHENNAI
NORTH CENTRAL EAST
GOVT.
5 SCHOOLS
SOUTH
PRIVATE GOVT. PRIVATE GOVT. PRIVATE GOVT. PRIVATE
5 SCHOOLS
EACH10
STUDENTS
5 GIRLS
5 BOYS
EACH10
STUDENTS
5 GIRLS
5 BOYS
Cluster Sampling
• Population by it self is divided into number of natural groups known as clusters (geographic or organizational) .
• The units are heterogeneous within cluster but homogeneous between clusters.
• Cluster sample is obtained by selecting the clusters by simple random sampling and all the units in the sampled clusters are included in the sample
Cluster Sampling
• Advantages– Sampling frame is not required– Simple and Easy– Less resources required
• Disadvantages– Imprecise if units within clusters are
homogeneous
Cluster Sampling
Randomly select Clusters and select all subjects
Randomly select Clusters and select subjects randomly
Cluster Sampling
Especially useful for door-to-door personal surveys (significantly reduces costs)
However, clustering increases sampling errors (people who live close together tend to be more similar)
Drawing the clusters
You need :Map of the regionDistribution of population (by Taluks or area)Age distribution (population 5-12:3%)
1600220
3200400800200
1200200
1600
530007300
1060001300026500
660040000
660053000
MetturSankariSalem
EdapadiOmalurYercaud
VazhappadiAttur
Gangavalli
5-12Pop.Taluks
1600220
3200400800200
1200200
1600
MetturSankariSalem
EdapadiOmalurYercaud
VazhappadiAttur
Gangavalli
5-12Taluks
Total = 9420
1600220
3200400800200
1200200
1600
MetturSankariSalem
EdapadiOmalurYercaud
VazhappadiAttur
Gangavalli
5-12Taluks
Total = 9420
Then compute sampling fraction :K = 9420/30 = 314
51
10121415
Drawing households and childrenOn the spot
Go to the center of the Taluk , choose direction (random)
Number the houses in this direction Ex: 21
Draw random number (between 1 and 21) to identify the first house to visit
From this house progress until finding the 7 children ( itinerary rules fixed beforehand)
Multistage Sampling
• Selection of subjects is done in stage by stage
• Any one of the sampling schemes can be applied during each stage
• Multistage sampling generally ends with unequal probability to sampling unit
• Analysis procedure becomes more complex.
Multistage Sampling
• Advantages
– No sampling frame of population required
– Most feasible approach for large populations
• Disadvantages
– Several sampling lists
– Needs more man power
Multi Stage Sampling
• District Level Household & Facility Survey– Stage – 1 Selection of District– Stage – 2 Selection of Villages– Stage – 3 Selection of Households
• Immunization coverage in a state– Stage – 1 Selection of District– Stage – 2 Selection of PHCs– Stage – 3 Selection of Subcenters
Probability Sampling Methods
Simple random sampling relies upon simple randomization
Systematic sampling relies upon on the sampling interval
Stratified sampling emphasizes dividing into groups and subgroups
Cluster sampling relies upon geographical or organizational groupsCluster sampling relies upon geographical or organizational groupsCluster sampling relies upon geographical or organizational groups
Factors Affecting Choice of Sampling Design
Sampling Frame: Existence and Size
Costs
Precision Desired
Sub-Population Comparisons
TO SUMMARIZE
•Population•Sample•Standard Error•Types Of Sampling•Non Probability sampling•Probability Sampling
Social actors are not predictable like objects.
Randomized events are irrelevant to social life.
Probability sampling is expensive and inefficient.
Therefore…
Non-probability sampling is the best approach.
We want to generalize to the population.
Random events are predictable.
Therefore…We can compare random events to our results.
Probability sampling is the best approach.
Conclusions
Probability samples are the best Beware of …
•refusals•absentees•“do not know”
Ensure •Validity•Precision …..within available constraints
Professor: Hope u understand the sampling techniquesStudent: It is impossible to draw conclusion for the whole population by drawing samplesProfessor explained the whole thing againStudent: I am not convinced
Professor: Well, so next time when you go for a blood test ask them to extract all the blood from your body