Random Sampling Methods (College Board AP® Statistics)
Study Guide
Systematic sampling
What is systematic sampling?
A systematic sample is one in which you jump periodically (systematically) in regular intervals to select elements from the population to be in your sample
A systematic sample of size from a population of size uses intervals of size
If this is a decimal, round to the nearest whole number
Randomly select a starting element in the first interval
Then jump by the interval amount to get the other elements
If necessary, wrap back around the list again to keep going
Is a systematic sample a type of simple random sample (SRS)?
No, a systematic sample is not a type of simple random sample (SRS)
In an SRS, every possible sample of size is equally likely
But in a systematic sample, two elements next to each other would never be selected
However, a systematic sample is still a type of 'random sample', as selecting the first element is done randomly
Worked Example
A database orders 200 users alphabetically by surname.
(a) Describe how to take a systematic sample of 40 users from the database.
Answer:
The interval size is 5 as 200 divided by 40 is 5
Label the first 5 users from the database with the numbers from 1 to 5
Then use a random number generator to generate a random number between 1 and 5 (inclusive)
Select the user that corresponds to the random number to be your starting point
Then select every 5th user on the list after this starting point until 40 users in total have been selected
This creates a systematic sample of size 40
(b) Would a simple random sample of 40 users be more fair than the systematic sample in part (a)? Explain your answer.
Answer:
Yes, a simple random sample of 40 users would be more fair than a systematic sample of 40 users
In a systematic sample, users with surnames that are next to each other alphabetically cannot be in the same sample
But in a simple random sample, every sample of size 40 is equally likely so those same users could appear in the same sample together
Stratified sampling
What is a stratified sampling?
In a stratified sample, the population is first divided into distinct groups called strata (singular: stratum)
The strata cover the whole population but are non-overlapping
The same element cannot appear in multiple strata
Elements are chosen for the sample using simple random sampling from each strata in the population
This can either be done proportionately:
where the number of elements from each stratum in the sample correspond to the proportion of the population in that stratum
A possible formula is shown below
Or it can be done disproportionately:
where the number of elements from each stratum does not need to correspond to the proportions in the population
e.g. randomly selecting 20 elements from each strata
Should I use a stratified sample or a simple random sample?
If the population can be split into obvious non-overlapping groups then a stratified sample will always be more representative of the population structure than a simple random sample
Elements from every group are guaranteed to be included in the sample
This means a sample cannot all come from, say, the group with the lowest values
A stratified sample is more balanced across the groups
This makes any estimates of population parameters less extreme (more precise)
So there is less variability in estimates of population parameters
However, if the sample size is very small, it may not be worth the time to split into groups
A simple random sample would be quicker
Examiner Tips and Tricks
After calculating the numbers to be chosen from each stratum, add them back up again to check they equal to the total sample size!
Worked Example
In David's school there are 636 students, 36 teachers, and 48 non-teaching staff. David wishes to choose a stratified sample of 60 people from the school.
Calculate the numbers of students, teachers and non-teaching staff that David should include in his sample.
Answer:
First find the total number of people in the school
To find the number to sample from each stratum use
Check to make sure these numbers add up to 60
53 + 3 + 4 = 60
53 students, 3 teachers and 4 non-teaching staff
Cluster sampling
What is cluster sampling?
In a cluster sample, the population is first divided into groups called clusters
Within each cluster there should be a good spread of different elements
Different clusters should have similar overall compositions
Then a simple random sample is used to randomly select one or more clusters out of the possible clusters
All elements in the selected clusters are then used in the sample
What is the difference between cluster sampling and stratified sampling?
Cluster sampling means using all elements from some randomly selected groups in the population
Whereas stratified sampling means using some elements from all groups in the population
Stratified samples are more representative of the population than cluster samples
But cluster samples can be faster and cheaper than stratified samples
The groupings used for cluster samples should have a good mix of different elements (heterogeneity)
whereas the groupings used for stratified samples should contain similar type of elements (homogeneity)
Worked Example
Employees at a postal company are to be interviewed. The employees work in one of ten different post offices owned by the company.
Four of the post offices are randomly selected and the employees from these four post offices are interviewed.
Which sampling method is being used?
(A) Stratified sampling
(B) Systematic sampling
(C) Cluster sampling
(D) Simple random sampling
Answer:
The population of employees is split across ten different post offices (groups) so could be stratified or cluster sampling
Four groups are randomly selected and all employees from these groups are interviewed (all elements of some groups are used, so it is cluster sampling)
The correct answer is C
Sign up now. It’s free!
Did this page help you?