Random Sampling Methods (College Board AP® Statistics)

Study Guide

Mark Curtis

Written by: Mark Curtis

Reviewed by: Dan Finlay

Systematic sampling

What is systematic sampling?

  • A systematic sample is one in which you jump periodically (systematically) in regular intervals to select elements from the population to be in your sample

  • A systematic sample of size nfrom a population of size N uses intervals of size N over n

    • If this is a decimal, round to the nearest whole number

  • Randomly select a starting element in the first interval

    • Then jump by the interval amount to get the other elements

      • If necessary, wrap back around the list again to keep going

Is a systematic sample a type of simple random sample (SRS)?

  • No, a systematic sample is not a type of simple random sample (SRS)

    • In an SRS, every possible sample of size n is equally likely

    • But in a systematic sample, two elements next to each other would never be selected

  • However, a systematic sample is still a type of 'random sample', as selecting the first element is done randomly

Worked Example

A database orders 200 users alphabetically by surname.

(a) Describe how to take a systematic sample of 40 users from the database.

Answer:

The interval size is 5 as 200 divided by 40 is 5

Label the first 5 users from the database with the numbers from 1 to 5

Then use a random number generator to generate a random number between 1 and 5 (inclusive)

Select the user that corresponds to the random number to be your starting point

Then select every 5th user on the list after this starting point until 40 users in total have been selected

This creates a systematic sample of size 40

(b) Would a simple random sample of 40 users be more fair than the systematic sample in part (a)? Explain your answer.

Answer:

Yes, a simple random sample of 40 users would be more fair than a systematic sample of 40 users

In a systematic sample, users with surnames that are next to each other alphabetically cannot be in the same sample

But in a simple random sample, every sample of size 40 is equally likely so those same users could appear in the same sample together

Stratified sampling

What is a stratified sampling?

  • In a stratified sample, the population is first divided into distinct groups called strata (singular: stratum)

    • The strata cover the whole population but are non-overlapping

      • The same element cannot appear in multiple strata

  • Elements are chosen for the sample using simple random sampling from each strata in the population

    • This can either be done proportionately:

      • where the number of elements from each stratum in the sample correspond to the proportion of the population in that stratum

      • A possible formula is shown below

      number space from space stratum equals fraction numerator size space of space stratum over denominator size space of space population end fraction cross times size space of space sample

    • Or it can be done disproportionately:

      • where the number of elements from each stratum does not need to correspond to the proportions in the population

      • e.g. randomly selecting 20 elements from each strata

Should I use a stratified sample or a simple random sample?

  • If the population can be split into obvious non-overlapping groups then a stratified sample will always be more representative of the population structure than a simple random sample

    • Elements from every group are guaranteed to be included in the sample

    • This means a sample cannot all come from, say, the group with the lowest values

      • A stratified sample is more balanced across the groups

      • This makes any estimates of population parameters less extreme (more precise)

      • So there is less variability in estimates of population parameters

  • However, if the sample size is very small, it may not be worth the time to split into groups

    • A simple random sample would be quicker

Examiner Tips and Tricks

After calculating the numbers to be chosen from each stratum, add them back up again to check they equal to the total sample size!

Worked Example

In David's school there are 636 students, 36 teachers, and 48 non-teaching staff. David wishes to choose a stratified sample of 60 people from the school.

Calculate the numbers of students, teachers and non-teaching staff that David should include in his sample.

Answer:

First find the total number of people in the school

636 plus 36 plus 48 equals 720

To find the number to sample from each stratum use number space from space stratum equals fraction numerator size space of space stratum over denominator size space of space population end fraction cross times size space of space sample

students colon space space 636 over 720 cross times 60 equals 53

teachers colon space space 36 over 720 cross times 60 equals 3

non minus teaching space staff colon space space 48 over 720 cross times 60 equals 4

Check to make sure these numbers add up to 60

53 + 3 + 4 = 60

53 students, 3 teachers and 4 non-teaching staff

Cluster sampling

What is cluster sampling?

  • In a cluster sample, the population is first divided into groups called clusters

    • Within each cluster there should be a good spread of different elements

      • Different clusters should have similar overall compositions

  • Then a simple random sample is used to randomly select one or more clusters out of the possible clusters

    • All elements in the selected clusters are then used in the sample

What is the difference between cluster sampling and stratified sampling?

  • Cluster sampling means using all elements from some randomly selected groups in the population

  • Whereas stratified sampling means using some elements from all groups in the population

  • Stratified samples are more representative of the population than cluster samples

    • But cluster samples can be faster and cheaper than stratified samples

  • The groupings used for cluster samples should have a good mix of different elements (heterogeneity)

    • whereas the groupings used for stratified samples should contain similar type of elements (homogeneity)

Worked Example

Employees at a postal company are to be interviewed. The employees work in one of ten different post offices owned by the company.

Four of the post offices are randomly selected and the employees from these four post offices are interviewed.

Which sampling method is being used?

(A) Stratified sampling

(B) Systematic sampling

(C) Cluster sampling

(D) Simple random sampling

Answer:

The population of employees is split across ten different post offices (groups) so could be stratified or cluster sampling

Four groups are randomly selected and all employees from these groups are interviewed (all elements of some groups are used, so it is cluster sampling)

The correct answer is C

Last updated:

You've read 0 of your 5 free study guides this week

Sign up now. It’s free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Mark Curtis

Author: Mark Curtis

Expertise: Maths

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.

Dan Finlay

Author: Dan Finlay

Expertise: Maths Lead

Dan graduated from the University of Oxford with a First class degree in mathematics. As well as teaching maths for over 8 years, Dan has marked a range of exams for Edexcel, tutored students and taught A Level Accounting. Dan has a keen interest in statistics and probability and their real-life applications.