Random Sampling Methods (College Board AP® Statistics)

Revision Note

Mark Curtis

Expertise

Maths

Systematic sampling

What is systematic sampling?

  • A systematic sample is one in which you jump periodically (systematically) in regular intervals to select elements from the population to be in your sample

  • A systematic sample of size nfrom a population of size N uses intervals of size N over n

    • If this is a decimal, round to the nearest whole number

  • Randomly select a starting element in the first interval

    • Then jump by the interval amount to get the other elements

      • If necessary, wrap back around the list again to keep going

Is a systematic sample a type of simple random sample (SRS)?

  • No, a systematic sample is not a type of simple random sample (SRS)

    • In an SRS, every possible sample of size n is equally likely

    • But in a systematic sample, two elements next to each other would never be selected

  • However, a systematic sample is still a type of 'random sample', as selecting the first element is done randomly

Worked Example

A database orders 200 users alphabetically by surname.

(a) Describe how to take a systematic sample of 40 users from the database.

Answer:

The interval size is 5 as 200 divided by 40 is 5

Label the first 5 users from the database with the numbers from 1 to 5

Then use a random number generator to generate a random number between 1 and 5 (inclusive)

Select the user that corresponds to the random number to be your starting point

Then select every 5th user on the list after this starting point until 40 users in total have been selected

This creates a systematic sample of size 40

(b) Would a simple random sample of 40 users be more fair than the systematic sample in part (a)? Explain your answer.

Answer:

Yes, a simple random sample of 40 users would be more fair than a systematic sample of 40 users

In a systematic sample, users with surnames that are next to each other alphabetically cannot be in the same sample

But in a simple random sample, every sample of size 40 is equally likely so those same users could appear in the same sample together

Stratified sampling

What is a stratified sampling?

  • In a stratified sample, the population is first divided into distinct groups called strata (singular: stratum)

    • The strata cover the whole population but are non-overlapping

      • The same element cannot appear in multiple strata

  • Elements are chosen for the sample using simple random sampling from each strata in the population

    • The number of elements from each stratum in the sample correspond to the proportion of the population in that stratum

      • A possible formula is shown below

number space from space stratum equals fraction numerator size space of space stratum over denominator size space of space population end fraction cross times size space of space sample

Should I use a stratified sample or a simple random sample?

  • If the population can be split into obvious non-overlapping groups then a stratified sample will always be more representative of the population structure than a simple random sample

    • Elements from every stratum are guaranteed to be included in the sample

  • However, if the sample size is very small, it may not be worth the time to split into groups

    • A simple random sample would be quicker

Exam Tip

After calculating the numbers to be chosen from each stratum, add them back up again to check they equal to the total sample size!

Worked Example

In David's school there are 636 students, 36 teachers, and 48 non-teaching staff. David wishes to choose a stratified sample of 60 people from the school.

Calculate the numbers of students, teachers and non-teaching staff that David should include in his sample.

Answer:

First find the total number of people in the school

636 plus 36 plus 48 equals 720

To find the number to sample from each stratum use number space from space stratum equals fraction numerator size space of space stratum over denominator size space of space population end fraction cross times size space of space sample

students colon space space 636 over 720 cross times 60 equals 53

teachers colon space space 36 over 720 cross times 60 equals 3

non minus teaching space staff colon space space 48 over 720 cross times 60 equals 4

Check to make sure these numbers add up to 60

53 + 3 + 4 = 60

53 students, 3 teachers and 4 non-teaching staff

Cluster sampling

What is cluster sampling?

  • In a cluster sample, the population is first divided into groups called clusters

    • Within each cluster there should be a good spread of different elements

      • Different clusters should have similar overall compositions

  • Then a simple random sample is used to randomly select one or more clusters out of the possible clusters

    • All elements in the selected clusters are then used in the sample

What is the difference between cluster sampling and stratified sampling?

  • Cluster sampling means using all elements from some randomly selected groups in the population

  • Whereas stratified sampling means using some elements from all groups in the population

  • Stratified samples are more representative of the population than cluster samples

    • But cluster samples can be faster and cheaper than stratified samples

  • The groupings used for cluster samples should have a good mix of different elements (heterogeneity)

    • whereas the groupings used for stratified samples should contain similar type of elements (homogeneity)

Worked Example

Employees at a postal company are to be interviewed. The employees work in one of ten different post offices owned by the company.

Four of the post offices are randomly selected and the employees from these four post offices are interviewed.

Which sampling method is being used?

(A) Stratified sampling

(B) Systematic sampling

(C) Cluster sampling

(D) Simple random sampling

Answer:

The population of employees is split across ten different post offices (groups) so could be stratified or cluster sampling

Four groups are randomly selected and all employees from these groups are interviewed (all elements of some groups are used, so it is cluster sampling)

The correct answer is C

You've read 0 of your 10 free revision notes

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Mark Curtis

Author: Mark Curtis

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.