Confidence Intervals for Differences in Population Proportions (College Board AP® Statistics)

Revision Note

Mark Curtis

Expertise

Maths

Two-sample z-interval for difference in population proportions

What is a confidence interval for the difference between two population proportions?

  • A confidence interval for the difference between two population proportions is

    • a symmetric range of values centered about the difference between two sample proportions

    • designed to capture the actual value of the difference between the two population proportions

  • Different samples generate different confidence intervals

    • e.g. a difference of sample proportions of 0.2 may have a confidence interval of (0.15, 0.25)

How do I calculate a confidence interval for the difference between two population proportions?

  • The confidence interval for the difference between two population proportions is given by

    • difference space between space sample space proportions plus-or-minus open parentheses critical space value close parentheses open parentheses standard space error space of space sample space proportions space of space two space populations close parentheses

  • Where:

    • The difference between the two sample proportions is calculated from the samples or is given to you

    • The critical value is the relevant z-value

      • The critical value depends on the confidence level C%

    • The standard error is an estimate of how different the difference between two population proportions is likely to be from the difference between the two sample proportions, square root of fraction numerator p with hat on top subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses over denominator n subscript 1 end fraction plus fraction numerator p with hat on top subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses over denominator n subscript 2 end fraction end root

Exam Tip

The general formula for confidence intervals (including a table of standard errors) is given in the exam: statistic plus-or-minus open parentheses critical space value close parentheses open parentheses standard space error space of space statistic close parentheses.

You will need to apply it appropriately using the difference between the sample proportions and the standard error of the difference of the sample proportions.

What are the conditions for a confidence interval for a difference in population proportions?

  • When calculating a two-sample z-interval for a difference in population proportions, you must show that it meets the following conditions:

    • Items in the two samples (or experiment) must satisfy the independence condition

      • by verifying that data is collected by random sampling

      • or random assignment (in an experiment)

      • and, if sampling without replacement, showing that both sample sizes are less than 10% of their population size

    • The sampling distribution of p with hat on top subscript 1 minus p with hat on top subscript 2 must be approximately normal, by verifying that

      • n subscript 1 p with hat on top subscript 1 greater or equal than 10

      • n subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses greater or equal than 10

      • n subscript 2 p with hat on top subscript 2 greater or equal than 10

      • n subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses greater or equal than 10

Exam Tip

Some exam questions may change the four greater or equal than 10 conditions into four greater or equal than 5 conditions (changing the 10 into a 5), though this will be made clear in the question.

What is the margin of error?

  • The margin of error is the half-width of the confidence interval

    • margin space of space error equals open parentheses critical space value close parentheses open parentheses standard space error space of space the space difference space of space the space sample space proportions close parentheses

  • The confidence interval is

    • difference space in space sample space proportions plus-or-minus margin space of space error

  • The total width of a confidence interval is 2 cross times margin space of space error

  • You may be given an interval and asked to calculate its margin of error

    • or another value, such as n

      • This involves forming and solving an equation

Exam Tip

You need to know that the width of a confidence interval increases as the confidence level increases, whereas it decreases as the sample sizes increase!

How do I interpret a confidence interval for a population mean?

  • You must conclude calculations of a confidence interval by referring to the context

    • Start by saying 'we can be C% confidence that the interval from [lower limit] to [upper limit]...'

      • using the limits from the confidence interval

    • then end with it capturing the difference between the population proportions in context

      • e.g. 'captures the actual difference between the proportion of left-handed students in School A and the proportion of left-handed students in School B'

  • Confidence intervals for differences may have negative limits

    • This means that the difference, p subscript 1 minus p subscript 2, is negative

      • so p subscript 1 less than p subscript 2

How do I use confidence intervals to justify a claim about a population proportions difference?

  • If the difference in population proportions is claimed to be a specific value

    • check if that value lies in your confidence interval

  • If it does, the sample data provides sufficient evidence that the difference in population proportions is that value

    • If it does not, the sample data does not provide sufficient evidence that the difference in population proportions is that value

  • Look out for confidence intervals for differences that contain zero

    • This means p subscript 1 minus p subscript 2 equals 0 so there is evidence to suggest p subscript 1 equals p subscript 2

Worked Example

Nova University and Terra University have over 10,000 students each. A random sample of 200 students at Nova University and a random sample of 150 students from Terra University were asked to complete a survey to measure their level of smartphone addiction. The results showed that 35% of the students sampled from Nova University were addicted to their smartphones, while 28% of the students sampled from Terra University were addicted to their smartphones.

Construct a 95% confidence interval for the difference in the proportion of students addicted to smartphones at Nova University and the proportion of students addicted to smartphones at Terra University.

Answer:

Define the population parameters, p subscript 1 and p subscript 2

Let p subscript 1 be the proportion of all students at Nova University who are addicted to their smartphones

Let p subscript 2 be the proportion of all students at Terra University who are addicted to their smartphones

State the type of interval being used and verify that the conditions for the interval are met

The correct inference procedure is a two-sample z-interval for the difference in population proportions at a 95% confidence level

  • The independence condition is satisfied, as

    • both samples were selected randomly

    • the sample size from Nova University, 200, is less than 10% of the total number of students at Nova University (10% of 'over 10,000' is 'over 1000')

    • the sample size from Terra University, 150, is less than 10% of the total number of students at Terra University (10% of 'over 10,000' is 'over 1000')

      • These conditions are required as sampling was conducted without replacement

  • The sample size is large enough for the sampling distribution of the difference in sample proportions to be approximately normally distributed, because the following conditions are satisfied

    • n subscript 1 p with hat on top subscript 1 equals 200 times 0.35 equals 70 greater or equal than 10

    • n subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses equals 200 times open parentheses 1 minus 0.35 close parentheses equals 130 greater or equal than 10

    • n subscript 2 p with hat on top subscript 2 equals 150 times 0.28 equals 42 greater or equal than 10

    • n subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses equals 150 times open parentheses 1 minus 0.28 close parentheses equals 108 greater or equal than 10

List the sample sizes, n subscript 1 and n subscript 2, the sample proportions, p with hat on top subscript 1 and p with hat on top subscript 2, and calculate the standard error of the difference in sample proportions, square root of fraction numerator p with hat on top subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses over denominator n subscript 1 end fraction plus fraction numerator p with hat on top subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses over denominator n subscript 2 end fraction end root

table row cell n subscript 1 end cell equals 200 row cell n subscript 2 end cell equals 150 row cell p with hat on top subscript 1 end cell equals cell 0.35 end cell row cell p with hat on top subscript 2 end cell equals cell 0.28 end cell row cell square root of fraction numerator p with hat on top subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses over denominator n subscript 1 end fraction plus fraction numerator p with hat on top subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses over denominator n subscript 2 end fraction end root end cell equals cell square root of fraction numerator 0.35 open parentheses 1 minus 0.35 close parentheses over denominator 200 end fraction plus fraction numerator 0.28 open parentheses 1 minus 0.28 close parentheses over denominator 150 end fraction end root equals 0.0498146... end cell end table

Find the z-score (critical value) for a confidence level of 95%, e.g. from the tables

Remember that a confidence level of 95% is 5% in both tails combined, so use 2.5% for a single tail in the table
(Alternatively, the row for t subscript infinity in the t-tables are z-scores, together with the corresponding 'Confidence level C' shown below)

z-score = 1.960

Calculate the confidence interval using the formula given to you in the exam, Confidence space interval equals statistic plus-or-minus open parentheses critical space value close parentheses space open parentheses standard space error space of space statistic close parentheses

table row CI equals cell open parentheses p with hat on top subscript 1 minus p with hat on top subscript 2 close parentheses plus-or-minus z times square root of fraction numerator p with hat on top subscript 1 open parentheses 1 minus p with hat on top subscript 1 close parentheses over denominator n subscript 1 end fraction plus fraction numerator p with hat on top subscript 2 open parentheses 1 minus p with hat on top subscript 2 close parentheses over denominator n subscript 2 end fraction end root end cell row blank equals cell open parentheses 0.35 minus 0.28 close parentheses plus-or-minus 1.960 times 0.0498146... end cell end table

State the confidence interval

open parentheses negative 0.0276 comma space 0.1676 close parentheses

Explain the confidence interval in the context of the question

We can be 95% confident that the interval from -0.0276 to 0.1676 captures the actual value of the difference in the proportion of students addicted to smartphones at Nova University and the proportion of students addicted to smartphones at Terra University

You've read 0 of your 10 free revision notes

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Mark Curtis

Author: Mark Curtis

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.