Hypothesis Tests for Differences in Population Proportions (College Board AP® Statistics)
Study Guide
Written by: Mark Curtis
Reviewed by: Dan Finlay
Two-sample z-test for difference in population proportions
What is a two-sample z-test for a difference in population proportions?
A two sample z-test is used to test whether or not the population proportions of two independent populations, and , are equal
One random sample of size is taken from the first population
A different random sample of size is taken the second population
The sample proportions are and
The difference in sample proportions is
What are the hypotheses?
The null hypothesis, , is the assumption that there is no difference between the population proportions
e.g. The proportion of left-handed students at both schools is equal,
It is assumed to be correct, unless evidence proves otherwise
It can also be written as
The alternative hypothesis, , is how you think the population proportions might be different to each other
e.g. The proportion of left-handed students in School A is greater than in School B,
Remember that a z-test could be one-tailed or two-tailed ()
Examiner Tips and Tricks
When writing out your hypotheses, always fully define the symbol used for the population parameters in context, e.g. '... where is the proportion of left-handed students in School A and is the proportion of left-handed students in School B'.
What are the conditions required?
When performing a two-sample z-test for a difference in population proportions, you must show that it meets the following conditions:
Items in the two samples (or experiment) must satisfy the independence condition
by verifying that data is collected by random sampling
or random assignment (in an experiment)
and, if sampling without replacement, showing that both sample sizes are less than 10% of their population size
The sampling distribution of must be approximately normal
by first calculating the combined proportion, , given by (which assumes the null hypothesis, ) where (the number of successes in the first sample) and (the number of successes in the second sample)
then using to verify that , , and
The combined proportion, , is also called the pooled proportion
It can only be used when is assumed (like under the null hypothesis)
Examiner Tips and Tricks
The formula for the combined proportion, , is given in the exam, but you need to learn that (the number of successes in the first sample) and (the number of successes in the second sample).
Examiner Tips and Tricks
Some exam questions may change the four conditions into four conditions (changing the 10 into a 5), though this will be made clear in the question.
How do I calculate the standardized test statistic?
The standardized test statistic for a difference in sample proportions is a z-score given by:
where and are the sample proportions
and are the sample sizes
is the combined proportion given by where and
and the zero, 0, highlights that the difference in population proportions is zero under the null hypothesis,
Examiner Tips and Tricks
The formula for the standardized test statistic is given in the exam, , along with tables of parameters and standard errors.
There are two different standard errors for population proportion given in the exam. For hypothesis testing, you need the second one where it says ' is assumed'!
How do I calculate the p-value?
The p-value is the probability of obtaining a test statistic as extreme, or more extreme, than the one observed in the difference of the two samples, assuming the null hypothesis is true
Use the standard normal distribution, , to calculate the probability of being in the extreme region (tail) that extends from the z-score given by the formula above
You can use either the z-tables or a calculator to find this probability
For a two-tail test, remember to work out the total probability across both tails
You can double the p-value from a one-tail test
How do I conclude a hypothesis test?
Conclusions to a hypothesis test need to show two things:
a decision about the null hypothesis
an interpretation of this decision in the context of the question
To make the decision, compare the p-value to the significance level,
If then the null hypothesis should be rejected
If then the null hypothesis should not be rejected
In a two-tailed test, double the p-value and compare this to
Examiner Tips and Tricks
Remember that the test should be interpreted within the context of the question.
Use the same language in your conclusion that is used in the problem, e.g. 'The data provides sufficient evidence that the proportion of left-handed students in School A is greater than the proportion of left-handed students in School B'.
What are the steps on a calculator?
When using a calculator to conduct a z-test for a difference in population proportions, you must still write down all steps of the hypothesis testing process:
State the null and alternative hypotheses and clearly define your parameter
Describe the test being used and show that the situation meets the conditions required
Calculate the standardized test statistic (z-score)
Calculate the p-value using your calculator
Compare the p-value to the significance level
Write down the conclusion to the test and interpret it in the context of the problem
Examiner Tips and Tricks
Even if you perform a z-test for a difference in population proportions on your calculator, it is still important to show all of your working to demonstrate full understanding, including calculating the z-score.
Worked Example
Nova University and Terra University have over 10,000 students each. A random sample of 200 students at Nova University and a random sample of 150 students from Terra University were asked to complete a survey to measure their level of smartphone addiction. The results showed that 35% of the students sampled from Nova University were addicted to their smartphones, while 28% of the students sampled from Terra University were addicted to their smartphones.
Is there sufficient evidence, at a 0.05 level of significance, to conclude that there is a difference in the proportion of students addicted to smartphones at Nova University and Terra University?
Answer:
State the type of test being used and verify the conditions for the test
The correct inference procedure is a two-sample z-test for the difference in population proportions with
The independence condition is satisfied, as
both samples were selected randomly
the sample size from Nova University, 200, is less than 10% of the total number of students at Nova University (10% of 'over 10,000' is 'over 1000')
the sample size from Terra University, 150, is less than 10% of the total number of students at Terra University (10% of 'over 10,000' is 'over 1000')
These conditions are required as sampling was conducted without replacement
The sample size is large enough for the sampling distribution of the difference in sample proportions to be approximately normally distributed, because
the combined proportion is where and
giving
and the following conditions are satisfied
Define the population parameters, and
Let be the proportion of all students at Nova University who are addicted to their smartphones
Let be the proportion of all students at Terra University who are addicted to their smartphones
Write the null and alternative hypotheses
This will be a two-tailed test as a difference is assumed, but no direction is specified
Calculate the standardized test statistic
Find the p-value for one of the tails, , e.g. from the z-tables
Double this probability to find the p-value for both tails
Compare this probability to the significance level and state the conclusion of the test
is not rejected
Interpret this result in the context of the question
There is not sufficient evidence to conclude that there is a difference in the proportion of students addicted to smartphones at Nova University and Terra University
Last updated:
You've read 0 of your 5 free study guides this week
Sign up now. It’s free!
Did this page help you?