Introduction to Hypothesis Testing (College Board AP® Statistics)

Study Guide

Mark Curtis

Written by: Mark Curtis

Reviewed by: Dan Finlay

Introduction to hypothesis testing

What is a hypothesis test?

  • A hypothesis test is a procedure for determining whether or not a population parameter has changed significantly from its previous (or previously assumed or accepted) value

    • e.g. you suspect the mean height of all the trees on an island has increased from its original value of 5.2 meters

What is the null hypothesis?

  • A null hypothesis, straight H subscript 0, is the assumption that the population parameter has not changed

    • e.g. straight H subscript 0: The mean height of all trees on an island is still 5.2 meters (mu equals 5.2)

      • It is assumed to be correct, unless evidence proves otherwise

  • Sometimes a null hypothesis can include an inequality with an equals

    • straight H subscript 0 colon space mu less or equal than...

      • but these are still tested at the boundary, mu equals...

What is the alternative hypothesis?

  • An alternative hypothesis, straight H subscript a, is how you think the population parameter has changed

    • e.g. straight H subscript a colon The mean height of all trees on an island has increased from 5.2 meters (mu greater than 5.2)

      • It is the situation for which evidence is being collected

Examiner Tips and Tricks

After writing out your hypotheses, always fully define the symbol used for the population parameter in context, e.g. '... where mu is the mean weight of all trees on the island'.

What is a one-tailed test?

  • A one-tailed test is when you suspect the population parameter has either increased or decreased

    • i.e. has changed in a particular direction

  • The alternative hypothesis is a strict one-sided inequality

    • straight H subscript a colon space space mu greater than...

    • straight H subscript a colon space space mu less than...

Examiner Tips and Tricks

You will know from the wording of the question if it is a one-tailed test. For example:

  • The manager worries that the mean salary has gone up

  • The biologist suspects a reduction in mean weight

What is a two-tailed test?

  • A two-tailed test is when you suspect the population parameter has changed

    • but you do not know if it has increased or decreased

  • The alternative hypothesis is a not-equal-to statement

    • straight H subscript a colon space space mu not equal to...

Examiner Tips and Tricks

Common wording used for two-tailed tests include:

  • The mathematician feels that the mean scores are different to last year

  • The scientist suspects a change in mean temperature

  • The journalist does not know if it is rising or falling

What is a test statistic?

  • To try to prove your case, you take a recent sample from the population and calculate an unbiased estimate of the population parameter from that sample

    • This estimate is called a test statistic

    • e.g. you randomly sample 30 trees and calculate their mean height of 6.5 m

      • The test statistic is 6.5 m

  • The type of test statistic will depend on what you are trying to prove

    • Use a sample mean to prove a change in population mean

    • Use a sample proportion to prove a change in population proportion

Examiner Tips and Tricks

Do not confuse the test statistic (e.g. the sample mean) with the population parameter (e.g. the mean of the whole population)!

What is the standardized test statistic?

  • It is not enough to just take a sample, work out a test statistic from it, then use that value to say the population parameter has changed

    • A test statistic will generally not be exactly equal to the population parameter it is estimating

    • You need to know whether the test statistic is extreme or not, compared to the population parameter assumed by straight H subscript 0

  • Instead, you need a measure of how far the test statistic is from the population parameter

    • This is called the standardized test statistic, given by:

      • fraction numerator statistic minus parameter over denominator standard space error space of space the space statistic end fraction

    • It shows how many standard errors you are from the population parameter

      • The standard error is an estimate of the population standard deviation from the data

Examiner Tips and Tricks

The formula for the standardized test statistic is given in the exam, along with tables of parameters and standard errors.

What are conditions for a hypothesis test?

  • Each type of hypothesis test has its own set of conditions that must be met

  • A common condition is the independence condition which states that items in the sample (or experiment) must be independent

    • This is checked in two stages, first:

      • verify that data is collected by random sampling

      • or random assignment (in an experiment)

    • Second, if sampling without replacement:

      • verify that the sample size is less than or equal to 10% of the population size

      • Sometimes written n less or equal than 0.1 N

  • Another common condition is normality of the population or sampling distribution

    • The distribution needs to be approximately symmetric

    • There should be no outliers

Examiner Tips and Tricks

You must learn the conditions relevant to each type of hypothesis test.

What is the p-value?

  • To prove a test statistic is extreme when compared to the old population parameter

    • you need a probability that shows it is unlikely to have happened under the old population parameter

      • i.e. unlikely under the null hypothesis

  • The p-value is the probability of obtaining a test statistic as extreme, or more extreme, than the one observed in the sample, assuming the null hypothesis is true

    • Note that this is a probability of an extreme region

      • not just of one extreme value

  • The more extreme the test statistic is compared to the sample, the smaller the p-value

    • and the more evidence there is to suggest that the null hypothesis can be rejected

How do I calculate the p-value?

  • The p-value is calculated by finding probabilities from the sampling distribution for that test statistic

    • Use the population parameter from the null hypothesis

      • as p-values are calculated assuming the null hypothesis is true

    • Work out the standardized test statistic

      • This allows you to use the appropriate probability tables

What is the significance level?

  • The significance level, alpha, is a probability boundary (threshold) at which you reject the null hypothesis

    • If the p-value is less than the significance level, you reject the null hypothesis

  • It must be set before a hypothesis test is carried out

    • It will usually be 1%, 5% or 10%

      • written as alpha equals 0.01, alpha equals 0.05 or alpha equals 0.1

      • if it is not given in the question, use 5%

How do I conclude a hypothesis test?

  • Conclusions to a hypothesis test need to contain two things:

    • a decision about the null hypothesis

    • an interpretation of this decision in the context of the question

  • To make the decision, compare the p-value to the significance level

    • If p less than alpha then the null hypothesis should be rejected

    • If p greater than alpha then the null hypothesis should not be rejected

  • In a two-tailed test, double the p-value and compare this to alpha

    • This is because there are two regions that are as extreme, or more extreme, than the one observed in the sample

Examiner Tips and Tricks

In the exam, your decision should say whether the null hypothesis should be rejected or not (avoid using 'accepted' or talking about the 'alternative hypothesis' at this point).

How do I interpret a conclusion in context?

  • One or two sentences are needed at the end that interpret the conclusion in context

  • Start by saying whether the data provides sufficient evidence

    • or does not provide sufficient evidence

      • It must mention sufficient

      • It is not correct to say 'no / zero evidence' or 'this proves' or 'we accept that...'

  • Then continue by writing out the relevant hypothesis in context

    • Copy out exact phrases from the question

      • Make sure the population parameter is mentioned in context

  • For example, a good interpretation is:

    • 'The data provides sufficient evidence that the mean height of all trees on the island has increased'

    • but not:

      • 'this proves the heights of all trees have increased'

How do I lay out a hypothesis test in the exam?

  • STEP 1

    Hypotheses:

    • straight H subscript 0 colon space space mu equals...

    • straight H subscript a colon space space mu greater than... or straight H subscript a colon space space mu less than... or straight H subscript a colon space space mu not equal to...

    • Define the population parameter in context

      • '...where mu is the ...'

    • State the significance level, alpha

      • e.g. alpha equals 0.05

  • STEP 2
    Identify the appropriate inference procedure:

    • This is the type of test you want to perform

      • e.g. 'The appropriate inference procedure is a one-sided t-test for a population mean'

  • STEP 3
    Verify any conditions relevant to that test

  • STEP 4
    Find the p-value:

    • Calculate the standardized test statistic

      • fraction numerator statistic minus parameter over denominator standard space error space of space the space statistic end fraction

    • Calculate the p-value

      • Mention any degrees of freedom where relevant

  • STEP 5
    Conclusion:

    • Compare the p-value to alpha

    • Provide a correct decision about straight H subscript 0

      • e.g. 'straight H subscript 0 should be rejected'

    • Interpret the conclusion in context

      • e.g. the data provides sufficient evidence that the mean of ... etc

Last updated:

You've read 0 of your 5 free study guides this week

Sign up now. It’s free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Mark Curtis

Author: Mark Curtis

Expertise: Maths

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.

Dan Finlay

Author: Dan Finlay

Expertise: Maths Lead

Dan graduated from the University of Oxford with a First class degree in mathematics. As well as teaching maths for over 8 years, Dan has marked a range of exams for Edexcel, tutored students and taught A Level Accounting. Dan has a keen interest in statistics and probability and their real-life applications.