Residuals (College Board AP® Statistics)

Revision Note

Mark Curtis

Expertise

Maths

Residuals

What are residuals?

  • A residual of a data point on a scatterplot is its signed vertical distance from the regression line

    • A positive residual means the point lies above the regression line

    • A negative residual means the point lies below the regression line

  • When a residual is positive, the regression line underestimates the y-value of that data point

    • whereas when a residual is negative, the regression line overestimates it

What is the formula for calculating a residual?

  • The formula for calculating a residual is

    • residual = y-value of the data point - y-value of the regression line at the same x-value

      • i.e. residual = actual y-value - predicted y-value
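The formula can be tried out with a short Python sketch. The regression line y = 1 + 2x and the three data points below are hypothetical values chosen for illustration only; they do not come from this note.

```python
def predict(x, intercept, slope):
    """Predicted y-value on the regression line y = intercept + slope * x."""
    return intercept + slope * x


def residual(actual_y, predicted_y):
    """Residual = actual y-value - predicted y-value."""
    return actual_y - predicted_y


# Hypothetical regression line y = 1 + 2x (assumed for illustration).
intercept, slope = 1, 2

# Hypothetical data points as (x, actual y) pairs.
points = [(1, 5), (2, 4), (3, 7)]

residuals = [residual(y, predict(x, intercept, slope)) for x, y in points]
print(residuals)  # [2, -1, 0]
```

The first point lies 2 units above the line, the second lies 1 unit below it, and the third lies exactly on it.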

Worked Example

A scatterplot and regression line are shown below. Calculate the residual for each of the five data points.

A scatterplot with points shown and a dashed regression line on a grid.

Answer:

The residuals are +2, -3, 0, +3 and -2, shown in brackets on the diagram below.

The residuals +2, -3, 0, +3, -2 shown between data points on a scatterplot and the regression line.
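As a rough check on how to read these values, the sketch below interprets the sign of each residual listed in the worked example. The function name `describe` is made up for illustration, and the zero case (a point lying exactly on the line) is added here because it follows from the formula rather than being stated in the note.

```python
def describe(residual):
    """Say where a data point lies relative to the regression line."""
    if residual > 0:
        return "above the line, so the line underestimates the y-value"
    if residual < 0:
        return "below the line, so the line overestimates the y-value"
    return "on the line, so the prediction matches the y-value"


# Residuals as listed in the worked example.
worked_example_residuals = [2, -3, 0, 3, -2]

for r in worked_example_residuals:
    print(r, "->", describe(r))
```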


Author: Mark Curtis

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.