Correlation & Regression (DP IB Maths: AA HL)

Exam Questions

3 hours · 23 questions
1a
2 marks

A teacher collected the maths and physics test scores of a number of students and drew a scatter diagram to represent this data.

[Figure: scatter diagram of maths and physics test scores]

Describe the correlation shown by the scatter diagram, and interpret the correlation in context.

1b
2 marks

An alternative therapist collected data on his clients’ reported levels of anxiety as well as the number of trees they had hugged in the course of therapy. He drew a scatter diagram to represent this data.

[Figure: scatter diagram of reported anxiety levels and number of trees hugged]

Describe the correlation shown by the scatter diagram, and interpret the correlation in context.

2a
4 marks

Jennifer sells cups of tea at her shop and has noticed that she sells more tea on cooler days. On five different days, she records the maximum daily temperature, T, measured in degrees Celsius, and the number of cups of tea sold, C. The results are shown in the following table.

| Maximum daily temperature, T (°C) | 3 | 5 | 8 | 9 | 12 |
| Cups of tea sold, C | 37 | 34 | 33 | 26 | 21 |


The relationship between T and C can be modelled by the regression line of C on T with equation $C = aT + b$.

(i)
Find the value of a and the value of b.

(ii)
Write down the value of the Pearson’s product-moment correlation coefficient, r.
2b
2 marks

Use your regression equation from part (a)(i) to estimate the number of teas that Jennifer will sell on a day when the maximum temperature is 11°C.

2c
2 marks

Being sure to consider the result from part (a)(ii) in your answer, state how confident you would be in your estimate from part (b).
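As a rough check on parts (a) to (c), the least-squares formulas can be evaluated directly. The sketch below is one possible way to do this in Python. The helper name `fit_line` and the variable names are illustrative assumptions, not part of the question, and in an exam these values would normally come from a graphic display calculator.

```python
from math import sqrt

def fit_line(xs, ys):
    """Return (slope, intercept, r) for the least-squares line of y on x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sums of squares and cross-products about the means
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    r = s_xy / sqrt(s_xx * s_yy)
    return slope, intercept, r

# Data from the table in part (a)
temperature = [3, 5, 8, 9, 12]      # T, in degrees Celsius
cups_sold = [37, 34, 33, 26, 21]    # C

a, b, r = fit_line(temperature, cups_sold)
estimate_at_11 = a * 11 + b         # part (b): 11 lies inside the observed range 3 to 12

print(f"a = {a:.3f}, b = {b:.3f}, r = {r:.3f}")
print(f"Estimated cups sold at 11 degrees: {estimate_at_11:.1f}")
```

Because 11 °C lies within the recorded temperatures, the estimate is an interpolation; part (c) asks for that to be weighed together with the size of r.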

3a
4 marks

The following table shows the mean height, y cm, of primary school children who are x years old.

| Age, x (years) | 6.25 | 7.35 | 8.5 | 9.25 | 10.75 |
| Mean height, y (cm) | 115 | 121 | 129 | 136 | 140 |

 

The relationship between x and y can be modelled by the regression line of y on x with equation $y = ax + b$.

(i)

Find the value of a and the value of b.


(ii)
Write down the value of the Pearson’s product-moment correlation coefficient, r.
3b
2 marks

Using the regression equation with your values of a and b from part (a)(i), estimate the height of a child aged 9 years old.

3c
1 mark

Explain why it is not appropriate to use the regression equation to estimate the age of a child who is 133 cm tall.

4a
4 marks

Rebecca, a regular jogger, ran the “Thao Dien Loop” on 7 consecutive days. The following table shows the distance, x km, that she ran and the corresponding number of calories, y, that she was able to burn during the run. 

| Distance, x (km) | 2 | 5 | 6 | 7 | 10 | 12 | 14 |
| Calories, y | 180 | 315 | 365 | 435 | 619 | 830 | 871 |

 

The number of calories burnt during a run depends on the length of the run. The relationship between x and y can be modelled by the regression line of y on x with equation $y = ax + b$.

(i)

Find the value of a and the value of b.


(ii)
Write down the value of the Pearson’s product-moment correlation coefficient, r.
4b
1 mark

Interpret, in the context of the question, the value of a found in part (a)(i).

4c
2 marks

On the 8th day, Rebecca is only able to run for 8 kilometres.

Use the result from part (a)(i) to estimate the number of calories Rebecca will burn.

4d
1 mark

Comment on the validity of using the result from part (a)(i) to answer part (c).

5a
4 marks

The percentage of people who are willing to get a particular vaccine depends on their age. The following table shows the age, A years, and the corresponding percentage of people, V, who are willing to receive a vaccine, for 6 different ages.

| Age, A (years) | 25 | 30 | 35 | 40 | 45 | 50 |
| Percentage of willing people, V | 57 | 59 | 61 | 62 | 68 | 75 |

 
The relationship between A and V can be modelled by the regression line of V on A with equation $V = aA + b$.

(i)

Find the value of a and the value of b.

(ii)
Write down the value of the Pearson’s product-moment correlation coefficient, r.
5b
1 mark

Interpret, in the context of the question, the value of a found in part (a)(i).

5c
2 marks

Use the result from part (a)(i) to estimate the percentage of people aged 95 years old who are willing to receive a vaccine.

5d
1 mark

Comment on the validity of using the result from part (a)(i) to answer part (c).

6a
3 marks

The price, P (in US dollars), of an airline ticket depends on the distance, D km, between two cities. The table below shows the airfares in US dollars from Prague, in the Czech Republic, to eight different destinations in Europe.

| Distance (D) | 885 | 340 | 835 | 330 | 1270 | 295 | 650 | 1930 |
| Price (P) | 99 | 50 | 90 | 45 | 119 | 42.5 | 59 | 139 |

Let $L_1$ be the regression line of P on D. The equation of the line $L_1$ can be written in the form $P = aD + b$.

Let $L_2$ be the regression line of D on P. The equation of the line $L_2$ can be written in the form $D = cP + d$.

(i)
Find the value of a and the value of b.

(ii)
Find the value of c and the value of d.
6b
2 marks

Write down the value of Pearson’s product-moment correlation coefficient, r.

6c
2 marks

Use the result from part (a)(i) to estimate the price of an airline ticket for a flight from Prague to a destination that is 2635 km away.

6d
2 marks

The lines $L_1$ and $L_2$ both pass through the same point with coordinates $(p, q)$.

Find the value of p and the value of q.
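One way to explore part (d) numerically is to fit both regression lines and solve them simultaneously. The sketch below is a minimal illustration, not a model answer. It assumes that p is the D-coordinate and q the P-coordinate of the common point, and the helper name `fit_line` is hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line of y on x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = s_xy / s_xx
    return slope, y_bar - slope * x_bar

distance = [885, 340, 835, 330, 1270, 295, 650, 1930]   # D, in km
price = [99, 50, 90, 45, 119, 42.5, 59, 139]            # P, in US dollars

a, b = fit_line(distance, price)   # L1: P = aD + b
c, d = fit_line(price, distance)   # L2: D = cP + d

# Substitute L1 into L2: D = c(aD + b) + d, then solve for D
p = (c * b + d) / (1 - a * c)
q = a * p + b

# Compare with the sample means of D and P
d_bar = sum(distance) / len(distance)
p_bar = sum(price) / len(price)
print(f"(p, q) = ({p:.3f}, {q:.4f}); means = ({d_bar:.3f}, {p_bar:.4f})")
```

The intersection coincides with the point of sample means, which is the general property of the two regression lines that the question relies on.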

1a
2 marks

10 students are asked to give a score from 0 to 10 on how much they enjoy watching football and how much they enjoy watching cricket. The scores are shown in the scatter plot below.

[Figure: scatter diagram of football score, x, and cricket score, y]

Use the scatter diagram to complete the missing cells in the table below.

| Football score, x | 1 | 3 | 4 | 4 | 5 | 5 | 7 | 8 | 10 | 10 |
| Cricket score, y |  |  | 4 |  |  | 10 |  |  |  | 2 |
1b
2 marks

Another student only gave a cricket score of 6 and no football score.

Estimate the football score for the student who has a cricket score of 6.

1c
1 mark

Comment on the reliability of your estimate found in part (b).

2a
4 marks

A study is conducted on 6 participants (labelled A to F) measuring their average number of hours of sleep per night and their score, from 0 to 100, in a short-term memory test. The results of the study are shown in the table below.

| Participant | A | B | C | D | E | F |
| Average number of hours of sleep (x) | 6.8 | 7.2 | 8.1 | 9.4 | 5.9 | 7.5 |
| Short-term memory test score (y) | 72 | 70 | 82 | 79 | 62 | 80 |

Draw a scatter diagram for the above data on the axes below.

[Figure: blank axes for the scatter diagram]

2b
3 marks
(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Write down the equation of the regression line y on x.

(iii)
Draw the regression line on your scatter diagram.
2c
2 marks

Use the regression line from part (b) to estimate the average number of hours of sleep a participant gets per night when their score in the memory test is 67. Give your answer to the nearest integer.

2d
1 mark

Comment on the reliability of your estimate found in part (c).

3a
3 marks

The following table shows the total CO₂ emissions, T tonnes, from 5 different countries (labelled A to E) and their average annual household income, S USD.

| Country | A | B | C | D | E |
| CO₂ emissions, T (tonnes) | 10 500 000 | 5 500 000 | 2 500 000 | 1 600 000 | 1 200 000 |
| Average annual household income, S (USD) | 55 000 | 105 000 | 15 000 | 55 000 | 40 000 |

(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Hence comment on the result.
3b
2 marks

Write down the equation of the regression line S on T.

3c
2 marks

State two reasons why it would be inappropriate to use the regression line from part (b) to estimate the percentage of total global emissions from a country where the average household annual income is 75 000 USD.

4a
4 marks

Easy Breezes is a company based in the snowy mountains of Greenland that makes heat pumps. Easy Breezes wants to see if the average weekly temperature, in °C, is correlated with the average weekly energy consumption of their air conditioning units, in kilowatt-hours (kWh). Easy Breezes records the following data.

| Average weekly temperature, x (°C) | -8.2 | -4.3 | -1.7 | 0.8 | 2.0 | 4.4 | 8.5 |
| Average weekly energy consumption, y (kWh) | 365.2 | 316.4 | 292.7 | 249.1 | 187.2 | 142.8 | 131.2 |

(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Write down the equation of the regression line y on x.
4b
4 marks
(i)
Use your regression line from part (a)(ii) to estimate the number of kWh one of Easy Breezes’ air conditioning units would use in a week when the average weekly temperature is 11 °C.

(ii)
Comment on the reliability of your estimate.

5a
3 marks

A supermarket has a physical and an online store. The following table shows the total daily revenue, y, in USD, along with the number of customers, x, who came into the physical store during the day, over 8 separate days.

| Customers (x) | 45 | 88 | 54 | 97 | 154 | 107 | 36 | 72 |
| Revenue, USD (y) | 548.21 | 832.55 | 497.71 | 1021.97 | 1138.73 | 988.62 | 1026.21 | 754.38 |

(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Hence comment on the result.
5b
3 marks

The regression line y on x can be written in the form $y = a + bx$.

Calculate the values of a and b and interpret their meanings.

5c
2 marks

The supermarket has daily fixed costs of 650 USD.

Find an expression for the daily profit of the supermarket when they have x customers on a particular day.

5d
2 marks

Estimate the least number of physical customers required in order to make a profit on any particular day.

6a
4 marks

10 rugby players (labelled A to J) are used to investigate the relationship between a player’s maximum sprint velocity, in m s⁻¹, and their weight, in kg. The data is recorded in the table below.

| Player | A | B | C | D | E | F | G | H | I | J |
| Weight, x (kg) | 96 | 99 | 88 | 95 | 112 | 98 | 85 | 108 | 82 | 109 |
| Maximum sprint velocity, y (m s⁻¹) | 7.5 | 6.9 | 10.1 | 8.8 | 6.1 | 6.9 | 5.8 | 10.7 | 11.9 | 6.5 |

Calculate the value of the Pearson’s product-moment correlation coefficient, r,

(i)

with players G and H.

(ii)
without the players G and H.
6b
4 marks

Write down the equation of the regression line y on x

(i)

with players G and H.

(ii)
without the players G and H.
6c
2 marks

Comment on the results found in parts (a) and (b) and state whether you would use the regression line with players G and H or the regression line without players G and H when estimating a rugby player’s maximum sprint velocity, given their weight.
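The effect that players G and H have on the correlation can be seen by computing r twice. The sketch below is only an illustration under stated assumptions: the helper name `pearson_r` is hypothetical, and the player labelled "i" in the source table is taken to be player I.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson's product-moment correlation coefficient for paired data."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return s_xy / sqrt(s_xx * s_yy)

players = "ABCDEFGHIJ"
weight = [96, 99, 88, 95, 112, 98, 85, 108, 82, 109]               # x, in kg
velocity = [7.5, 6.9, 10.1, 8.8, 6.1, 6.9, 5.8, 10.7, 11.9, 6.5]   # y, in m/s

# Part (a)(i): all ten players
r_with = pearson_r(weight, velocity)

# Part (a)(ii): drop players G and H, then recompute
kept = [i for i, label in enumerate(players) if label not in "GH"]
r_without = pearson_r([weight[i] for i in kept], [velocity[i] for i in kept])

print(f"r with G and H: {r_with:.3f}")
print(f"r without G and H: {r_without:.3f}")
```

Comparing the two printed values shows how strongly two unusual data points can weaken a linear correlation, which is the comparison part (c) asks for.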

7a
1 mark

A study was conducted on 6 students measuring their arm length, x cm, and the maximum number of push ups they can do in a minute. The results of the study are shown in the table below.

| Arm length, x (cm) | 72.2 | 69.2 | 75.6 | 78.1 | 78.5 | 74.5 |
| No. of push ups, y | 34 | 42 | 24 | 30 | 38 | 31 |

State the range of the number of push ups.

7b
3 marks
(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Comment on the correlation between a student’s arm length and the maximum number of push ups they can do in a minute.
7c
2 marks

Write down the equation of the regression line y on x.

7d
2 marks

Another student, Tom, is a sportsman and can do 62 push ups in one minute.

Use the regression line found in part (c) to estimate Tom’s arm length.

7e
2 marks

State whether your estimate is valid and justify your answer.

8a
3 marks

Body mass index (BMI) is used to measure whether someone is overweight or underweight; however, BMI does not take someone’s body fat percentage into account. The following table shows the BMI and body fat percentage of 7 male participants.

| BMI, x | 22.4 | 19.8 | 25.5 | 29.8 | 31.2 | 18.1 | 17.2 |
| Body fat %, y | 22.1 | 20.1 | 24.2 | 31.1 | 16.2 | 15.1 | 8.6 |

(i)
Calculate the Pearson’s product-moment correlation coefficient, r.

(ii)
Comment on the result found for r.
8b
4 marks

The regression line y on x is in the form $y = mx + c$.

Calculate the values of m and c and interpret their meanings. Explain whether they are appropriate within the context of the question.

8c
3 marks

The formula to calculate someone’s BMI is

$$\text{BMI} = \frac{\text{weight in kilograms}}{(\text{height in metres})^2}$$

John weighs 95 kilograms and is 1.84 metres tall.

Estimate John’s body fat percentage and comment on the reliability of your estimate.
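The first step of part (c) is to turn John’s weight and height into a BMI value that can be fed into the regression line. The sketch below shows only that step and a range check; the function name `bmi` and the variable names are illustrative assumptions.

```python
def bmi(weight_kg, height_m):
    """Body mass index: weight in kilograms divided by height in metres squared."""
    return weight_kg / height_m ** 2

# BMI values (x) from the table in part (a)
bmi_values = [22.4, 19.8, 25.5, 29.8, 31.2, 18.1, 17.2]

john_bmi = bmi(95, 1.84)

# Interpolation check: is John's BMI inside the range of the sample data?
within_data_range = min(bmi_values) <= john_bmi <= max(bmi_values)

print(f"John's BMI = {john_bmi:.2f}, within sample range: {within_data_range}")
```

Whether the value lies inside the sampled BMI range is one of the points to weigh when commenting on reliability, alongside the strength of the correlation found in part (a).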

9a
4 marks

A cinema is considering significantly reducing the price of its popcorn, as it believes its customers spend more on drinks when they buy popcorn. It recorded the daily revenue from popcorn, x dollars, and the daily revenue from drinks, y dollars, over 8 randomly selected days.

| Popcorn revenue, x ($) | 78.20 | 102.50 | 30.80 | 22.20 | 132.90 | 200.50 | 154.80 | 132.40 |
| Drinks revenue, y ($) | 202.10 | 308.50 | 60.70 | 75.80 | 270.50 | 300.00 | 368.20 | 198.70 |

Calculate

(i)

$\bar{x}$, the mean daily revenue from popcorn.

(ii)

$\bar{y}$, the mean daily revenue from drinks.

(iii)
r, the Pearson’s product-moment correlation coefficient.
9b
4 marks

The equation of the regression line y on x is in the form $y = a + bx$.

Calculate the values of a and b, interpret their meanings, and explain whether they are appropriate within the context of the question.

9c
2 marks

Show that the point $M(\bar{x}, \bar{y})$ lies on the regression line y on x.
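A numerical check of part (c) can be made by finding a and b from the normal equations, without using the means, and then substituting the mean popcorn revenue into the line. This is a sketch rather than a proof, and the variable names are illustrative assumptions.

```python
popcorn = [78.20, 102.50, 30.80, 22.20, 132.90, 200.50, 154.80, 132.40]   # x
drinks = [202.10, 308.50, 60.70, 75.80, 270.50, 300.00, 368.20, 198.70]   # y

n = len(popcorn)
sum_x = sum(popcorn)
sum_y = sum(drinks)
sum_xx = sum(x * x for x in popcorn)
sum_xy = sum(x * y for x, y in zip(popcorn, drinks))

# Solve the two normal equations for the line y = a + bx:
#   n*a     + b*sum_x  = sum_y
#   a*sum_x + b*sum_xx = sum_xy
det = n * sum_xx - sum_x ** 2
b = (n * sum_xy - sum_x * sum_y) / det
a = (sum_y * sum_xx - sum_x * sum_xy) / det

x_bar = sum_x / n
y_bar = sum_y / n

# If M lies on the line, this difference is zero up to rounding error
difference = (a + b * x_bar) - y_bar
print(f"a + b * x_bar - y_bar = {difference:.2e}")
```

The first normal equation, divided by n, is exactly the statement that the mean point satisfies the equation of the line, so the check holds for any data set and not just this one.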

10a
2 marks

James decides to measure the performance of his laptop relative to its internal temperature. To measure his laptop’s performance, James will record how long it takes, in seconds, to load a Safari browser window. To alter the temperature of his laptop, James will start up some of the RAM-intensive apps he has, which he knows cause the computer to heat up.

The following table details some of his findings.

| T (°C) | 39.2 | 42.8 | 50.1 | 55.6 | 68.7 |
| t (seconds) | 2.2 | 3.2 | 3.1 | 4.1 | 5.2 |

Write down the independent and dependent variable.

10b
2 marks

James believes that the relationship between T and t can be modelled by a linear regression equation.

James describes the strength of the correlation as moderate. Circle the value below which best represents the correlation coefficient.

0.976 0.020 -0.593 0.612 -0.312
10c
1 mark

Suggest one significant drawback to James’ methods of testing the performance of his laptop relative to its internal temperature.

1a
4 marks

What to Watch (WTW) and Bingeable are two organisations that review television series. Based on different sets of criteria, scores out of 5 are assigned to 6 recent television series (labelled A to F). The data is shown in the table below.

| TV series | A | B | C | D | E | F |
| WTW’s score (x) | 4.6 | 4.5 | 3.9 | 4.8 | 1.2 | 1.5 |
| Bingeable’s score (y) | 4.9 | 2.5 | 1.5 | 3.2 | 1.1 | 1.4 |

(i)
Find the Pearson’s product-moment correlation coefficient, r, for this data.

(ii)
Describe the correlation between the scoring made by the two different organisations.
1b
2 marks

Write down the equation of the regression line x on y.

1c
2 marks

WTW gives a new series G a score of 4.7. Use the regression line x on y to predict the score that Bingeable awards the same series.

1d
2 marks

Comment on the reliability of your answer to part (c).

2a
4 marks

The table below shows the lengths, in km, of 5 taxi rides in Melbourne, Australia and the corresponding prices, in AUD.

| Length, in km (x) | 12.1 | 4.2 | 9.1 | 3.7 | 6.2 |
| Price, in AUD (y) | 26.75 | 5.75 | 8.50 | 5.50 | 6.95 |

Draw a scatter diagram for the above data on the axes below.

[Figure: blank axes for the scatter diagram]

2b
3 marks

Calculate

(i)

$\bar{x}$, the mean taxi ride length.

(ii)

$\bar{y}$, the mean price.

(iii)
Plot the point $M(\bar{x}, \bar{y})$ on your scatter diagram.
2c
3 marks
(i)
Write down the equation of the regression line y on x.

(ii)
Draw the regression line y on x on your scatter diagram.
2d
1 mark

Show that the point $M(\bar{x}, \bar{y})$ lies on the regression line y on x.

3a
4 marks

A health study was conducted on 5 male and 5 female participants, measuring their average daily caffeine intake, in milligrams (mg), and their resting heart rate, in beats per minute (BPM).

The following table shows the results of the study.

| Average daily caffeine intake, in mg, male ($x_m$) | 222 | 312 | 211 | 190 | 120 |
| Resting heart rate, in BPM, male ($y_m$) | 57 | 72 | 60 | 48 | 50 |
| Average daily caffeine intake, in mg, female ($x_f$) | 202 | 411 | 254 | 81 | 52 |
| Resting heart rate, in BPM, female ($y_f$) | 57 | 81 | 71 | 45 | 49 |

Calculate the Pearson’s product-moment correlation coefficient for

(i)

the male participants, $r_m$,

(ii)
the female participants, $r_f$.
3b
4 marks

Write down the equation of the regression line

(i)

$y_m$ on $x_m$,

(ii)
$y_f$ on $x_f$.
3c
3 marks

Find the intersection of the two regression lines found in part (b) and interpret its meaning.

4a
3 marks

The following table shows the distance, in km, to 5 different ferry destinations from Rostock, Germany and the corresponding price of the cruise, in €.

| Destination | Copenhagen | Oslo | Stockholm | Helsinki | Riga |
| Distance, in km (D) | 174 | 620 | 730 | 933 | 810 |
| Price, in € (P) | 30.50 | 65.00 | 45.75 | 85.50 | 125.00 |

The regression line P on D can be written in the form $P = a + bD$.

Calculate the values of a and b and interpret their meanings.

4b
2 marks

The distance to Aberdeen from Rostock is 1093 km.

Estimate the cost of the ferry to Aberdeen.

4c
1 mark

Comment on the reliability of your estimate found in part (b).

5a
2 marks

The following table shows the total revenue, R, in £, obtained weekly during the first 7 weeks of 2021 by Larry, an independent financial consultant, and the number of clients, x, served.

| Week | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| Revenue, in £ (R) | 2452 | 5751 | 6429 | 1203 | 4587 | 9786 | 6911 |
| Clients, x | 7 | 11 | 14 | 4 | 5 | 8 | 9 |

Write down the equation of the regression line R on x.

5b
3 marks

Larry’s weekly operating costs are £2250 and the cost of serving each client is £35.

Find an expression for the profit Larry makes when serving x clients in a week.

5c
3 marks

Estimate the least number of clients required to generate a minimum of £1000 profit.

6a
2 marks

Sandy Café is located on a beach and is open all year. The manager wants to see whether the daily average temperature, in °C, is correlated with the average tip they receive, as a percentage of the customer’s total bill. He records this data over 9 days and details it in the table below.

| Daily average temperature, x (°C) | 22.4 | 27.8 | 15.4 | 12.2 | 8.8 | 2.1 | 33.4 | 14.7 | 19.4 |
| Average tip as a percentage of the total bill (y) | 20.1 | 16.3 | 12.4 | 12.8 | 10.1 | 9.4 | 18.8 | 13.1 | 15.9 |

(i)
Find the Pearson’s product-moment correlation coefficient, r, for this data.

(ii)
Write down the equation of the regression line y on x.
6b
4 marks

On the 10th day, the average temperature is 25 °C and a customer tips their waiter $20.

Use the regression line to estimate the customer’s total bill. Give your answer to 2 decimal places.

6c
2 marks

The customer’s total bill was $98.50.

Calculate the tip as a percentage of the actual total bill. Give your answer to the nearest integer.

7a
1 mark

The table below shows the petrol prices, in New Zealand dollars (NZD) per litre, for 6 different petrol stations (labelled A to F) along with their distance south of Auckland’s city centre.

| Petrol station | A | B | C | D | E | F |
| Distance south of Auckland, in km (x) | 122 | 314 | 456 | 231 | 178 | 392 |
| Petrol price, in NZD per litre (y) | 1.94 | 1.88 | 1.78 | 1.84 | 1.99 | 1.81 |

Calculate the mean petrol price, $\bar{y}$.

7b
3 marks

The equation of the regression line y on x can be written in the form $y = a + bx$.

(i)
Calculate the value of a.

(ii)
Calculate the value of b, giving your answer in the form $k \times 10^n$, where $1 \le |k| < 10$, $n \in \mathbb{Z}$.
7c
2 marks

The distance between Auckland’s city centre and a new petrol station, G, is 200 km and the bearing of G from Auckland’s city centre is 166°.

Estimate the petrol price at G.
