Data Handling: Inferential Statistics (AQA A Level Psychology)

Exam Questions

1 hour17 questions
12 marks

A biological psychologist wanted to see if there was a positive correlation between heart rate (measured in beats per minute) and levels of a stress hormone (measured in micrograms) in people with obsessive-compulsive disorder (OCD). She wrote a directional hypothesis and collected data from 20 people with OCD.

The psychologist chose to use the Pearson’s r test to analyse the data because she was testing for a correlation.

Explain one other reason why the psychologist used a Pearson’s r test in this study.

Did this page help you?

22 marks

A psychologist decided to conduct an experiment to investigate the effect of watching horror films before going to bed.

A volunteer sample of 50 university students consented to take part in the experiment.

The 50 students were randomly split into two groups. Group 1 watched a horror film before going to bed each night for the first week then a romantic comedy before going to bed each night for the second week. Group 2 watched the romantic comedy in the first week and the horror film in the second week.

When the students woke up each morning, each student received a text message that asked if they had had a nightmare during the night. They could respond ‘yes’ or ‘no’.

The psychologist found that the difference in the number of nightmares reported in the two conditions was significant at p<0.05.

Explain what is meant by ‘significant at p<0.05’ in the context of this experiment.

Did this page help you?

3a2 marks

It was recently reported in a newspaper that time spent playing team sports increases happiness levels. A researcher was keen to find out whether this was due to participating in a team activity or due to participating in physical activity, as he could not find any published research on this.

The researcher used a matched-pairs design. He went into the student café and selected the first 20 students he met. Each student was assigned to one of two groups. Participants in Group A were requested to carry out 3 hours of team sports per week.

Participants in Group B were requested to carry out 3 hours of exercise independently in a gym each week. All participants were told not to take part in any other type of exercise for the 4-week duration of the study.

All participants completed a happiness questionnaire at the start and end of the study. The researcher then calculated the improvement in happiness score for each participant.

The results of the study are given in Table 1, below.

Table 1 Improvement in happiness scores

Team sports Group A

Independent sport Group B

Pair 1

0

2

Pair 2

14

8

Pair 3

4

4

Pair 4

-1

1

Pair 5

3

3

Pair 6

12

5

Pair 7

8

7

Pair 8

15

8

Pair 9

7

8

Pair 10

3

2

The researcher decided to use the Sign Test to see whether there was a significant difference in the improvement in the scores between the two groups at the 5% level of significance.

Calculate the value of S in this study. Show your workings.

3b2 marks

Table 1 Table of critical values of the sign test (S)

Level of significance for a two-tailed test

N

0.10

0.05

0.01

7

0

0

8

1

0

0

9

1

1

0

10

1

1

0

11

2

1

0

12

2

2

1

Significance is shown if the calculated value of S is equal to or less than the critical value

Explain whether or not there was a significant difference in the improvement in the scores between the two groups. Use your answer to Question 3a and Table 2.

Did this page help you?

43 marks

What is meant by a Type II error? Explain why psychologists normally use the 5% level of significance in their research.

Did this page help you?

52 marks

In a study of communication, researchers used a language rating scale to compare two groups of participants. Participants in Group A were a volunteer sample of people from a self-help group for people with schizophrenia. Participants in Group B, the control group, were a volunteer sample of people who attended a community centre. 1 2

The researchers used an inferential statistical test to see if there was a significant difference in language rating between the two groups.

Explain why a Mann-Whitney test is an appropriate choice of statistical test in this situation and a Spearman’s test is not.

Did this page help you?

6a2 marks

A psychologist wanted to investigate whether exercise would affect stress levels in 15-year-olds. Previous research into the effects of exercise on stress in teenagers had shown that exercise decreased stress levels.

The psychologist decided to use a repeated measures design to investigate the effects of exercise on stress levels in 20 15-year-old students. All the students were approaching their end-of-year exams.

For Condition A, students were required to complete a 2 km run during their morning breaktime each school day for one week.

In Condition B, students continued their normal activities in the playground during their morning breaktime each school day for one week.

At the end of each week of the investigation, for both Condition A and Condition B, each student was asked to rate their levels of stress on a rating scale of 1–10, where the higher the self-reported rating the greater the stress levels.

A Wilcoxon test was used to determine whether differences in stress ratings were significant. The psychologist used a 0.01 level of significance.

Explain what it would mean if the results in the above study were significant at the 0.01 level.

6b2 marks

The psychologist found that the difference was not significant at the 0.01 level.

The psychologist was concerned that there might have been a type II error.

Explain what is meant by a type II error in the context of this study.

6c1 mark

Suggest one way the psychologist could reduce the chance of a type II error.

Did this page help you?

14 marks

A psychology teacher read a researcher’s study on sport and happiness. She considered whether setting group tasks could improve her students’ level of happiness. She decided to conduct an independent groups experiment with 30 students taking A-level Psychology using the same happiness questionnaire.

Suggest an appropriate statistical test the psychology teacher could use to analyse the data. Justify your choice of test.

Did this page help you?

2a4 marks

A researcher investigated whether there was a correlation between the number of hours slept and how well rested the students felt.

The researcher randomly selected 18 participants from first-year students at the university.

On the day of the study, each student participant was asked, ‘How many hours did you sleep last night?’ They then had to rate on a scale of 1 to 5 how well rested they felt.

The researcher hypothesised that there would be a positive correlation between the two co-variables.

The researcher used Spearman’s rho statistical test to analyse the data from this study.

Explain why Spearman’s rho was a suitable test for this study. Refer to the description of the study in your answer.

2b4 marks

The researcher chose to use the 5% level of significance and the calculated correlation box coefficient for the Spearman’s rho test was 0.395

Table 1 Critical values of rho

Level of significance for a one-tailed test

0.05

0.025

Level of significance for a two-tailed test

0.10

0.05

N = 16

0.429

0.503

17

0.414

0.485

18

0.401

0.472

19

0.391

0.460

20

0.380

0.447

The calculated value of rho must be greater than or equal to the critical value to be significant.

Identify the appropriate critical value from Table 1. Explain your choice.

2c2 marks

Explain whether the researcher’s hypothesis should be accepted. Refer to the critical value identified in Question 2b in your answer.

2d2 marks

Explain why the researcher decided to use the 5% level of significance rather than the 1% level in this study.

2e2 marks

When the researcher compared the calculated and critical values of rho, he began to wonder if he might have made a Type II error.

Explain what is meant by a Type II error in the context of this study.

Did this page help you?

32 marks

A researcher studying depression wanted to see whether or not there was a relationship between level of self-esteem and negative schema score. She constructed two questionnaires and asked ten people who had been diagnosed with depression to complete them.

One questionnaire measured the participant’s level of self-esteem. A low score (out of 50) indicated low self-esteem.

The other questionnaire measured whether the participant showed evidence of negative schema. A low score (out of 50) indicated evidence of negative schema. The two sets of results for each participant are shown in the table below.

The researcher analysed the data in Table 1 using a Spearman’s rho statistical test.

Table 1 Self-esteem score and negative schema score for each patient

Table showing participant numbers 1 to 10, their self-esteem scores (ranging from 8 to 22), and negative schema scores (ranging from 11 to 20).

With reference to level of measurement, explain why Spearman’s rho is an appropriate statistical test for this data.

Did this page help you?

44 marks

A biological psychologist wanted to see if there was a positive correlation between heart rate (measured in beats per minute) and levels of a stress hormone (measured in micrograms) in people with obsessive-compulsive disorder (OCD). She wrote a directional hypothesis and collected data from 20 people with OCD.

The psychologist chose to use the Pearson’s r test to analyse the data because she was testing for a correlation. The calculated value of r for the Pearson’s test was +0.42.

The researcher used the following table of critical values.

Table 1: Critical values of r for Pearson's test at various significance levels for one-tailed and two-tailed tests with degrees of freedom ranging from 17 to 20.

Calculated value of r must be equal to or more than the critical value for significance to be shown Degrees of freedom (df) = N – 2

Is the psychologist’s calculated value significant? With reference to Table 1, explain your answer.

Did this page help you?

5a2 marks

In an experiment, researchers arranged for participants to complete a very personal and embarrassing questionnaire in a room with other people. Each participant was tested individually. The other people were confederates of the experimenter.

In condition 1: the confederates completed the questionnaire.

In condition 2: the confederates refused to complete the questionnaire and asked to leave the experiment.

The researchers tested 15 participants in condition 1, and 15 different participants in condition 2.

The researchers recorded the number of participants who completed the questionnaire in each condition.

In order to analyse the difference in the number of participants who completed the questionnaire in each condition, the researchers used a chi-squared test.

Apart from reference to the level of measurement, give two reasons why the researchers used the chi-squared test.

5b2 marks

The calculated value of chi-squared in the experiment described as 3.97

Table 1: Critical values for the chi-squared test

Level of significance

df

0.1

0.05

0.02

0.01

1

2.71

3.84

5.41

6.64

The calculated value of chi-squared should be equal to or greater than the critical value to be statistically significant.

With reference to the critical values in Table 1, explain whether or not the calculated value of chi-squared is significant at the 5% level.

Did this page help you?

64 marks

In 1987, a survey of 1000 young people found that 540 said they smoked cigarettes, whilst 460 said they did not. In 2017, a similar survey of another 1000 young people found that 125 said they smoked cigarettes, whilst 875 said they did not.

Which statistical test should be used to calculate whether there is a significant difference in reported smoking behaviour between the two surveys? Give three reasons for your answer.

Did this page help you?

74 marks

In an experiment into the effects of cognitive priming, 20 participants played an unfamiliar computer game. Half of the participants watched a violent film before playing the computer game.

The other half watched a neutral film before playing the computer game. After playing the computer game, each participant was classified as showing High Aggression or Low Aggression.

The researcher decided to use a Chi-Squared test to analyse the data from this study. One reason for choosing this test was that each participant only took part in one condition of the experiment.

Referring to information about the study, explain two other reasons why it was appropriate to use a Chi-Squared test in this case.

Did this page help you?

15 marks

A psychologist wanted to test whether listening to music improves running performance.

The psychologist conducted a study using 10 volunteers from a local gym. The psychologist used a repeated measures design. Half of the participants were assigned to condition A (without music) and half to condition B (with music).

All participants were asked to run 400 metres as fast as they could on a treadmill in the psychology department. All participants were given standardised instructions. All participants wore headphones in both conditions. The psychologist recorded their running times in seconds. The participants returned to the psychology department the following week and repeated the test in the other condition.

The researcher used a directional hypothesis and analysed the data using a related t-test. The calculated value of t where degrees of freedom (df) = 9 was 1.4377. He decided to use the 5% level of significance.

Table 1 Table of critical values of t

Table showing critical t-values for one-tailed tests at 0.05, 0.025 significance levels and two-tailed tests at 0.10, 0.05 significance levels for 1 to 10 degrees of freedom.

Calculated value of t must be equal to or greater than the critical value in this table for significance to be shown.

Give three reasons why the researcher used a related t-test in this study and, using Table 1, explain whether or not the results are significant.

Did this page help you?

27 marks

A psychologist decided to design an experiment to test the effects of recreational screen time on children’s academic performance.

The psychologist randomly selected four schools from all the primary schools in her county to take part in the experiment involving Year 5 pupils. Three of the four schools agreed to take part. In total, there were 58 pupils whose parents consented for them to participate. The 58 pupils were then randomly allocated to Group A or Group B.

For the two-week period of the experiment, pupils in Group A had no recreational screen time. Pupils in Group B were allowed unrestricted recreational screen time. At the end of the experiment all pupils completed a 45-minute class test, to achieve a test score.

The psychologist wanted to test the statistical significance of the data.

Identify the most appropriate choice of statistical test for analysing the data collected and explain three reasons for your choice in the context of this study.

Did this page help you?

37 marks

Researchers investigated whether the experience of bullying is influenced by attachment type. They interviewed teenagers about their early attachment experiences. Following the interviews, the teenagers were categorised into two groups based on their attachment type:

  • Group 1 – secure attachment in childhood

  • Group 2 – insecure attachment (insecure-avoidant or insecure-resistant) in childhood.

During the interview, the teenagers were also asked about their experience of bullying. They were categorised as either:

  • having had experience of bullying, or

  • having had no experience of bullying.

Which statistical test would be most suitable to analyse the data in this investigation? With reference to this investigation, explain three reasons for your choice of test.

Did this page help you?

47 marks

A controlled observation was designed to compare the social behaviours of pre-school children of working parents and pre-school children of stay-at-home parents. The sample consisted of 100 children aged three, who were observed separately. Half of the children had working parents and the other half had stay-at-home parents.

The observation took place in a room which looked like a nursery, with a variety of toys available. In the room, there were four children and one supervising adult. Their behaviour was not recorded.

Each child participant was brought into the room and settled by their parent. The parent then left to sit outside. Each child participant’s behaviour was observed covertly for five minutes while they played in the room.

The observation was conducted in a controlled environment and a standardised script was used when the children and their parents arrived

The data from the observation was summarised by converting the number of agreed observations into a total social behaviour score for each child.

The researcher then conducted a statistical test to identify whether there was a significant difference between the social behaviour scores for the children of stay-at-home parents and those of working parents.

Identify an appropriate statistical test that the researcher could use to analyse the social behaviour scores in this study. Explain three reasons for your choice in the context of this study.

Did this page help you?