Properties of Normal Distributions (College Board AP® Statistics)

Revision Note

Naomi C

Author

Naomi C

Expertise

Maths

Continuous random variables

What is a continuous random variable?

  • A continuous random variable (CRV) is a random variable whose value lies within one or more intervals on a number line

    • The number of values that a CRV could be, within a given interval, are infinite

      • e.g. the weight of a squirrel is a continuous random variable, a squirrel selected at random could take on any value within the given interval

  • In some cases it can be easy to confuse continuous and discrete variables

    • e.g. foot length is a continuous variable, as it can be measured to any specified degree of accuracy

    • but shoe size is a discrete variable as it can be counted (4, 5, 6, etc.)

What is a continuous probability distribution?

  • A continuous probability distribution describes the probabilities of the occurrences of each possible outcome associated with a continuous random variable

  • As there are an infinite number of possible values for a continuous random variable within a given interval

    • the probability that it assumes one specific value is 0

      • i.e. for a value x equals k in the given interval, P open parentheses X equals k close parentheses equals 0

    • Instead we talk about the probability of X being within a certain range

  • A continuous probability distribution can be represented by a continuous graph, f open parentheses x close parentheses, called a probability density function f open parentheses x close parentheses

    • the values for X lie along the horizontal axis

    • and the probability density is given on the vertical axis

  • The area under the graph between the points x equals a and x equals b is equal to P open parentheses a less or equal than X less or equal than b close parentheses

    • The total area under the graph equals 1

  • As P open parentheses X equals k close parentheses equals 0 for any value k, it does not matter if we use strict or weak inequalities

    • P open parentheses X less or equal than k close parentheses equals P open parentheses X less than k close parentheses for any value of k

Properties of normal distributions

What is a normal distribution?

  • A normal distribution is a continuous probability distribution

  • If a continuous random variable follows a normal distribution, then its shape will be:

    • symmetrical

    • mound-shaped (bell-shaped)

  • If X follows a normal distribution then

    • mu is the mean

    • sigma is the standard deviation

What are the important properties of a normal distribution?

  • The normal distribution is symmetrical about x equals mu

    • Mean = Median = Mode = mu

  • The empirical rules state that the data in a normal distribution is distributed such that (approximately):

    • 68% of the data lies within one standard deviation of the mean open parentheses mu plus-or-minus sigma close parentheses

    • 95% of the data lies within two standard deviations of the mean open parentheses mu plus-or-minus 2 sigma close parentheses

    • 99.7% lies within three standard deviations of the mean open parentheses mu plus-or-minus 3 sigma close parentheses

Bell curve showing a normal distribution with shaded areas representing 68% (𝞵 ± 𝞼), 95% (𝞵 ± 2𝞼), and 99.7% (𝞵 ± 3𝞼) of the data.

Worked Example

The distribution of the travel range of adult female grizzly bears follows approximately a normal distribution. Based on a very large sample, it was found that 16 percent of the bears had a travel range of less than 6.9 miles and 84 percent of the bears had a travel range of less than 8.1 miles. What are the mean and standard deviation of the distribution of the travel ranges of the adult female grizzly bears?

Answer:

Subtract the percentage of bears that have a travel range of less than 8.1 miles from 100% to find the percentage of bears that have a travel range of greater than 8.1 miles

100% - 84% = 16%

It can be useful to sketch a diagram of the distribution

Bell curve depicting a normal distribution with mean (μ) at the center. Areas under the curve representing 16% are highlighted on both sides.

Because the distribution is symmetrical, the mean of the distribution will be halfway between the two values

fraction numerator 6.9 plus 8.1 over denominator 2 end fraction equals 7.5

Find the percentage of bears that have a travel range of between 6.9 miles and 84 miles

84% - 16% = 68%

Approximately 68% of the data of a normal distribution lies within one standard deviation of the mean

Each of the values 6.9 and 8.1 are one standard deviation either side of the mean, you can therefore find the value of the standard deviation by dividing the difference between the two values by two

fraction numerator 8.1 minus 6.9 over denominator 2 end fraction equals 0.6

The distribution of the travel ranges of the adult female grizzly bears has a mean of 7.5 miles with standard deviation 0.6 miles

What can be modeled using a normal distribution?

  • A lot of real-life continuous variables can be modeled by a normal distribution if:

    • the population is large enough

    • and the variable is roughly symmetrical with one mode

  • For a normal distribution, the continuous random variable X can take any real value

    • However, values far from the mean (more than 4 standard deviations away) have a probability density of practically zero

    • This allows us to model variables that are not defined for all real values, e.g. height and weight

What cannot be modeled using a normal distribution?

  • Variables which have more than one mode or no mode

    • e.g. the number given by a random number generator

  • Variables which are not symmetrical

    • e.g. how long a human lives for

  • Variables in which acceptable ranges of values (within 3 standard deviations of the mean) become negative when they are meant to be positive

    • e.g. length modeled by a normal distribution with mean 10 cm and standard deviation 8 cm

    • 3 standard deviations below the mean would be 10 minus 3 cross times 8 equals negative 14 and a length cannot be negative

Worked Example

The random variable S represents the speeds (mph) of a certain species of cheetahs when they run. The variable is modeled using a mean of 40 mph and a standard deviation of 10 mph.

State the two assumptions that have been made about the shape of the distribution of cheetah speeds in order to use this.

Answer:

We assume that the distribution of cheetah speeds is symmetrical and mound-shaped

You've read 0 of your 10 free revision notes

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Naomi C

Author: Naomi C

Naomi graduated from Durham University in 2007 with a Masters degree in Civil Engineering. She has taught Mathematics in the UK, Malaysia and Switzerland covering GCSE, IGCSE, A-Level and IB. She particularly enjoys applying Mathematics to real life and endeavours to bring creativity to the content she creates.