The Geometric Distribution (Edexcel A Level Further Maths: Further Statistics 1)

Revision Note

Roger

Author

Roger

Last updated

Conditions for Geometric Models

What is the geometric distribution?

  • The geometric distribution models the number of trials needed to reach the first success
    • For example, how many times will you have to roll a dice until it lands on a '6' for the first time
  • The notation for the geometric distribution is Geo open parentheses p close parentheses 
    • For a random variable X that has the geometric distribution you can write X tilde Geo open parentheses p close parentheses space
    • X is the number of trials it takes to reach the first success
      • For example, X equals 8 means the first success occurred on the 8th trial
    • p is the fixed probability of success in any one trial

What are the conditions for using a geometric model?

  • A geometric model can be used for an experiment that satisfies the following conditions:
    • The experiment consists of an indefinite number of successive trials
    • The outcome of each trial is independent of the outcomes of all other trials
    • There are exactly two possible outcomes for each trial (success and failure)
    • The probability of success in any one trial (p) is constant 
  •  Note that these conditions are very similar to the conditions for the binomial distribution
    • But for a binomial distribution the number of trials (bold italic n) is fixed
      • And you count the number of successes
    • While for a negative binomial distribution the experiment continues until the first success is achieved
      • And you count the number of trials it takes to reach that first success

When might the conditions not be satisfied?

  • If asked to criticise a geometric model, you may be able to question whether the trials are really independent
    • For example, someone may be repeating an activity until they achieve a success
      • The trials may not be independent because the person gets better from practising the activity
      • This also means the probability of success, p, is not constant
    • In order to proceed using the model, you would have to assume that the trials are independent

Examiner Tip

  • Replace the word "trials" with the context (e.g. "flips of a coin") when commenting on conditions and assumptions

Geometric Probabilities

What are the probabilities for the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses then X has the probability function:
    • the random variable X is the number of trials needed to get the first success
    • p is the constant probability of success in one trial
    • straight P open parentheses X equals x close parentheses is the probability that the first success will occur on the x th trial
  • Note that that is the product of
    • the probability of first getting bold italic x bold minus bold 1 failures,  ,
    • and the probability of getting a success in the bold italic x bold th trialp
  • Also note that there is no greatest possible x
    • It could require any number of trials to reach the first success
    • However straight P open parentheses X equals x close parentheses gets closer and closer to zero as x gets larger
  • Your calculator may allow you to calculate Geometric probabilities directly
    • i.e., without having to use the above formula

What are the properties of the geometric distribution?

  • Note that straight P open parentheses X equals 1 close parentheses comma space straight P open parentheses X equals 2 close parentheses comma space straight P open parentheses X equals 3 close parentheses comma space straight P open parentheses X equals 4 close parentheses comma space... equals p comma space p open parentheses 1 minus p close parentheses comma space p open parentheses 1 minus p close parentheses squared comma space p open parentheses 1 minus p close parentheses cubed comma space... space
    • This means that the geometric probabilities form a geometric sequence
      • The first term is p
      • The common ratio is open parentheses 1 minus p close parentheses
    • This is where the geometric distribution gets its name!
  • Assuming that 0 less than p less than 1, then it is also true that 0 less than open parentheses p minus 1 close parentheses less than 1
    • This means that  straight P open parentheses X equals 1 close parentheses greater than straight P open parentheses X equals 2 close parentheses greater than straight P open parentheses X equals 3 close parentheses greater than straight P open parentheses X equals 4 close parentheses greater than...
      • i.e., the probabilities form a decreasing sequence
      • and straight P open parentheses X equals 1 close parentheses is the largest probability in the sequence
      • Therefore X equals 1 is the mode of the distribution

3-2-3-cie-fig0-geo-dist-graphs

  • The geometric distribution has no 'memory'
    • It doesn't matter what has happened previously, or how many 'failures' in a row there have been
    • The probability of getting a 'success' in any trial is always p
    • This means that the number of additional trials needed for the first success is not dependent on the number of trials that have already occurred
      • e.g. if 5 (failed) trials have already occurred, the probability of the first success happening after 7 trials is simply the probability of success happening after 2 trials in the first place, i.e. p open parentheses 1 minus p close parentheses

What are the cumulative probabilities for the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses then X has the cumulative geometric distribution:
    • the random variable X is the number of trials needed to get the first success
    • p is the constant probability of success in one trial
    • straight P open parentheses X less or equal than x close parentheses is the probability that the first success will occur on or before the x th trial
  • Your calculator may allow you to calculate Geometric probabilities directly
    • i.e., without having to use the above formula
  • The formula can be proved as follows
    • If the first success occurs on or before the x th trial, that means that the first x trials have not all been failures
      • The probability of getting x failures in a row is Error converting from MathML to accessible text.
      • So the probability of that not happening is 1 minus open parentheses 1 minus p close parentheses to the power of x
  • Alternatively, it can be proved algebraically
    • The geometric probabilities form a geometric sequence with first term p and common ratio 1 minus p
    • Putting that into the geometric series formula S subscript n equals fraction numerator a open parentheses 1 minus r to the power of n close parentheses over denominator 1 minus r end fraction gives

  • Because limit as x rightwards arrow infinity of open parentheses 1 minus p close parentheses to the power of x equals 0 (assuming 0 less than p less than 1), it follows that

sum from r equals 1 to infinity of straight P open parentheses X equals r close parentheses equals 1 minus 0 equals 1

    • So the sum of all probabilities is equal to 1
      • This is a requirement of any probability distribution

Examiner Tip

  • If you forget the formulae in the exam, you can often still do questions using basic probability concepts and geometric series

Worked example

Joshua is an inspector in a factory.  His job is to randomly sample widgets produced by a particular machine, until he finds a widget that has a defect.  If he finds a widget with a defect, then the machine must be stopped until a repair procedure has been completed.  Given that the probability of a widget being defective is 0.002, find the probability that:

a)
the 10th widget that Joshua inspects is the first one that is defective

geo-probs-we-a

b)
the 250th widget that Joshua inspects is the first one that is defective

geo-probs-we-b

c)
the 250th widget that Joshua inspects is the first one that is defective, given that the first 240 were not defective

geo-probs-we-c

d)
Joshua will inspect 250 or fewer widgets before finding the first one that is defective

geo-probs-we-d

e)
Joshua will need to inspect more than 250 widgets before finding the first one that is defective.

geo-probs-we-e

f)
State an assumption you have used in calculating the above probabilities.

geo-probs-we-f

Geometric Mean & Variance

What are the mean and variance of the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses, then
    • The mean of X is  straight E open parentheses X close parentheses equals mu italic equals 1 over p
    • The variance of X is  Var open parentheses X close parentheses equals sigma to the power of italic 2 italic equals fraction numerator italic 1 italic minus p over denominator p to the power of italic 2 end fraction
  • You need to be able to use these formulae to answer questions about the geometric distribution

Examiner Tip

  • If a question gives you the value of the mean or variance, form an equation in p and solve it

Worked example

Palamedes is rolling a biased dice for which the probability of the dice landing on a '6' is p.  The random variable X represents the number of times he needs to roll the dice until a '6' appears for the first time.  Given that the standard deviation of X is 2 square root of 3, find:

a)
the value of p.

geo-mean-var-we-a

b)
the mean of X

geo-mean-var-we-b

c)
straight P open parentheses X less or equal than 3 close parentheses.

geo-mean-var-we-c

You've read 0 of your 5 free revision notes this week

Sign up now. It’s free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Did this page help you?

Roger

Author: Roger

Expertise: Maths

Roger's teaching experience stretches all the way back to 1992, and in that time he has taught students at all levels between Year 7 and university undergraduate. Having conducted and published postgraduate research into the mathematical theory behind quantum computing, he is more than confident in dealing with mathematics at any level the exam boards might throw at you.