What is a hypothesis test?
A hypothesis test uses a sample of data in an experiment to test a statement made about the population.
The statement is either about a population parameter or the distribution of the population.
Which hypothesis do you assume to be true when performing a hypothesis test: the null hypothesis or the alternative hypothesis?
When performing a hypothesis test, you assume the null hypothesis to be true.
What is denoted by the notation $H_0$?
The null hypothesis is denoted by $H_0$.
What is the notation for the alternative hypothesis?
The notation for the alternative hypothesis is $H_1$.
What is meant by the significance level of a hypothesis test?
The significance level is the probability that a hypothesis test rejects the null hypothesis when it is true.
The significance level sets the threshold for how unlikely an observed result must be, assuming the null hypothesis is true, before the null hypothesis is rejected. A probability smaller than the significance level suggests that the result is unlikely to have happened by chance.
An observation of the test statistic is taken for a hypothesis test.
What is meant by the p-value of this observation?
The p-value is the probability of obtaining a value at least as extreme as the observation if the null hypothesis were true.
True or False?
If the p-value is less than the significance level then there is evidence to reject the null hypothesis.
True.
If the p-value is less than the significance level then there is evidence to reject the null hypothesis.
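As a rough illustration of comparing a p-value with a significance level, the sketch below uses a hypothetical experiment that is not from these flashcards: a coin is flipped 20 times and lands heads 15 times, and the null hypothesis p = 0.5 is tested against the one-tailed alternative p > 0.5 at an assumed 5% significance level. The function name `upper_tail_p_value` is made up for this example.

```python
from math import comb

def upper_tail_p_value(n, observed, p_null):
    """P(X >= observed) for X ~ B(n, p_null): the chance of a result at
    least as extreme as the observation if the null hypothesis is true."""
    return sum(
        comb(n, k) * p_null**k * (1 - p_null)**(n - k)
        for k in range(observed, n + 1)
    )

# Hypothetical data: 15 heads in 20 flips, H0: p = 0.5, H1: p > 0.5.
significance_level = 0.05  # assumed 5% level
p_value = upper_tail_p_value(20, 15, 0.5)
reject_null = p_value < significance_level

print(f"p-value = {p_value:.4f}, reject H0: {reject_null}")
```

Here the p-value is about 0.021, which is below 0.05, so under these assumptions there is evidence to reject the null hypothesis.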
What is meant by the critical region for a test statistic?
The critical region is the set of values for a test statistic that would lead to the rejection of the null hypothesis.
These are the values that are unlikely to be obtained if the null hypothesis were true.
What is meant by the critical value(s) for a test statistic?
The critical value(s) form the boundary of the critical region.
A critical value is the least extreme value that would lead to the rejection of the null hypothesis.
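To make the link between the critical value and the critical region concrete, here is a minimal sketch for a hypothetical one-tailed binomial test (20 coin flips, null hypothesis p = 0.5, alternative p > 0.5, assumed 5% significance level). The scenario and the helper names are assumptions for illustration only.

```python
from math import comb

def upper_tail_probability(n, c, p_null):
    """P(X >= c) for X ~ B(n, p_null)."""
    return sum(
        comb(n, k) * p_null**k * (1 - p_null)**(n - k)
        for k in range(c, n + 1)
    )

def upper_critical_value(n, p_null, significance_level):
    """Smallest c with P(X >= c) <= significance level, or None if none exists."""
    for c in range(n + 1):
        if upper_tail_probability(n, c, p_null) <= significance_level:
            return c
    return None

# Hypothetical test: 20 coin flips, H0: p = 0.5, H1: p > 0.5, 5% level.
n = 20
critical_value = upper_critical_value(n, 0.5, 0.05)
critical_region = list(range(critical_value, n + 1))

print("critical value:", critical_value)
print("critical region:", critical_region)
```

In this setup the critical value is 15 and the critical region is 15 to 20 heads: observing 14 heads would not lead to rejection, because P(X ≥ 14) is slightly above 5%.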
What is a test for independence used for?
A test for independence is used to test whether two variables are statistically independent of each other.
What are contingency tables in a test for independence?
A contingency table is a two-way table that shows the observed frequencies for each combination of the two variables.
For example:
| | | Eye colour | | |
| | | Blue | Brown | Green |
| Hair colour | Black | 17 | 12 | 29 |
| | Blonde | 31 | 25 | 21 |
True or False?
If a contingency table has m rows and n columns then the number of degrees of freedom is equal to m×n.
False.
If a contingency table has m rows and n columns then the number of degrees of freedom is equal to (m-1)×(n-1).
True or False?
For a test for independence, you reject the null hypothesis if the test statistic is greater than the critical value.
True.
For a test for independence, you reject the null hypothesis if the test statistic is greater than the critical value.
What is meant by the expected frequencies for a test for independence?
The expected frequencies for a test for independence are the frequencies for each possible combination of outcomes of the two variables if they were independent.
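The expected frequencies can be worked out by hand as (row total × column total) ÷ grand total for each cell. The sketch below applies this to the hair colour and eye colour table from these flashcards and also computes the degrees of freedom as (m-1)×(n-1). The function name `expected_frequencies` is an illustrative choice, not part of any calculator or library.

```python
# Observed frequencies from the hair colour / eye colour table.
# Rows: Black, Blonde hair. Columns: Blue, Brown, Green eyes.
observed = [
    [17, 12, 29],
    [31, 25, 21],
]

def expected_frequencies(table):
    """Expected count for each cell if the row and column variables are independent."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

expected = expected_frequencies(observed)
degrees_of_freedom = (len(observed) - 1) * (len(observed[0]) - 1)

for row in expected:
    print([round(value, 2) for value in row])
print("degrees of freedom:", degrees_of_freedom)
```

For example, the expected frequency for black hair and blue eyes is 58 × 48 ÷ 135, which is roughly 20.6.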
How should you write the null hypothesis of a test for independence?
For example, suppose you are testing whether hair colour and eye colour are independent.
The null hypothesis of a test for independence should be of the form:
$H_0$: variable X is independent of variable Y.
For example, $H_0$: hair colour is independent of eye colour.
In an exam, how do you find the $\chi^2$ statistic for a test for independence?
In an exam, to find the $\chi^2$ statistic for a test for independence you:
use the two-way test option on your GDC,
input the observed frequencies as a matrix,
run the test.
Your GDC will give you the value of the $\chi^2$ statistic as well as the p-value and the expected frequencies.
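For readers who want to see what sits behind the calculator output, the following sketch computes a chi-squared statistic, the degrees of freedom and a p-value for the hair colour and eye colour table. It is an illustration only, not a description of how any particular GDC is implemented. The p-value line relies on the fact that, for exactly 2 degrees of freedom, the upper-tail probability of the chi-squared distribution is exp(-x/2); this shortcut does not apply to other degrees of freedom.

```python
from math import exp

# Observed frequencies: rows Black, Blonde hair; columns Blue, Brown, Green eyes.
observed = [
    [17, 12, 29],
    [31, 25, 21],
]

def chi_squared_independence(table):
    """Return (chi-squared statistic, degrees of freedom, expected frequencies)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df, expected

statistic, df, expected = chi_squared_independence(observed)

# Closed form valid only when df == 2: P(chi-squared >= x) = exp(-x / 2).
if df != 2:
    raise ValueError("the exp(-x/2) shortcut only applies to 2 degrees of freedom")
p_value = exp(-statistic / 2)

print(f"statistic = {statistic:.3f}, df = {df}, p-value = {p_value:.4f}")
```

With this table the statistic comes out at roughly 7.4 and the p-value at roughly 0.025, so at an assumed 5% significance level the null hypothesis of independence would be rejected.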
If the null hypothesis is rejected for a test for independence, then what does this suggest about the two variables?
If the null hypothesis is rejected for a test for independence, then this suggests that the two variables are not independent.
However, this conclusion is not definitive as there will still be a small chance that they are independent.
If the null hypothesis is not rejected for a test for independence, then what does this suggest about the two variables?
If the null hypothesis is not rejected for a test for independence, then there is insufficient evidence to suggest that the variables are not independent.
Therefore, this suggests that the two variables could be independent.
However, this conclusion is not definitive.
What is a goodness of fit test used for?
A goodness of fit test is used to test whether data can be modelled by a specified distribution.
True or False?
For a goodness of fit test, you reject the null hypothesis if the test statistic is less than the critical value.
False.
For a goodness of fit test, you reject the null hypothesis if the test statistic is greater than the critical value.
What is meant by the expected frequencies for a goodness of fit test?
The expected frequencies for a goodness of fit test are the frequencies for each outcome if the data follows the specified distribution.
How do you find the expected frequencies for a goodness of fit test?
To find the expected frequencies for a goodness of fit test, you:
find the probability of each outcome assuming the data follows the specified distribution,
multiply the probabilities by the total frequency.
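The two steps above can be sketched as follows, using the B(3, 0.1) model for the number of eggs in a nest that appears elsewhere in these flashcards. The total frequency of 200 nests is a hypothetical figure chosen for illustration, and `binomial_probabilities` is a made-up helper name.

```python
from math import comb

def binomial_probabilities(n, p):
    """P(X = k) for k = 0..n when X ~ B(n, p)."""
    return [comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

total_frequency = 200  # hypothetical number of nests observed

# Step 1: probability of each outcome under the specified distribution.
probabilities = binomial_probabilities(3, 0.1)

# Step 2: multiply each probability by the total frequency.
expected = [prob * total_frequency for prob in probabilities]

for k, (prob, freq) in enumerate(zip(probabilities, expected)):
    print(f"{k} eggs: probability {prob:.3f}, expected frequency {freq:.1f}")
```

Some expected frequencies here are very small (0.2 for three eggs). In practice, outcomes with small expected frequencies are commonly combined into groups before running the test.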
If there are k different outcomes (or groups of outcomes) for a goodness of fit test, then what is the number of degrees of freedom of the test?
If there are k different outcomes (or groups of outcomes) for a goodness of fit test, then there are k-1 degrees of freedom.
In an exam, how do you find the $\chi^2$ statistic for a goodness of fit test?
In an exam, to find the $\chi^2$ statistic for a goodness of fit test you:
use the goodness of fit option on your GDC,
input the observed frequencies as a list,
input the expected frequencies as a separate list,
enter the number of degrees of freedom,
run the test.
Your GDC will give you the value of the $\chi^2$ statistic as well as the p-value.
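As a rough picture of the calculation behind these steps, the sketch below computes a chi-squared goodness of fit statistic from a list of observed frequencies and a list of expected frequencies. The observed frequencies 35, 45 and 20 are borrowed from the height table in these flashcards, but the model probabilities 0.3, 0.4 and 0.3 and the 5% significance level are assumptions made purely for illustration. With three outcomes there are 2 degrees of freedom, which allows the p-value to be written as exp(-x/2); that shortcut is specific to 2 degrees of freedom.

```python
from math import exp

observed = [35, 45, 20]                 # observed frequencies for three outcomes
model_probabilities = [0.3, 0.4, 0.3]   # hypothetical model, assumed for illustration

total = sum(observed)
expected = [p * total for p in model_probabilities]

# Chi-squared statistic: sum of (observed - expected)^2 / expected.
statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1  # k - 1 degrees of freedom

# Closed form valid only for df == 2: P(chi-squared >= x) = exp(-x / 2).
p_value = exp(-statistic / 2)
significance_level = 0.05  # assumed 5% level
reject_null = p_value < significance_level

print(f"statistic = {statistic:.3f}, df = {df}, p-value = {p_value:.4f}")
print("reject H0:", reject_null)
```

Under these assumptions the statistic is about 4.79 and the p-value about 0.091, so the null hypothesis would not be rejected at the 5% level.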
How should you write the null hypothesis of a goodness of fit test?
For example, suppose you are testing whether the number of eggs in a nest can be modelled by B(3, 0.1).
The null hypothesis of a goodness of fit test should be of the form:
$H_0$: variable X follows the distribution...(state the distribution)
For example, $H_0$: the number of eggs in a nest follows the binomial distribution B(3, 0.1).
Suppose you are performing a goodness of fit test to test whether the following data can be modelled by .
What three probabilities would you need to calculate?
Height | Frequency |
35 | |
45 | |
20 |
Suppose you are performing a goodness of fit test to test whether the following data can be modelled by .
You would need to calculate the following three probabilities.
Height | Probability |
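When the specified distribution is normal and the data is grouped into three height classes, the three probabilities are the areas under the normal curve for each class. The sketch below shows one way to compute them. The distribution N(170, 10²) and the class boundaries 165 and 175 are hypothetical values chosen for illustration, since the distribution and class boundaries are not available in this text; only the frequencies 35, 45 and 20 come from the table above.

```python
from statistics import NormalDist

model = NormalDist(mu=170, sigma=10)  # hypothetical N(170, 10^2), not from the source
lower, upper = 165, 175               # hypothetical class boundaries

probabilities = [
    model.cdf(lower),                     # P(X < 165)
    model.cdf(upper) - model.cdf(lower),  # P(165 <= X < 175)
    1 - model.cdf(upper),                 # P(X >= 175)
]

observed = [35, 45, 20]  # frequencies from the height table
total = sum(observed)
expected = [p * total for p in probabilities]

for p, e in zip(probabilities, expected):
    print(f"probability {p:.4f}, expected frequency {e:.2f}")
```

The first and last classes are left open-ended so that the three probabilities add up to 1, which keeps the expected frequencies consistent with the total frequency.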
What is the conclusion if the null hypothesis is rejected for a goodness of fit test?
If the null hypothesis is rejected for a goodness of fit test then there is sufficient evidence to suggest that the data does not follow the specified distribution.
What is the conclusion if the null hypothesis is not rejected for a goodness of fit test?
If the null hypothesis is not rejected for a goodness of fit test then there is insufficient evidence to suggest that the data does not follow the specified distribution.
Therefore, this suggests that the data could follow the specified distribution.
What is a t-test used for?
A t-test is used to compare the means of two normally distributed populations.
In the exam, the population variance will always be unknown.
What two assumptions are made when conducting a (pooled two-sample) t-test?
When conducting a (pooled two-sample) t-test you need to assume that:
the underlying distribution for each variable is normal,
the variances for the two groups are equal.
When would you use a one-tailed t-test?
You would use a one-tailed t-test when you want to test one of the following two hypotheses:
The population mean of one normal distribution is greater than the population mean of another normal distribution.
The population mean of one normal distribution is smaller than the population mean of another normal distribution.
When would you use a two-tailed t-test?
You would use a two-tailed t-test when you want to test whether the population means of two normal distributions are not equal.
In an exam, how do you calculate the p-value for a t-test?
In an exam, to calculate the p-value for a t-test you:
input the data from the sample of the first population in one list on your GDC,
input the data from the sample of the second population in another list on your GDC,
select the pooled two-sample t-test option on your GDC,
choose the form of the alternative hypothesis,
run the test.
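To show what the pooled two-sample t-test option computes, here is a minimal sketch of the test statistic using only the Python standard library. The two samples are hypothetical values made up for illustration. Because the standard library has no t-distribution, the sketch compares the statistic with a two-tailed 5% critical value for 10 degrees of freedom taken from t-tables (2.228) instead of producing the p-value that a GDC would report.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t_statistic(sample_x, sample_y):
    """Return (t, degrees of freedom) for a pooled two-sample t-test."""
    n_x, n_y = len(sample_x), len(sample_y)
    df = n_x + n_y - 2
    # Pooled estimate of the common variance (equal variances are assumed).
    pooled_variance = (
        (n_x - 1) * variance(sample_x) + (n_y - 1) * variance(sample_y)
    ) / df
    standard_error = sqrt(pooled_variance * (1 / n_x + 1 / n_y))
    return (mean(sample_x) - mean(sample_y)) / standard_error, df

# Hypothetical samples, made up for illustration.
sample_x = [5.1, 4.9, 5.6, 5.2, 4.8, 5.0]
sample_y = [5.8, 5.5, 6.0, 5.4, 5.9, 5.7]

t, df = pooled_t_statistic(sample_x, sample_y)

# Two-tailed test at an assumed 5% level; 2.228 is the tabulated value for df = 10.
critical_value = 2.228
reject_null = abs(t) > critical_value

print(f"t = {t:.3f}, df = {df}, reject H0: {reject_null}")
```

For a one-tailed alternative, the comparison would instead use the one-tailed critical value and the sign of t in the direction stated by the alternative hypothesis.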
Using symbols, what does the alternative hypothesis look like for a two-tailed t-test?
The alternative hypothesis for a two-tailed t-test looks like $H_1: \mu_X \neq \mu_Y$, where $\mu_X$ and $\mu_Y$ are the two population means.
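Suppose a one-tailed t-test is used to test whether the mean of population Y is greater than the mean of population X. What would the conclusion be if the null hypothesis is not rejected?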
If the null hypothesis is not rejected then there is insufficient evidence to suggest that the mean of population Y is greater than the mean of population X.
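Suppose a one-tailed t-test is used to test whether the mean of population Y is greater than the mean of population X. What would the conclusion be if the null hypothesis is rejected?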
If the null hypothesis is rejected then there is sufficient evidence to suggest that the mean of population Y is greater than the mean of population X.
True or False?
If the p-value of a two-tailed t-test is greater than the significance level then the means of two normal populations are equal.
False.
If the p-value of a two-tailed t-test is greater than the significance level then the test concludes that there is insufficient evidence to say that the means are different.
This does not guarantee that the means are equal.