The general procedure for conducting a hypothesis test is to:
1. Calculate a test statistic;
2. Compare the test statistic to its sampling distribution, using either a critical value or a $p$-value.
This is a valid method for testing all sorts of parameters from different statistical models. The test statistic and sampling distribution change according to the parameter being tested. You will see further examples throughout the course.
We give a justification for the use of both the $t$ statistic and Student's $t$-distribution. First recall the null hypothesis,
$$H_0: \mu = \mu_0.$$
Since an unbiased estimator of $\mu$ is the sample mean $\bar{X}$, it seems sensible to compare the observed sample mean $\bar{x}$ to the hypothesised population mean $\mu_0$. The question is then:
When $\mu = \mu_0$, how far from $\mu_0$ can we reasonably expect $\bar{x}$ to be?
To answer this we need to look at the sampling distribution of the estimator $\bar{X}$. This will show us, for a given sample size $n$, how much sampling variability we can expect in the sample mean. We start with the case where the population variance $\sigma^2$ is known, since in this case we can use the Central Limit Theorem to help derive the sampling distribution of the test statistic.
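If $X_1, \dots, X_n$ are IID random variables with mean $\mu$ and variance $\sigma^2$, then as $n \to \infty$ the sample mean is approximately distributed as
$$\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right).$$
This is an exact result if $X_1, \dots, X_n$ are Normal random variables, and so holds for all $n$. Otherwise the limiting result follows by application of the Central Limit Theorem.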
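The claim about the sampling distribution of the sample mean can be checked by simulation. The sketch below is an illustration only: it assumes an Exponential(1) population (mean 1, variance 1), chosen here because it is clearly non-Normal, and the function name `simulate_sample_means` is hypothetical. It compares the simulated mean and variance of $\bar{X}$ with $\mu$ and $\sigma^2/n$.

```python
import random
import statistics


def simulate_sample_means(n, reps=5_000, seed=1):
    """Draw `reps` samples of size n from an Exponential(1) population
    (assumed here: mean 1, variance 1) and return the sample means."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.expovariate(1.0) for _ in range(n))
        for _ in range(reps)
    ]


for n in (5, 30, 200):
    means = simulate_sample_means(n)
    print(
        f"n={n:3d}  mean of x-bar = {statistics.fmean(means):.3f}  "
        f"var of x-bar = {statistics.variance(means):.4f}  "
        f"sigma^2/n = {1 / n:.4f}"
    )
```

The simulated variance of the sample mean should sit close to $\sigma^2/n$ for every $n$; what improves with $n$ is how closely the shape of the distribution resembles a Normal curve, which a histogram of `means` would show.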
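If $H_0$ is true, then $\mu = \mu_0$ and the distribution of $\bar{X}$ becomes
$$\bar{X} \sim N\left(\mu_0, \frac{\sigma^2}{n}\right).$$
By the usual standardisation technique,
$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} \sim N(0, 1).$$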
This then gives us both a test statistic and its sampling distribution, but only if we know the population variance $\sigma^2$.
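As a minimal sketch of the known-variance case, the following computes the standardised statistic $(\bar{x} - \mu_0)/(\sigma/\sqrt{n})$ and compares it to the standard Normal distribution using both a critical value and a $p$-value. The function name `z_test`, the two-sided alternative, the 5% significance level, and the data are all assumptions made for illustration.

```python
from math import sqrt
from statistics import NormalDist, fmean


def z_test(sample, mu0, sigma, alpha=0.05):
    """Two-sided z-test of H0: mu = mu0, assuming the population
    standard deviation sigma is known."""
    n = len(sample)
    z = (fmean(sample) - mu0) / (sigma / sqrt(n))
    std_normal = NormalDist()  # N(0, 1)
    critical = std_normal.inv_cdf(1 - alpha / 2)
    p_value = 2 * (1 - std_normal.cdf(abs(z)))
    return z, critical, p_value


# Hypothetical data, for illustration only.
sample = [5.1, 4.9, 5.6, 5.8, 5.2, 5.4, 5.0, 5.7]
z, critical, p_value = z_test(sample, mu0=5.0, sigma=0.4)
print(f"z = {z:.3f}, critical value = {critical:.3f}, p-value = {p_value:.4f}")
print("reject H0" if abs(z) > critical else "do not reject H0")
```

The two comparisons always agree: $|z|$ exceeds the critical value exactly when the $p$-value falls below the chosen significance level.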
When the population variance $\sigma^2$ is unknown, it is replaced by the sample variance $s^2$, but this is itself a realisation of a random variable $S^2$ (different samples of size $n$ will each have a different sample variance). Deriving the sampling distribution of
$$T = \frac{\bar{X} - \mu_0}{S/\sqrt{n}}$$
is therefore considerably harder, as it is the ratio of two related random variables $\bar{X}$ and $S$. In fact, the $t$-distribution was first tabulated as the sampling distribution for this test statistic by William S. Gosset (work published in 1908) as part of his work in quality control for the Guinness brewery in Dublin. Gosset found that the Normal distribution was no longer an appropriate sampling distribution for $T$, due to the extra uncertainty introduced when estimating $\sigma^2$ (especially for small samples). His derivation of the $t$-distribution had three steps:
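1. Showing that the sampling distribution of $(n-1)S^2/\sigma^2$ is $\chi^2_{n-1}$;
2. Showing that $\bar{X}$ is independent of $S^2$;
3. Showing that the ratio of a standard Normal random variable to the square root of an independent $\chi^2$ random variable divided by its degrees of freedom follows a $t$-distribution.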
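As a sketch of how the three steps fit together, assume the sample comes from a Normal population and that $H_0$ is true. The symbols $Z$ and $V$ are shorthand introduced here for illustration:
$$Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}} \sim N(0, 1), \qquad V = \frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1},$$
with $Z$ and $V$ independent. The unknown $\sigma$ then cancels in the ratio:
$$T = \frac{Z}{\sqrt{V/(n-1)}} = \frac{\bar{X} - \mu_0}{S/\sqrt{n}} \sim t_{n-1}.$$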
The entire proof takes up a page of A4 paper. To cover this thoroughly is beyond the scope of this course.
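Although the proof is omitted, the effect of estimating the variance can be seen by simulation. The sketch below is an illustration under assumed settings (a standard Normal population with $H_0$ true, and a hypothetical function name `tail_rate`): it computes $(\bar{x} - \mu_0)/(s/\sqrt{n})$ for many simulated samples and records how often its absolute value exceeds the Normal 5% critical value.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean


def tail_rate(n, reps=10_000, mu0=0.0, sigma=1.0, seed=1):
    """Fraction of simulated Normal samples of size n (H0 true) whose
    t statistic exceeds the standard Normal two-sided 5% critical value."""
    rng = random.Random(seed)
    critical = NormalDist().inv_cdf(0.975)  # about 1.96
    exceed = 0
    for _ in range(reps):
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        xbar = fmean(sample)
        # Sample standard deviation with the n - 1 divisor.
        s = sqrt(sum((x - xbar) ** 2 for x in sample) / (n - 1))
        t = (xbar - mu0) / (s / sqrt(n))
        if abs(t) > critical:
            exceed += 1
    return exceed / reps


for n in (5, 30):
    print(f"n={n:2d}  proportion with |T| > 1.96: {tail_rate(n):.3f}")
```

If the Normal distribution were appropriate, the proportion would be close to 0.05 for every $n$. For small samples it should come out noticeably higher, reflecting the heavier tails of the $t$-distribution, and it should move towards 0.05 as $n$ grows.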