We now justify the confidence interval approach in the case of the mean of a Normal distribution.
Suppose that we have observed realisations of IID random variables which have a distribution. To test, at the level,
vs.
we calculate the appropriate test statistic
Then is rejected if or .
By the symmetry of the -distribution, our rejection criterion could be written instead as,
as shown in Figure 4.1.
The probability that we accept , given that is true, is
But this is exactly the definition of the % confidence interval given in equation (4.1).
To test at an level
For a two-tailed alternative hypothesis, we reject if the hypothesised value lies outside the confidence interval.
For a one-tailed alternative hypothesis , we reject if and only if the hypothesised value lies above the confidence interval.
For a one-tailed alternative hypothesis , we reject if and only if the hypothesised value lies below the confidence interval.
Some things to note:
The size of a confidence interval depends on both the sample size and .
If all else is equal, what should happen to the confidence interval as the sample size increases?
A larger sample size will lead to narrower confidence intervals as there is more information and the uncertainty in the point estimate is less.
If all else is equal, what should happen to the confidence interval if is decreased?
A smaller value of will lead to wider confidence intervals.
Since is the probability of rejecting the null hypothesis, given that it is in fact true i.e. the probability of a Type I error, the smaller this probability, the fewer times we should incorrectly reject and therefore the larger the confidence interval needs to be.
In the case of the population mean, the size of the confidence interval also depends on the size of the population variance. All else being equal, a larger variance will lead to a wider confidence interval. The intuition for this is that, for a larger variance, there is less chance that a sample of size will capture the full population behaviour.