Home page for accesible maths 4.2 Parametric confidence intervals

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

4.2.1 One-sample mean

We construct confidence intervals for the population mean μ of a Normal distribution. Again we assume that our data x1,,xn are realisations of a sequence of IID random variables X1,,Xn with Normal(μ,σ2) distribution. As before, if the sample size is large enough the Normal assumption can be relaxed using the Central Limit Theorem. We obtain the 95% confidence interval first, before giving the formula more generally.

As we saw earlier,

X¯-μS/ntn-1.

We can therefore say that

Pr(tn-1(0.025)X¯-μS/ntn-1(0.975))=0.95

where

  • tn-1(0.025) is the 2.5% quantile of the tn-1-distribution, and

  • tn-1(0.975) is the 97.5% quantile of the tn-1-distribution.

By the symmetry of the t-distribution, tn-1(0.025)=-tn-1(0.975), so that we can write

Pr(-tn-1(0.975)X¯-μS/ntn-1(0.975))=0.95

Rearranging,

0.95 =Pr(-tn-1(0.975)S/nX¯-μtn-1(0.975)S/n)
=Pr(-X¯-tn-1(0.975)S/n-μ-X¯+tn-1(0.975)S/n)
=Pr(X¯-tn-1(0.975)S/nμX¯+tn-1(0.975)S/n).

And the 95% confidence interval for μ is given by {mdframed}

(X¯-tn-1(0.975)S/n,X¯+tn-1(0.975)S/n).
Remark.

The end points of the confidence interval are random variables, since they are functions of the estimators X¯ and S. Different samples of the same size therefore produce different confidence intervals. In practice, X¯ is replaced by x¯ and S2 is replaced by s2.

To extend this to obtain the more general 100(1-α)% confidence interval, simply replace the 97.5% quantiles of the t-distribution with the 100(1-α/2)% quantiles, {mdframed}

(X¯-tn-1(1-α/2)S/n,X¯+tn-1(1-α/2)S/n). (4.1)
TheoremExample 4.2.1 Arctic sea ice

Use the sample of Arctic sea ice data from Example 4.1.1 to create a 95% confidence interval for the population mean μ of the minimum sea ice extent.

We use the formula given in equation (4.1). For a 95% confidence interval, α=0.05, and so we need to find the t9(0.975) quantile. Either from tables or using R,

> qt(0.975,9)
[1] 2.262157

so t9(0.975)=2.26. From previous examples, x¯=6.06, s=0.911 and n=10. So using equation (4.1), the 95% confidence interval for μ is

(6.06-2.26×0.911/10,6.06+2.26×0.911/10)=(5.41,6.71)×106km2.
Remark.

Can you create a 90% confidence interval for the Arctic sea ice data? How does it compare to the 95% confidence interval?