3.2.1 Justification of the one-sample $t$-test

The general approach to conducting a hypothesis test is to

1. Calculate a test statistic;
2. Compare the test statistic to a sampling distribution, using either a critical value or a $p$-value.

This is a valid method for testing all sorts of parameters from different statistical models. The test statistic and sampling distribution change according to the parameter being tested. You will see further examples throughout the course.

We now justify the use of both the $t$ statistic and Student's $t$-distribution. First recall the null hypothesis,

$$H_{0}:\mu=\mu_{0}.$$

Since an unbiased estimator of $\mu$ is the sample mean $\bar{X}$, it seems sensible to compare the observed sample mean $\bar{x}$ to the hypothesised population mean $\mu_{0}$. The question is then:

When $\mu=\mu_{0}$, how far from $\mu_{0}$ can we reasonably expect $\bar{x}$ to be?

To answer this we need to look at the sampling distribution of the estimator $\bar{X}$. This will show us, for a given sample size $n$, how much sampling variability we can expect in the sample mean. We start with the case where the population variance $\sigma^{2}$ is known, since in this case we can use the Central Limit Theorem to derive the sampling distribution of the test statistic.

Theorem 1.

If $X_{1},\ldots,X_{n}$ are IID random variables with mean $\mu$ and variance $\sigma^{2}<\infty$, then as $n\rightarrow\infty$,

$$\bar{X}\sim\operatorname{Normal}\left(\mu,\frac{\sigma^{2}}{n}\right).$$

This is an exact result if $X_{1},\ldots,X_{n}$ are Normal random variables, and so holds for all $n$. Otherwise the limiting result follows by application of the Central Limit Theorem.
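Theorem 1 can be checked empirically. The following is a minimal sketch, assuming an Exponential(1) population (so $\mu=1$ and $\sigma^{2}=1$) and a sample size of $n=30$; the function name `simulate_sample_means`, the population, the sample size and the seed are all illustrative choices, not part of the course material.

```python
import random
import statistics


def simulate_sample_means(n, reps, rate=1.0, seed=0):
    """Draw `reps` samples of size `n` from an Exponential(rate) population
    and return the list of their sample means."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.expovariate(rate) for _ in range(n))
        for _ in range(reps)
    ]


# Assumed population: Exponential(1), which has mean 1 and variance 1.
n = 30
means = simulate_sample_means(n=n, reps=20_000)

# Theorem 1 suggests the sample means should centre on mu = 1
# with variance close to sigma^2 / n = 1/30.
print("mean of sample means:    ", statistics.fmean(means))
print("variance of sample means:", statistics.variance(means))
print("theoretical variance:    ", 1.0 / n)
```

Even though the Exponential population is far from Normal, the simulated sample means should centre on $\mu$ with variance close to $\sigma^{2}/n$.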

If $H_{0}$ is true, we can adjust this to,

$$\bar{X}\sim\operatorname{Normal}\left(\mu_{0},\frac{\sigma^{2}}{n}\right).$$

By the usual standardisation technique,

$$Z=\frac{\bar{X}-\mu_{0}}{\sigma/\sqrt{n}}\sim\operatorname{Normal}\left(0,1\right).$$

This gives us both a test statistic $Z$ and its sampling distribution, but only if we know the population variance $\sigma^{2}$.
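For the known-variance case, the two-step procedure (calculate a test statistic, then compare it to its sampling distribution) can be sketched as below. This is an illustration only: the function name `one_sample_z_test`, the data values, and the choices $\mu_{0}=5.0$, $\sigma=0.3$ and significance level $0.05$ are assumptions made for the example.

```python
from math import sqrt
from statistics import NormalDist, fmean


def one_sample_z_test(data, mu0, sigma, alpha=0.05):
    """Two-sided one-sample z-test of H0: mu = mu0 with known sigma.

    Returns (z, critical value, p-value, whether H0 is rejected).
    """
    n = len(data)
    # Step 1: calculate the test statistic.
    z = (fmean(data) - mu0) / (sigma / sqrt(n))
    # Step 2: compare it to the Normal(0, 1) sampling distribution,
    # via both a critical value and a p-value.
    std_normal = NormalDist()
    critical = std_normal.inv_cdf(1 - alpha / 2)
    p_value = 2 * (1 - std_normal.cdf(abs(z)))
    return z, critical, p_value, abs(z) > critical


# Made-up measurements, for illustration only.
sample = [5.1, 4.9, 5.6, 5.3, 5.0, 5.4, 5.2, 5.5]
z, critical, p_value, reject = one_sample_z_test(sample, mu0=5.0, sigma=0.3)
print(f"z = {z:.3f}, critical value = {critical:.3f}, "
      f"p = {p_value:.4f}, reject H0: {reject}")
```

The critical-value and $p$-value comparisons always agree: $|z|$ exceeds the critical value exactly when the $p$-value is below the significance level.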

When the population variance is unknown, it is replaced by the sample variance $s^{2}$, but this is itself a realisation of a random variable $S^{2}$ (different samples of size $n$ will each have a different sample variance). Deriving the sampling distribution of

$$T=\frac{\bar{X}-\mu}{S/\sqrt{n}}$$

is therefore considerably harder, as it involves two random variables, $\bar{X}$ and $S$, computed from the same sample. In fact, the $t$-distribution was first tabulated as the sampling distribution for this test statistic by William S. Gosset (work published in 1908) as part of his work in quality control for the Guinness brewery in Dublin. Gosset found that the Normal distribution was no longer an appropriate sampling distribution for $T$, due to the extra uncertainty introduced when estimating $\sigma$ by $S$ (especially for small samples). His derivation of the $t$-distribution had three steps:

1. Showing that the sampling distribution of $(n-1)S^{2}/\sigma^{2}$ is $\chi^{2}_{n-1}$;
2. Showing that $S^{2}$ is independent of $\bar{X}$;
3. Showing that the ratio of a standard Normal random variable to the square root of an independent $\chi^{2}_{n-1}$ random variable divided by its degrees of freedom, $n-1$, follows a $t_{n-1}$-distribution.

The entire proof takes up a page of A4 paper. To cover this thoroughly is beyond the scope of this course.
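Although the proof is omitted, its practical consequence can be illustrated by simulation. The sketch below assumes Normal data with a small sample size of $n=5$, and estimates how often $|T|$ exceeds the critical value that would give a 5% two-sided test if $T$ were Normal(0, 1). The function name `simulate_t_statistics`, the sample size, the number of replications and the seed are illustrative assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist, fmean, stdev


def simulate_t_statistics(n, reps, mu=0.0, sigma=1.0, seed=1):
    """Simulate T = (xbar - mu) / (s / sqrt(n)) for Normal samples of size n."""
    rng = random.Random(seed)
    values = []
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        # statistics.stdev uses the n - 1 denominator, i.e. the sample
        # standard deviation s.
        values.append((fmean(sample) - mu) / (stdev(sample) / sqrt(n)))
    return values


n = 5
t_values = simulate_t_statistics(n=n, reps=50_000)

# Critical value for a 5% two-sided test under Normal(0, 1).
z_crit = NormalDist().inv_cdf(0.975)
tail_fraction = sum(abs(t) > z_crit for t in t_values) / len(t_values)

print(f"fraction of |T| > {z_crit:.2f}: {tail_fraction:.3f} "
      "(would be about 0.05 if T were Normal)")
```

For such a small sample the simulated fraction should come out well above 0.05, reflecting the heavier tails of the $t_{n-1}$-distribution compared with the Normal distribution.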