Home page for accesible maths 3.4 The t distribution for the difference of two means

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

3.4.1 Sampling distributions for the difference in two means

In the example of two exam versions, the teacher would like to evaluate whether there is convincing evidence that the difference in average scores between the two exams is not due to chance.

It will be useful to extend the t distribution method from Section 3.3 to apply to a difference of means:

x¯1-x¯2  as a point estimate for  μ1-μ2

Our procedure for checking conditions mirrors what we did for large samples in Section 3.2. First, we verify the small sample conditions (independence and nearly normal data) for each sample separately, then we verify that the samples are also independent. For instance, if the teacher believes students in her class are independent, the exam scores are nearly normal, and the students taking each version of the exam were independent, then we can use the t distribution for inference on the point estimate x¯1-x¯2.

The formula for the standard error of x¯1-x¯2, introduced in Section 3.2, also applies to small samples:

SEx¯1-x¯2=SEx¯12+SEx¯22=s12n1+s22n2 (3.3)

Because we will use the t distribution, we will need to identify the appropriate degrees of freedom. This can be done using computer software. An alternative technique is to use the smaller of n1-1 and n2-1, which is the method we will apply.3535This technique for degrees of freedom is conservative with respect to a Type 1 Error; it is more difficult to reject the null hypothesis using this df method.



Using the t distribution for a difference in means The t distribution can be used for inference when working with the standardized difference of two means if (1) each sample meets the conditions for using the t distribution and (2) the samples are independent. We estimate the standard error of the difference of two means using Equation (3.3).