Home page for accesible maths 3.3 One-sample means with the

t

t

confidence intervals 3.4 The

t

distribution for the difference of two means

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

3.3.5 One sample $t$ tests

An SAT preparation company claims that its students’ scores improve by over 100 points on average after their course. A consumer group would like to evaluate this claim, and they collect data on a random sample of 30 students who took the class. Each of these students took the SAT before and after taking the company’s course, and so we have a difference in scores for each student. We will examine these differences $x_{1}=57$ , $x_{2}=133$ , …, $x_{30}=140$ as a sample to evaluate the company’s claim. (This is paired data, so we analyse the score differences; for a review of the ideas of paired data, see Section 3.1.) The distribution of the differences, shown in Figure LABEL:satImprovementHTDataHistogram, has mean 135.9 and standard deviation 82.2. Do these data provide convincing evidence to back up the company’s claim?

Example 3.3.9

Set up hypotheses to evaluate the company’s claim. Use $\mu_{{}_{diff}}$ to represent the true average difference in student scores.

Answer. This is a one-sided test. $H_{0}$ : student scores do not improve by more than 100 after taking the company’s course. $\mu_{{}_{diff}}=100$ (we always write the null hypothesis with an equality). $H_{A}$ : students scores improve by more than 100 points on average after taking the company’s course. $\mu_{{}_{diff}}>100$ .

Example 3.3.10

Are the conditions to use the $t$ distribution method satisfied?

Answer. This is a random sample from less than 10% of the company’s students (assuming they have more than 300 former students), so the independence condition is reasonable. The normality condition also seems reasonable based on Figure LABEL:satImprovementHTDataHistogram. We can use the $t$ distribution method. Note that we could use the normal distribution. However, since the sample size ( $n=30$ ) just meets the threshold for reasonably estimating the standard error, it is advisable to use the $t$ distribution. Just as we did for the normal case, we standardize the sample mean using the Z score to identify the test statistic. However, we will write $T$ instead of $Z$ , because we have a small sample and are basing our inference on the $t$ distribution:

\displaystyle T=\frac{\bar{x}-\text{null value}}{SE}=\frac{135.9-100}{82.2/% \sqrt{30}}=2.39

If the null hypothesis was true, the test statistic $T$ would follow a $t$ distribution with $df=n-1=29$ degrees of freedom. We can draw a picture of this distribution and mark the observed $T$ , as in Figure LABEL:pValueShownForSATHTOfOver100PtGain. The shaded right tail represents the p-value: the probability of observing such strong evidence in favour of the SAT company’s claim, if the average student improvement is really only 100.

Example 3.3.11

Use R to identify the p-value. What do you conclude?

Answer. We use 29 degrees of freedom. The value $t=2.39$ thus pt(2.39,df=29) $=\mathbb{P}(T_{29}<2.39)=0.9882165$ . Therefore the p-value is 1-0.9882165= 0.0117835 and is less than 0.05 (the default significance level), so we reject the null hypothesis. The data provide convincing evidence to support the company’s claim that student scores improve by more than 100 points following the class.

Example 3.3.12

Because we rejected the null hypothesis, does this mean that taking the company’s class improves student scores by more than 100 points on average?

Answer. This is an observational study, so we cannot make this causal conclusion. For instance, maybe SAT test takers tend to improve their score over time even if they don’t take a special SAT class, or perhaps only the most motivated students take such SAT courses.