When estimating the mean and standard error from a small sample, the distribution is a more accurate tool than the normal model. This is true for both small and large samples.
TIP: When to use the distribution
Use the distribution for inference of the sample mean when observations are independent and
nearly normal. You may relax the nearly normal condition as the sample size increases. For example,
the data distribution may be moderately skewed when the sample size is at least 30.
To proceed with the distribution for inference about a single mean, we must check two conditions.
We verify this condition just as we did before. We collect a simple random sample from less than 10% of the population, or if it was an experiment or random process, we carefully check to the best of our abilities that the observations were independent.
This second condition is difficult to verify with small data sets. We often (i) take a look at a plot of the data for obvious departures from the normal model, and (ii) consider whether any previous experiences alert us that the data may not be nearly normal.
When examining a sample mean and estimated standard error from a sample of independent and nearly normal observations, we use a distribution with degrees of freedom (). For example, if the sample size was 19, then we would use the distribution with degrees of freedom and proceed exactly as we did in Chapter 2.6, except that now we use the distribution, i.e. pt or qt in R.