Sequences of Normal random variables

Suppose $X_1, X_2, \ldots$ are independent and identically distributed Normal random variables, $X_i \sim N(\mu, \sigma^2)$. What can we say about $\bar{X}_n$?

Let us first recall the important ‘convolution’ property of the normal distribution, Theorem 6.4.2 (see also Chapter 8, Example 8.2.1). If $X_i$ ($i = 1, \ldots, n$) are independent and each has a Normal distribution then so does $X_1 + X_2$.

By induction, therefore, $S_n = \sum_{i=1}^{n} X_i$ also has a Normal distribution.

Thus $\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$ has a Normal distribution too.

This is the reason why the normal distribution plays such a central role in probability theory. For instance, it is because of this property that the normal distribution, and not some other distribution, appears in the Central Limit Theorem (see later).

From the previous subsection we know the expectation and variance of $\bar{X}_n$, and hence:

\[
\bar{X}_n \sim N(\mu, \sigma^2/n). \tag{9.1}
\]
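As a quick numerical check of (9.1), the following Python sketch (an illustration, not part of the notes) simulates many replications of $\bar{X}_n$ and compares the sample mean and variance of the simulated averages with $\mu$ and $\sigma^2/n$; the values $\mu = 2$, $\sigma = 3$ and $n = 50$ are arbitrary illustrative choices.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(0)

# Illustrative values (not from the notes): mu = 2, sigma = 3, n = 50.
mu, sigma, n = 2.0, 3.0, 50
reps = 100_000

# Each row is one sample X_1, ..., X_n; each row mean is one draw of X-bar_n.
xbar = rng.normal(mu, sigma, size=(reps, n)).mean(axis=1)

# (9.1) predicts E[X-bar_n] = mu = 2 and Var(X-bar_n) = sigma^2 / n = 9/50 = 0.18.
print("sample mean of X-bar_n:    ", xbar.mean())
print("sample variance of X-bar_n:", xbar.var(ddof=1))
\end{verbatim}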
Example 9.1.1.

Find, in terms of the CDF of the standard Normal, $\Phi$,
\[
\mathsf{P}\left(|\bar{X}_n - \mu| > 0.01\right),
\]
and hence find $\lim_{n\to\infty} \mathsf{P}\left(|\bar{X}_n - \mu| > 0.01\right)$.

Solution. 

\begin{align*}
\mathsf{P}\left(|\bar{X}_n - \mu| > 0.01\right) &= \mathsf{P}\left(\bar{X}_n - \mu > 0.01\right) + \mathsf{P}\left(\bar{X}_n - \mu < -0.01\right) \\
&= 2\,\mathsf{P}\left(\bar{X}_n - \mu < -0.01\right)
\end{align*}

by symmetry – draw it. So

\begin{align*}
\mathsf{P}\left(|\bar{X}_n - \mu| > 0.01\right) &= 2\,\mathsf{P}\left(\frac{\bar{X}_n - \mu}{\sigma/\sqrt{n}} < \frac{-0.01\sqrt{n}}{\sigma}\right) \\
&= 2\,\Phi\left(\frac{-0.01\sqrt{n}}{\sigma}\right).
\end{align*}

Hence

\[
\lim_{n\to\infty} \mathsf{P}\left(|\bar{X}_n - \mu| > 0.01\right) = 2 \lim_{n\to\infty} \Phi\left(\frac{-0.01\sqrt{n}}{\sigma}\right) = 0.
\]
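To see how quickly this limit is approached, the short Python sketch below (illustrative only, taking $\sigma = 1$ purely for concreteness) evaluates $2\Phi(-0.01\sqrt{n}/\sigma)$ for increasing $n$ using the standard Normal CDF from scipy.stats.

\begin{verbatim}
from scipy.stats import norm

sigma = 1.0  # illustrative value; the probability depends on sigma

# Evaluate 2 * Phi(-0.01 * sqrt(n) / sigma) for increasing n.
for n in [100, 10_000, 1_000_000, 100_000_000]:
    p = 2 * norm.cdf(-0.01 * n**0.5 / sigma)
    print(f"n = {n:>11,}:  P(|X-bar_n - mu| > 0.01) = {p:.3e}")
\end{verbatim}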

Clearly the same argument holds for any $\epsilon > 0$, so for the mean of $n$ IID Normal random variables

\[
\mathsf{P}\left(|\bar{X}_n - \mu| > \epsilon\right) \to 0 \quad \text{as } n \to \infty.
\]

Crudely put, for large $n$, $\bar{X}_n \approx \mu$ (probably). This type of convergence is called ‘convergence in probability’; i.e. $\bar{X}_n \to \mu$ in probability. You will cover this type of convergence (and other types) in the third year module on probability.
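The same behaviour can be seen directly by simulation: the Python sketch below (illustrative only, with $\mu = 0$, $\sigma = 1$ and $\epsilon = 0.1$ chosen arbitrarily) estimates $\mathsf{P}(|\bar{X}_n - \mu| > \epsilon)$ by Monte Carlo for increasing $n$, and the estimates shrink towards $0$.

\begin{verbatim}
import numpy as np

rng = np.random.default_rng(1)

# Illustrative values (not from the notes).
mu, sigma, eps = 0.0, 1.0, 0.1
reps = 10_000

for n in [10, 100, 1000]:
    # Simulate reps independent copies of X-bar_n.
    xbar = rng.normal(mu, sigma, size=(reps, n)).mean(axis=1)
    est = np.mean(np.abs(xbar - mu) > eps)
    print(f"n = {n:>4}: estimated P(|X-bar_n - mu| > {eps}) = {est:.4f}")
\end{verbatim}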

The Weak Law of Large Numbers (see next section) will show that this property often holds for averages of random variables even when they do not have a Normal distribution.

What about deviations from $\mu$? From (9.1),

\[
\bar{X}_n - \mu \sim N(0, \sigma^2/n),
\]

and as $n \to \infty$, the distribution of $\bar{X}_n - \mu$ tends to $N(0,0)$, a point mass at $0$, which we knew anyway. To obtain a useful distribution we must rescale, dividing $\bar{X}_n - \mu$ by its SD:

\[
\frac{\bar{X}_n - \mu}{\sigma/\sqrt{n}} \sim N(0,1),
\]

or equivalently

\[
\frac{\sqrt{n}\,(\bar{X}_n - \mu)}{\sigma} \sim N(0,1),
\]

and hence

\[
\mathsf{P}\left(\frac{\sqrt{n}\,(\bar{X}_n - \mu)}{\sigma} \le z\right) = \Phi(z).
\]

If the discrepancy from $\mu$ is scaled by the standard deviation of the $X_i$ and stretched by $\sqrt{n}$ then it has a standard Normal distribution.
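The following Python sketch (again illustrative, with arbitrary values $\mu = 5$, $\sigma = 2$ and $n = 30$) simulates the rescaled quantity $\sqrt{n}(\bar{X}_n - \mu)/\sigma$ and compares its empirical CDF with $\Phi$ at a few values of $z$; the two agree up to simulation error, exactly as the display above states.

\begin{verbatim}
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(2)

# Illustrative values (not from the notes).
mu, sigma, n, reps = 5.0, 2.0, 30, 100_000

xbar = rng.normal(mu, sigma, size=(reps, n)).mean(axis=1)
z_scores = np.sqrt(n) * (xbar - mu) / sigma   # rescaled deviations from mu

for z in [-1.96, -1.0, 0.0, 1.0, 1.96]:
    print(f"z = {z:5.2f}:  empirical {np.mean(z_scores <= z):.4f}"
          f"   Phi(z) = {norm.cdf(z):.4f}")
\end{verbatim}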

If the $X_i$ have a distribution other than the Normal then the average of any $n$ of them is, in general, not Normal. However, the Central Limit Theorem will show that, if the same rescaling is applied, then for large enough $n$ the distribution (and hence the CDF) of $\sqrt{n}\,(\bar{X}_n - \mu)/\sigma$ can be made as close as we like to that of a standard Normal: for any $z$,

\[
\mathsf{P}\left(\frac{\sqrt{n}\,(\bar{X}_n - \mu)}{\sigma} \le z\right) \to \Phi(z) \quad \text{as } n \to \infty.
\]

This type of convergence is called ‘convergence in distribution’; i.e. for any $Z \sim N(0,1)$, $\sqrt{n}\,(\bar{X}_n - \mu)/\sigma \to Z$ in distribution. You will cover this type of convergence (and other types) in the third year module on probability.
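As a preview of the Central Limit Theorem, the sketch below (illustrative only) repeats the rescaling with Exponential(1) random variables, which are far from Normal but have $\mu = 1$ and $\sigma = 1$, and compares the empirical CDF of $\sqrt{n}(\bar{X}_n - \mu)/\sigma$ at $z = 1$ with $\Phi(1)$ as $n$ grows.

\begin{verbatim}
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(3)

# Exponential(1) is far from Normal but has mu = 1 and sigma = 1.
mu, sigma, reps = 1.0, 1.0, 50_000

for n in [5, 30, 200]:
    x = rng.exponential(scale=1.0, size=(reps, n))
    z_scores = np.sqrt(n) * (x.mean(axis=1) - mu) / sigma
    # Compare the empirical CDF of the rescaled mean with Phi at z = 1.
    print(f"n = {n:>3}: empirical P(Z_n <= 1) = {np.mean(z_scores <= 1.0):.4f}"
          f"  vs  Phi(1) = {norm.cdf(1.0):.4f}")
\end{verbatim}

The agreement improves as $n$ increases, which is exactly the convergence in distribution described above.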