Home page for accesible maths 3 Probability Models

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

3.7 Normal Distribution: 𝖭(μ,σ2)

A random variable X has a normal distribution if its pdf is given by

fX(x;𝜽)=12πσ2exp{-(x-μ)22σ2}

for -<x<, with parameters 𝜽=(μ,σ2). We write XN(μ,σ2). This pdf is illustrated on Figure 3.5 (First Link, Second Link). Notice that all the curves are symmetric around μ with a characteristic bell-shape, the width of which is controlled by σ2.

Figure 3.5: First Link, Second Link, Caption: The pdf for the normal distribution with two different sets of parameter values.

The normal distribution has played a central role in the history of probability and statistics. It was introduced by the French mathematician Abraham de Moivre in 1733, who used it to approximate probabilities of winning in various game of chance involving coin tossing. It was later used by the German mathematician Carl Friedrich Gauss to predict the location of astronomical bodies and became known as the Gaussian distribution.

In statistics it is by far the most important distribution. Traditionally, it has been viewed as the natural distribution of measurement errors, yields from field experiments etc. The theoretical justification for this is the central limit theorem (CLT, see Chapter 9), which says that the sum of a large number of independent random variables each of which is small compared to the sum will be approximately normally distributed. The central limit theorem is also the reason why the normal distribution often occurs as the approximate distribution of estimators in statistics.

A random variable Z, which has a Normal distribution with μ=0 and σ=1 is said to have the Standard Normal distribution, and, by definition, has a density of

ϕ(z)=12πexp(-z2/2)

for -<z<. This pdf is shown in the top left panel of Figure 3.5 (First Link, Second Link). The corresponding cdf is

Φ(z)=𝖯(Zz)=-zϕ(t)dt=-z12πexp(-t2/2)dt.

Values of Φ(z) can be obtained from a table of standard normal probabilities, although we will use the R function pnorm().

The following will be proved in Chapter 4, but will be useful right now in simplifying a number of our calculations.

Theorem 3.7.1.

If X𝖭(μ,σ2), then the random variable

Z=X-μσ𝖭(0,1)

and conversely, if Z𝖭(0,1), then the random variable

X=μ+σZ𝖭(μ,σ2).

We will now show that the normal density is indeed a density. We then find the moments of the standard normal distribution and use Theorem 3.7.1 to obtain the corresponding moments of the general normal distribution.

The density formula clearly cannot be negative. Firstly we substitute s=(v-μ)/σ so ds=dv/σ to obtain:

-fX(v)dv =-12πσ2exp{-(v-μ)22σ2}dv
=12π-exp(-s2/2)ds.

So it is sufficient to show that ϕ(s) integrates to 1. We do this and find the moments of the standard normal distribution at the same time. For r0, let

J(r)=12π-srexp(-s2/2)ds.

For odd r, the integrand is an odd function of s and so J(r)=0; i.e. 𝖤[Zr]=0.

For even r, the integrand is an even function of s and so

J(r)=22π0srexp(-s2/2)ds=2π0sr-1exp(-s2/2)sds.

Substituting t=s2/2, so dt=sds and s=(2t)1/2, gives

J(r) =2π0(2t)(r-1)/2exp(-t)dt
=2r/2π0t(r+1)/2-1exp(-t)dt
=2r/2πΓ(r+12).

With r=0 this is 1πΓ(1/2)=1, so the density does integrate to 1.

With r=2 it is

2πΓ(3/2)=2ϕ12Γ(1/2)=1,

so 𝖤[Z2]=1, and since 𝖤[Z]=0, we have 𝖵𝖺𝗋[Z]=1.

With r=4 it is

22πΓ(5/2)=22π3212Γ(1/2)=3,

so 𝖤[Z4]=3 as mentioned in Chapter 2.

Since X=μ+σZ,

  1. 𝖤[X]=𝖤[μ+σZ]=μ+σ𝖤[Z]=μ,

  2. 𝖵𝖺𝗋[X]=σ2𝖵𝖺𝗋[Z]=σ2,

as we would expect.

Mathematically, probability statements for a general normal random variable can always be simplified to probability statements for a standard normal random variable, and can often then be written in terms of Φ. Thus, our R examples will always use the standard normal; if you use the general normal in R, note that the functions use the standard deviation σ as an argument, not the variance σ2.

Example 3.7.1.

A normal model is proposed to model the variation in height H of women with parameters μ=170 and σ2=36 measured in cm. Write in terms of Φ, and evaluate numerically, the probability a randomly selected woman is over 180 cm tall.

Solution.  H𝖭(170,36), so

𝖯(H>180) =𝖯((H-170)/6>(180-170)/6)
=𝖯(Z>10/6)=1-𝖯(Z<5/3)
=1-Φ(5/3)
=0.0478,

since 1-pnorm(5/3) gives 0.04779035.

Example 3.7.2.

Find the lower quartile of XN(1,9).

Solution. 

0.25 =𝖯(Xx0.25)
=𝖯(X-13x0.25-13)
=𝖯(Zx0.25-13)=Φ(x0.25-13).

So x0.25=1+3Φ-1(0.25)=-1.023 (evaluated as 1+3*qnorm(0.25)).

Example 3.7.3.

XN(5,4), find 𝖯(X2<9) in terms of Φ, the cdf of the standard normal distribution.

Solution. 

𝖯(X29) =𝖯(-3X3)
=𝖯(-3-52Z3-52)
=Φ(-1)-Φ(-4).