2.3 Empirical probability

A serious limitation of classical probability is that it only applies in situations where the sample space is finite and all outcomes are equiprobable. While this might be useful for drawing cards, rolling dice, or pulling balls from urns, it offers no method for dealing with outcomes of unequal probability, or with sample spaces that are infinite.

The frequency, or empirical, approach to probability is based on the idea that the underlying probability of an event can be measured by repeated trials. Suppose that ${A}$ is an event for some experiment. If we repeat the experiment ${n}$ times, we might hope that the proportion of trials in which ${A}$ occurs stabilises as ${n{\rightarrow}\infty}$ . We would like to call this limiting proportion Prob( ${A}$ ). More precisely,

$${{\rm Prob}(A)=\lim_{n{\rightarrow}\infty}\frac{n_{A}}{n},}$$

where ${n_{A}}$ is the number of times event ${A}$ occurs after ${n}$ trials.
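The stabilisation of ${n_{A}/n}$ can be illustrated with a short simulation. This is only a sketch under stated assumptions: the function name `relative_frequencies`, the bias `p_heads = 0.2` and the seed are hypothetical choices, and the pseudo-random generator already presupposes a probability for heads, so the output illustrates the idea rather than justifying it.

```python
import random

def relative_frequencies(p_heads, checkpoints, seed=0):
    """Simulate coin tosses and return {n: n_A / n} at each checkpoint n.

    A is the event "the toss is heads"; p_heads is the assumed bias.
    """
    rng = random.Random(seed)
    n_A = 0
    results = {}
    for n in range(1, max(checkpoints) + 1):
        if rng.random() < p_heads:   # heads occurs on this trial
            n_A += 1
        if n in checkpoints:
            results[n] = n_A / n     # proportion of heads so far
    return results

# Assumed bias of 1/5; print the proportion of heads for growing n.
for n, freq in relative_frequencies(0.2, (10, 100, 1_000, 10_000, 100_000)).items():
    print(n, freq)
```

For small ${n}$ the printed proportions can be far from 0.2; for large ${n}$ they typically settle close to it.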

Example 2.21.

• If you toss a coin 1000 times and get heads 200 times, that suggests the coin is biased and ${{\rm Prob}(\{H\})\approx 1/5}$ .
• Suppose a survey asks 500 people how they will vote in the next election and 150 say they support Labour. If ${A}$ is the event that a given person supports Labour, then ${{\rm Prob}(A)\approx 3/10}$ .

In both cases, we would expect increasing the number of trials to improve the approximation.

What can we say about:

• Prob( ${A}$ ) in general? ${n_{A}\geq 0}$ , so ${n_{A}/n\geq 0}$ for every ${n}$ and hence ${{\rm Prob}(A)\geq 0}$ .
• Prob( ${\Omega}$ )? ${n_{\Omega}=n}$ , so ${{\rm Prob}(\Omega)=1}$ .
• Prob( ${\emptyset}$ )? ${n_{\emptyset}=0}$ , so ${{\rm Prob}(\emptyset)=0}$ .
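These three observations can be checked on any finite record of trials. The sketch below uses a made-up list of ten die rolls (the data, the event `A = {1, 2}` and the helper `count` are all hypothetical, chosen only for illustration).

```python
def count(event, outcomes):
    """n_E: the number of trials whose outcome lies in the event E."""
    return sum(1 for x in outcomes if x in event)

# Hypothetical record of n = 10 rolls of a six-sided die.
outcomes = [3, 6, 1, 4, 4, 2, 5, 6, 1, 3]
n = len(outcomes)

omega = {1, 2, 3, 4, 5, 6}   # the whole sample space
A = {1, 2}                   # an arbitrary event
empty = set()                # the impossible event

print(count(A, outcomes) / n)      # non-negative, since n_A >= 0
print(count(omega, outcomes) / n)  # n_Omega = n, so the proportion is 1
print(count(empty, outcomes) / n)  # n_emptyset = 0, so the proportion is 0
```

The same relations hold for every ${n}$ , which is why they carry over to the limit.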

Furthermore, if ${A}$ and ${B}$ are exclusive events, and ${C=A\cup B}$ , set

• ${n_{A}}$ to be the number of times ${A}$ occurs in the first ${n}$ trials
• ${n_{B}}$ to be the number of times ${B}$ occurs in the first ${n}$ trials
• ${n_{C}}$ to be the number of times ${C}$ occurs in the first ${n}$ trials

Then ${n_{C}=n_{A}+n_{B}}$ since ${A}$ and ${B}$ are exclusive. Therefore

$${\frac{n_{C}}{n}=\frac{n_{A}}{n}+\frac{n_{B}}{n}.}$$

Taking the limit as ${n{\rightarrow}\infty}$ we see that

$${{\rm Prob}(C)={\rm Prob}(A)+{\rm Prob}(B).}$$

But since ${C=A\cup B}$ , we have

$${{\rm Prob}(A\cup B)={\rm Prob}(A)+{\rm Prob}(B)}$$

for the exclusive events ${A}$ and ${B}$ .
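The counting step behind this argument, ${n_{C}=n_{A}+n_{B}}$ for every ${n}$ , can be checked numerically. The following is a minimal sketch, assuming simulated rolls of a fair die and the hypothetical exclusive events `A = {1, 2}` and `B = {5}`; exact fractions are used so that the comparison is not affected by floating-point rounding.

```python
import random
from fractions import Fraction

def count(event, outcomes):
    """Number of trials whose outcome lies in the given event."""
    return sum(1 for x in outcomes if x in event)

def relative_frequencies(outcomes, A, B):
    """Return (n_A/n, n_B/n, n_C/n) as exact fractions, where C = A | B."""
    n = len(outcomes)
    C = A | B
    return tuple(Fraction(count(E, outcomes), n) for E in (A, B, C))

# Assumed setup: 1000 simulated rolls of a fair die, with exclusive events.
rng = random.Random(1)
rolls = [rng.randint(1, 6) for _ in range(1000)]
A, B = {1, 2}, {5}
assert A.isdisjoint(B)   # the argument relies on A and B being exclusive

for n in (10, 100, 1000):
    f_A, f_B, f_C = relative_frequencies(rolls[:n], A, B)
    print(n, f_C, f_A + f_B, f_C == f_A + f_B)
```

The final column is `True` for every ${n}$ because each trial in which ${C}$ occurs is counted in exactly one of ${n_{A}}$ and ${n_{B}}$ .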

However, how can we know whether this reasoning is valid? It seems intuitively reasonable, but we cannot be sure. In particular, it is impossible to conduct an infinite number of trials, and it is unclear how large ${n}$ must be to give a good approximation. More seriously, ${\frac{n_{A}}{n}}$ may not converge at all; and even if it does, a fresh sequence of trials need not give the same limit.
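To see that non-convergence is at least logically possible, consider an artificial sequence of outcomes, constructed by hand rather than produced by any real experiment. Trials are grouped into blocks of lengths 1, 2, 4, 8, and so on; ${A}$ occurs on every trial of blocks 0, 2, 4, ... and on no trial of blocks 1, 3, 5, .... The sketch below (the function name is hypothetical) computes ${n_{A}/n}$ exactly at the end of each block.

```python
from fractions import Fraction

def frequencies_at_block_ends(num_blocks):
    """Return [(n, n_A / n)] at the end of each block of the artificial sequence.

    Block k has length 2**k; A occurs throughout even-numbered blocks
    and never in odd-numbered blocks.
    """
    n = n_A = 0
    results = []
    for k in range(num_blocks):
        length = 2 ** k
        n += length
        if k % 2 == 0:       # A occurs on every trial of this block
            n_A += length
        results.append((n, Fraction(n_A, n)))
    return results

for n, freq in frequencies_at_block_ends(12):
    print(n, freq, float(freq))
```

At the end of every odd-numbered block the proportion is exactly 1/3, while at the end of every even-numbered block it exceeds 2/3, so ${n_{A}/n}$ keeps oscillating and has no limit for this sequence.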

The modern theory of probability works the other way round: we assume that for each event ${A}$ there exists a number ${{\rm P}(A)}$ called the probability of ${A}$ , and place axioms on the function ${{\rm P}}$ . We will see that these axioms imply the convergence we hope to see.