Home page for accesible maths 2 Events

Style control - access keys in brackets

Font (2 3) - + Letter spacing (4 5) - + Word spacing (6 7) - + Line spacing (8 9) - +

2.3 Empirical probability

A serious limitation of classical probability is that it only applies in situations where the sample space is finite and all outcomes are equiprobable. While this might be useful for drawing cards, rolling dice, or pulling balls from urns, it offers no method for dealing with outcomes with unequal probabilities, or where the sample space may be infinite.

The frequency or empirical approach to probability is based on the idea that the underlying probability of an event can be measured by repeated trials. Supposing that A is an event for some experiment, then if you repeat the experiment a number of times, n, we might hope that the proportion of trials in which A occurs tends to stabilise as n. We would like to call this Prob(A). More precisely

Prob(A)=limnnAn,

where nA is the number of times event A occurs after n trials.

Example 2.21.
  • If you toss a coin 1000 times and get heads 200 times that suggests the coin is biased and Prob({H}) 1/5.

  • Suppose a survey asks 500 people how they will vote in the next election and 150 say they support Labour. If A is the event that a given person supports Labour, then Prob(A)3/10.

In both cases, increasing the number of trials will improve the approximation.


What can we say about:

  • Prob(A) in general?
    nA0, so Prob(A)0.

  • Prob(Ω)?
    nΩ=n, so Prob(Ω)=1.

  • Prob()?
    n=0, so Prob()=0.

Furthermore, if A and B are exclusive events, and C=AB, set

  • nA to be the number of times A occurs in the first n trials

  • nB to be the number of times B occurs in the first n trials

  • nC to be the number of times C occurs in the first n trials

Then nC=nA+nB since A and B are exclusive. Therefore

cnn=ann+bnn.

Taking the limit as n we see that

Prob(C)=Prob(A)+Prob(B).

But since C=AB, we have

Prob(AB)=Prob(A)+Prob(B)

for the exclusive events A and B.

However, how can we know if this thinking is valid? It seems intuitively reasonable, but we can’t be sure. In particular, it is impossible to conduct an infinite number of trials, and it is unclear how large n must be to give a good approximation. More seriously, nAn may not converge at all, or even if it does, if we repeat the experiment again, we may not necessarily obtain the same limit.

The modern theory of probability works the other way round: we assume that for each event A there exists a number P(A) called the probability of A, and place axioms on the function P. We will see that these axioms imply the convergence we hope to see.