Consider an experiment in which independent Bernoulli trials are carried out, each with probability of success being . Let be the random variable reporting the number of successes in these trials. The induced sample space is . The random variable is termed a Binomial random variable with parameters and . We say .
Examples include:
the number of heads in tosses of a of biased coin,
the number of patients with cancer in the next examined,
the number of 6ft tall smokers in a tutorial of size .
The derivation is a little more complex here so first consider the case with and denoting success and failure respectively. The sample space for the experiment is
The random variable of interest, , is the number of successes.
Find for .
Previously, when , we used equi-probable outcomes to derive the pmf. This is not possible with an arbitrary . Instead we need to use independence to calculate the probabilities of the sample points. This results in the following calculations:
with for other values of .
A general formula which summarises these results is
for .
Show that
The binomial theorem states that
Putting and gives the result.
The more general form for the pmf is as follows:
The pmf of a Binomial random variable is
for , with otherwise, where .
For any sample point with ’s and ’s, the probability of the event consisting solely of that sample point is by independence.
There are sample points with successes and failures (choose of the trials to be , with the others ).
Hence .
∎
Show that
See Example 5.5
The software package R can evaluate pmfs from standard probability models, including the Binomial:
The rv . Use R to evaluate and plot the pmf of . Repeat with .
dbinom(0:3,size=3,prob=0.5) dbinom(0:3,size=3,prob=0.4) # Note how the probabilities change. p = dbinom(0:3,size=3,prob=0.5) barplot(p, names.arg=c(0:3))
Find the probability of rolling a fair die and finding
sixes in rolls,
sixes in rolls,
at least sixes in rolls.
sixes in rolls: model as
sixes in rolls: model as
at least 2 sixes in rolls: model as
We could calculate these In R using the following commands:
dbinom(2,size=4,prob=1/6) dbinom(2,size=5,prob=1/6) 1-dbinom(0,size=4,prob=1/6)-dbinom(1,size=4,prob=1/6)
There are two families each with three children. If each gender has the same probability and the genders of the children are independent then find the probability that the families have the same number of girls.
R hint: sum( dbinom(0:3, size=3, prob=1/2)^2 )
Let be the number of girls in a family. Because of independence between children and constant probability for each child model For the two families and .
Now