1 Introduction to stochastic processes

1.3 Conditional expectation

Whilst not specific to stochastic processes, the following theorem will be central to how analyse a number of stochastic processes during the course.

Theorem 1.3.1 (Tower property).

For random variables U and V,

E(E(U|V))=E(U).

Illustration. Suppose U is income and V is age for a certain population. The idea of average (or expected) income is straight forward. So is the idea of the expected income for a given age - a conditional expectation, formed from the conditional distribution of income given age. If this conditional expectation of income is then averaged over age, we will just get the expected income in the whole population. This may seem indirect, but conditional distributions are useful for characterising populations. We shall find this simple result very useful.

Assume we have 5 people, two in their twenties who earn £100 and £150 per week; and 3 in their thirties who earn £150, £180 and £220 per week.
Let U denote income in pounds per week, and V denote age (V=0 denotes in twenties; V=1 denotes in thirties) of a randomly chosen person.

E(U|V=0)=100+1502=125P(V=0)=25
E(U|V=1)=150+180+2203=5503

So

E{E(U|V)} = (100+1502)25+(150+180+2203)35
= 100+150+150+180+2005=160

Check this is E(U):

E(U)=100+150+150+180+2205=160.
Proof.

For discrete variables, assuming that U and V have a joint distribution:

P(U=i,V=j)=pij

so that the marginal distribution of V is

P(V=j)=ipij=p.j

and the conditional distribution of U given V is

P(U=i|V=j)=pijp.j.

Then the conditional expectation of U given V is formed from this as

E(U|V=j)={iipijp.j},

the value depending on the value j of V. Now form the expectation of this quantity with respect to the distribution of V, to get:

E(E(U|V))=j{iipijp.j}p.j=jiipij=iijpij=iipi.=E(U).

Exercise 1.3.2.

Calculate the expectation of the number of times you need to toss a fair coin until you first throw a head. Remember to condition on the outcome of the first toss.

Let T be the time until the first head and V be the outcome of the first throw.

Figure 1.2: Link, Caption: none provided

So, by Theorem 1.3.1,

E(T)=E(E(T|V))=(12)1+(12)(1+E(T))=1+12E(T).

Solving this linear equation in E(T) gives E(T)=2.

Corollary 1.3.3.

If we consider an event A and random variable V

P(A)=E(P(A|V)).
Remark.

We interpret the right-hand side as

E(P(A|V))=jP(A|V=j)P(V=j).
Proof.

Use Theorem 1.3.1 with

U={1if A,0otherwise,

and note that E(U)=P(A), and E(U|V=j)=P(A|V=j). ∎