Académique Documents
Professionnel Documents
Culture Documents
Fatous Lemma
Theorem 21.1 Consider a probability space (, F, P). Let X : R R be a continuous random variable.
Let g be a measurable function which is either non-negative or satisfies |g|dPX < .Then,
Z
E[g(X)] = gfX d.
K
P
Proof: Let us first consider the case of g being a simple function i.e. g = ai IAi for some measurable
i=1
disjoint subsets Ai over the real line. We then have
Z
E[g(X)] = g dPX
K
X
= ai PX (Ai ) [g is a simple function]
i=1
K
X Z
= ai fX d [From Radon-Nikodym Theorem]
i=1 Ai
K Z
X
= ai fX d [ai is a constant]
i=1 A
i
K Z
X
= (ai IAi fX )d [IAi is the indicator random variable of event Ai ]
i=1
21-1
21-2 Lecture 21: Expectation of CRVs, Fatous Lemma and DCT
K
Z X
= (ai IAi fX )d [Interchanging finite summation and integral]
i=1
Z K
!
X
= (ai IAi ) fX d
i=1
Z
= (gfX )d.
Thus we have proved the above theorem for simple functions. We now assume g to be a non-negative mea-
surable function which may not necessarily be simple.
Let gn be an increasing sequence of non-negative simple functions that converge to g point wise. One way
of coming up with such a sequence was discussed in the previous lecture. We then have,
Z
E[g(X)] = lim gn dPX [From MCT]
n
Z
= lim gn fX d [From result for simple functions]
n
Z
= gfX d. [From MCT, since gn fX gfX ]
For arbitrary g which are absolutely integrable, a similar proof can be worked out by writing g = g+ g
and proceeding.
Example 1: Let X be an exponential random variable with parameter . Find E[X] and E[X 2 ].
Solution: Recall that for an exponential random variable with parameter , fX (x) = ex . Thus, we have
Z Z
1
E[X] = xfX d = xex dx = .
0
Z Z
2
2
E[X ] = 2
x fX d = x2 ex dx = .
2
0
Z Z
1 (x)2
E[X] = xfX d = x e 22 dx = .
2
Z Z
1 (x)2
2
E[X ] = 2
x fX d = x2 e 22 dx = 2 + 2 .
2
Lecture 21: Expectation of CRVs, Fatous Lemma and DCT 21-3
2 1
Example 3: Let X be a one-sided Cauchy random variable i.e. fX (x) = 1+x2 for x 0. Find E[X].
Solution: We have
Z Z
2 1
E[X] = xfX d = x dx = .
1 + x2
0
Solution: In this case the random variable X takes both positive and negative values. Hence, we need to
find E[X+ ] and E[X ] seperately and then evaluate E[X] = E[X+ ] E[X ]. Recall that X+ = max(X, 0)
and X = min(X, 0). Thus,
X+ () = 0 for A = { |X() < 0},
X+ () = X() for Ac .
Similarly,
X () = 0 for B = { |X() > 0},
X () = X() for B c .
Similarly,
Z
E[X ] = xdPX
Z0
1 1
E[X ] = 0 P(B) + x dx = .
1 + x2
The above limit does not exist and hence the integral is not defined.
min(X, Y ) X.
min(X, Y ) Y.
E [min(X, Y )] E[X].
E [min(X, Y )] E[Y ].
The other statment of the lemma involving maximum of X and Y can be proved in a similar way and is left
to the reader as an exercise.
The above lemma can be generalized to any finite collection of random variables and a similar proof can be
worked out. Fatous Lemma generalizes this idea for a sequence of random variables.
Lemma 21.3 Fatous Lemma: Let Y be a random variable that satisfies E[|Y |] < . Then the following
holds,
h i
If Y Xn , for all n, then E lim inf Xn lim inf E[Xn ].
n n
If Y Xn , for all n, then E lim sup Xn lim sup E[Xn ].
n n
Proof: Let us start by proving the first statement. For some n we have
inf Xk Y Xm Y, m n.
kn
Taking expectations,
E inf Xk Y E[Xm Y ], m n.
kn
h i
E lim inf Xn Y lim inf E[Xn Y ].
n n
As E[|Y |] < , we can invoke linearity of expectation to get the first result of Fatous lemma.
The second statement can be proved similarly and is left to the reader as an exercise.
The DCT is an important result which asserts a sufficient condition under which we can interchange a limit
and integral.
Theorem 21.4 Consider a sequence of random variables Xn that converges almost surely to X. Suppose
there exists a random variable Y such that |Xn | Y almost surely for all n and E[Y ] < . Then, we have
Proof: We have |Xn | Y which implies Y Xn Y . We can now apply Fatous lemma to obtain
h i
E[X] = E lim inf Xn lim inf E[Xn ] lim sup E[Xn ] E lim sup Xn = E[X].
n n n n
Thus, all the inequalities in the above equation must be met with equalities and we have
which proves that the limit, lim E[Xn ] exists and is given by
n
Thus we see that Dominated Convergence theorem (DCT) is a direct consequence of Fatous Lemma. The
name dominated is intuitive because we need |Xn | to be bounded by some random variable Y almost
surely for every n. However, we do not require Xn s to be monotonically increasing as in the case of MCT.
Corollary 21.5 A special case of DCT is known as Bounded Convergence theorem (BCT). Here, the random
variable Y is taken to be a constant random variable. BCT states that if there exists a constant c R such
that |Xn | c almost surely for all n, then lim E[Xn ] = E[X].
n
21-6 Lecture 21: Expectation of CRVs, Fatous Lemma and DCT
21.4 Exercise
1. [MIT OCW problem set] A workstation consists of three machines, M1 , M2 and M3 , each of which
will fail after an amount of time Ti which is an independent exponentially distributed random variable,
with parameter 1. Assume that the times to failure of the different machines are independent. The
workstation fails as soon as both of the following have happened:
(a) Machine M1 has failed.
(b) Atleast one of the machines M2 or M3 has failed.