Convergence
Definition 1.
A sequence of random variables converges almost surely to a random variable if
Denoted by .
Definition 2.
A sequence of random variables converges in probability to a random variable if for all ,
Denoted by .
Definition 3.
A sequence of random variables with distribution functions converges in distribution to a random variable with distribution function if for all at which is continuous,
Denoted by .
The limit of a sequence of random variables is almost surely (that is, bar a set of measure zero) unique for almost sure convergence and for convergence in probability. In other words, we have or , then is uniquely determined for all but a set of measure zero. This is not the case for convergence in distribution; take an iid sequence for example:
Theorem 4.
Almost sure convergence implies convergence in probability.
Proof.
If converges to almost surely, it means that the event has probability . Fix . Consider the sequence of sets
This sequence of sets is decreasing () towards the set
Thus, we have
We shall now show that is . For any point outside of , we have , which implies that for all for some . In particular, for such , the point will not lie in , and hence won’t lie in . Therefore, , and so .
Finally,
which by definition means converges in probability to .□
Theorem 5.
Convergence in probability implies convergence in distribution.
Proof.
We want to show that
Let .
Combining the two inequalities, we get
Take limsup:
Take liminf:
Thus,
If is continuous at , . Then, it follows that
□
Note that , where and are distributions, does not imply . The converse is true, however.
Theorem 6.
Let be distribution functions with densities and . If pointwise, then pointwise.
Proof.
We want to prove that
Now,
so it suffices to prove that the integral on the right converges to as . Since
we have
So,
and
Now, note that , and is integrable. Thus, using DCT, we have
□
Theorem 7(Lemma).
If , , and
then .
Proof.
We want to show that
As before, we can bound:
Take liminf on the left, limsup on the right as :
Take limit as .
Using the hypothesis,
It follows that at all continuous points of , .□
Theorem 8.
Let be a sequence of random variables with characteristic functions . Then iff pointwise.
Proof.
Let be a random variable independent from and . Let . Then, . Define
Let be the density of , and be the density of . Then, we have
If these densities are not continuous, we can arrive at these expressions using the convolution product:
From the dominated convergence theorem, we have pointwise for each (the dominating function being ). We have shown that this implies pointwise for each , which by definition means . Now,
Thus, as . This implies .
Here, . Thus,
The lemma is applicable, and we have .□