Distributions of sums and quotients
Distribution of X+Y
Let be random variables (not necessarily independent) with joint distribution . We want to find .
Thus,
Additionally, if and are independent, we have
The above expression is analogous to the convolution product defined for densities of discrete random variables.
Example 1.
Let .
Tools
The dominated convergence theorem
We will use the DCT frequently in the coming proofs.
Theorem 2(DCT for sequences of sequences).
Let be a sequence for . Assume a summable positive sequence exists such that for all and , that is, for all . Let the sequence of sequences converge to a sequence pointwise, that is for all . Then, each is summable, is summable, and
To put it simply, if a sequence of sequences is bounded by a summable sequence and converges pointwise to a sequence, then the limit of its sum is the sum of its limit. Here, summable means absolutely convergent. Note that the conclusion that each is summable follows from the hypothesis that it is bounded by a summable sequence.
Theorem 3(DCT for sequences of functions).
Let be a measurable function for . Assume an integrable positive function exists such that for all . Let the sequence of functions converge to a function pointwise. Then, is integrable, is integrable, and
Here, integrable means Lebesgue integrable. Any measurable function that is absolutely dominated by an integrable function is integrable1 (thus, the conclusion that each is integrable follows from the hypothesis that it is measurable and bounded by an integrable function).
Theorem 4(DCT for sequences of random variables).
Let be a sequence of random variables. Let be a random variable such that for every , we have , that is, converges to pointwise. Assume there is an integrable random variable such that . Then,
Proof.
Treat each, and as measurable functions (they are measurable by the definition of a random variable)
where is the Borel -algebra on . The probability measure on plays the role of the Lebesgue measure. We are given that , and that
So, we have all the hypotheses of the DCT, which enables us to write
□
A random variable is said to be integrable if it has finite expectation.
Fubini’s Theorem
Used to justify swapping integrals.
Theorem 5(Fubini's Theorem).
For a function defined on , if
then the double integral equals the iterated integrals in either order.
Characteristic functions
Definition 6.
is a complex random variable if and are both real random variables.
Definition 7.
Let be a complex random variable. has finite expectation if and have finite expectation, in which case we define
Note the following facts for real random variables:
- .
- If , then .
It is easy to verify that for complex random variables and .
Theorem 8.
Let be a complex random variable. Then, .
Proof.
Since for some , we have
Now, . Thus, we have
□
Note the following facts for all , which are also easy to verify (just use the Taylor expansion for ):
- ,
- .
Definition 9.
Let be a random variable. Define the characteristic function of by
If is continuous, we have
It is clear that .
Characteristic functions of common distributions
Let .
Let .
Let .
From the dominated convergence theorem (one can take the dominating function to be ), we have
This yields a simple differential equation.
tells us that . Thus,
If , then .
Characteristic function of sum of independent random variables
Theorem 10.
Properties of characteristic functions
Property 0: , .
Theorem 11(Property 1).
A characteristic function is uniformly continuous.
Proof.
Let be the characteristic function of .
From the dominated convergence theorem,
Since as , given , we can choose such that if .□
Definition 12.
A function is called positive definite if for all and , ,
Theorem 13(Property 2).
A characteristic function is positive definite.
Proof.
□
Bochner’s Theorem
Note that for any distribution function , there exists a random variable with distribution .
Bochner’s Theorem claims that the properties listed in the previous section completely characterize characteristic functions.
Theorem 14(Bochner's Theorem).
If satisfies
- , ;
- is continuous;
- is positive definite;
Then there exists a distribution function such that if is a random variable with distribution , .
(continuity and positive definiteness together apparently imply uniform continuity.)
In other words, there exists a surjective map from the space of all distribution functions to the space of all functions satisfying the three listed properties (called characteristic functions from now on).
We will now prove that this map is injective.
Inverse theorem
Inverse theorem for integer valued random variables
Theorem 15.
Let be an integer valued random variable. Let be the mass function of , and let be the characteristic function of . Then,
Proof.
Compute:
□Justifying swapping the sum and integral
Now, we need to justify swapping the sum and the integral (see here for more). From Tonelli’s theorem, we have if for all , without any further conditions needed. Then Fubini’s theorem says that for general , if or (by Tonelli the two are equivalent), then . This can also be proven using DCT: Consider the functions
Clearly, pointwise. Now,
The constant function is integrable on the bounded interval . Thus,
Inverse theorem for discrete random variables
Theorem 16.
Let be a discrete random variable with density and characteristic function . Then,
Proof.
Note that the support of is countable.
□
Inverse theorem for continuous random variables
Theorem 17.
Let be a continuous random variable with continuous density and integrable characteristic function (). Then,
Proof.
Notice that
Let’s compute the limit on the left.
To use Fubini’s Theorem to swap the integrals, we must show that the integrand is absolutely integrable.
Now, we can swap those pesky integrals:
Substitute .
If we dress the expression nicely, we’ll see that the inner integral is the characteristic function of the normal distribution evaluated at , which we have already computed:
Substitute .
Passing the limit inside is tricky; since may not be bounded and does not vanish on taking absolute value, using the DCT directly is difficult. Instead, we use the DCT on an compact interval (where is bounded) to show that in , we can take the limit inside. We then show that as we increase , the integral on vanishes (requires justification I don’t have time for now).
Thus, the expression becomes
□
Footnotes
-
Note that there does not exist a similar property for Riemann integrals, that is, being absolutely dominated by a Riemann integrable function does not imply Riemann integrability. Even if we assume Riemann integrability in the hypothesis, we cannot conclude that the limit is Riemann integrable (here’s an example). The closest analogue of the DCT in Riemann land does away with the dominating function and requires the sequence of functions to converge uniformly instead. ↩