Distributions of sums and quotients

Distribution of X+Y

Let $X, Y$ be random variables (not necessarily independent) with joint distribution $f_{X, Y}$ . We want to find $F_{X + Y} (z)$ .

F_{X + Y} (z) = P (X + Y \leq z) = \iint_{{x + y \leq z}} f (x, y) d x d y = \int_{- \infty}^{\infty} \int_{- \infty}^{z - x} f (x, y) d y d x . = \int_{- \infty}^{\infty} \int_{- \infty}^{z} f (x, u - x) d u d x = \int_{- \infty}^{z} (\int_{- \infty}^{\infty} f (x, u - x) d x) d u .

Thus,

f_{X + Y} (u) = \int_{- \infty}^{\infty} f (t, u - t) d t .

Additionally, if $X$ and $Y$ are independent, we have

f_{X + Y} (u) = \int_{- \infty}^{\infty} f_{X} (t) f_{Y} (u - t) d t = \int_{- \infty}^{\infty} f_{Y} (t) f_{X} (u - t) d t .

The above expression is analogous to the convolution product defined for densities of discrete random variables.

Example 1(Example).

Let $X, Y \sim Exp (λ)$ .
$f_{X + Y} (u) = \int_{0}^{u} λ e^{- λ t} λ e^{- λ (u - t)} d t = λ^{2} \int_{0}^{u} e^{- λ u} d t = λ^{2} u e^{- λ u} .$

Tools

The dominated convergence theorem

We will use the DCT frequently in the coming proofs.

Theorem 2(DCT for sequences of sequences).

Let $f_{n} : N \to R$ be a sequence for $n \in N$ . Assume a summable positive sequence $r : N \to R_{\geq 0}$ exists such that $∣ f_{n} (i) ∣ \leq r (i)$ for all $n$ and $i$ , that is, $∣ f_{n} ∣ \leq r$ for all $n$ . Let the sequence of sequences $(f_{n})$ converge to a sequence $f$ pointwise, that is $f_{n} (i) \to f (i)$ for all $i$ . Then, each $f_{n}$ is summable, $f$ is summable, and
$n \to \infty lim i = 1 \sum \infty f_{n} (i) = i = 1 \sum \infty n \to \infty lim f_{n} (i) = i = 1 \sum \infty f (i) .$

To put it simply, if a sequence of sequences is bounded by a summable sequence and converges pointwise to a sequence, then the limit of its sum is the sum of its limit. Here, summable means absolutely convergent. Note that the conclusion that each $f_{n}$ is summable follows from the hypothesis that it is bounded by a summable sequence.

Theorem 3(DCT for sequences of functions).

Let $f_{n} : R \to R$ be a measurable function for $n \in N$ . Assume an integrable positive function $r : R \to [0, \infty)$ exists such that $∣ f_{n} ∣ \leq r$ for all $n$ . Let the sequence of functions $(f_{n})$ converge to a function $f$ pointwise. Then, $f_{n}$ is integrable, $f$ is integrable, and
$n \to \infty lim \int_{- \infty}^{\infty} f_{n} (μ) d μ = \int_{- \infty}^{\infty} n \to \infty lim f_{n} (μ) d μ = \int_{- \infty}^{\infty} f (μ) d μ .$

Here, integrable means Lebesgue integrable. Any measurable function that is absolutely dominated by an integrable function is integrable¹ (thus, the conclusion that each $f_{n}$ is integrable follows from the hypothesis that it is measurable and bounded by an integrable function).

Theorem 4(DCT for sequences of random variables).

Let $(X_{n})$ be a sequence of random variables. Let $X$ be a random variable such that for every $ω \in Ω$ , we have $X_{n} (ω) \to X (ω)$ , that is, $(X_{n})$ converges to $X$ pointwise. Assume there is an integrable random variable $Y$ such that $∣ X_{n} ∣ \leq Y$ . Then,
$E X_{n} \to EX .$

Proof.

Treat each $X_{n}$ , $X$ and $Y$ as measurable functions (they are measurable by the definition of a random variable)
$X_{n}, X, Y : (Ω, F) \to (R, B),$
where $B$ is the Borel $σ$ -algebra on $R$ . The probability measure $P$ on $Ω$ plays the role of the Lebesgue measure. We are given that $∣ X_{n} ∣ \leq Y$ , and that
$E Y = \int_{Ω} Y (ω) d P (ω) < \infty.$
So, we have all the hypotheses of the DCT, which enables us to write
$n \to \infty lim E X_{n} = n \to \infty lim \int_{Ω} X_{n} (ω) d P (ω) = \int_{Ω} X (ω) d P (ω) = EX .$ □

A random variable is said to be integrable if it has finite expectation.

Fubini’s Theorem

Used to justify swapping integrals.

Theorem 5(Fubini's Theorem).

For a function $g (t, y)$ defined on $R \times R$ , if
$\int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ∣ g (t, y) ∣ d y d t < \infty$
then the double integral equals the iterated integrals in either order.

Characteristic functions

Definition 6(Definition).

$X : Ω \to C$ is a complex random variable if $Re X$ and $Im X$ are both real random variables.

Definition 7(Definition).

Let $X$ be a complex random variable. $X$ has finite expectation if $Re X$ and $Im X$ have finite expectation, in which case we define
$E (X) = E (Re X) + i E (Im X) .$

Note the following facts for real random variables:

$∣ E (X) ∣ \leq E ∣ X ∣$ .
If $X \leq Y$ , then $E (X) \leq E (Y)$ .

It is easy to verify that $E (α X + Y) = α E (X) + E (Y)$ for complex random variables $X, Y$ and $α \in C$ .

Theorem 8(Theorem).

Let $X$ be a complex random variable. Then, $∣ E (X) ∣ \leq E (∣ X ∣)$ .

Proof.

Since $E (X) = e^{i θ} ∣ E (X) ∣$ for some $θ$ , we have
$∣ E (X) ∣ = e^{- i θ} E (X) = Re (e^{- i θ} E (X)) = Re (E (e^{- i θ} X)) = E (Re (e^{- i θ} X))$
Now, $Re (e^{- i θ} X) \leq ∣ e^{- i θ} X ∣ = ∣ X ∣$ . Thus, we have
$E (Re (e^{- i θ} X)) \leq E (∣ X ∣) .$ □

Note the following facts for all $z \in C$ , which are also easy to verify (just use the Taylor expansion for $e^{z}$ ):

$\frac{d}{d t} e^{z t} = z e^{z t}$ ,
$\int e^{z t} d t = e^{z t} / z$ .

Definition 9(Definition).

Let $X : Ω \to R$ be a random variable. Define the characteristic function of $X$ by
$φ_{X} (t) \equiv E e^{i tX}, t \in R .$

If $X$ is continuous, we have

φ_{X} (t) = \int_{- \infty}^{\infty} e^{i t x} f_{X} (x) d x .

It is clear that $∣ φ_{X} ∣ \leq 1$ .

Characteristic functions of common distributions

Let $X \sim Unif (a, b)$ .

φ_{X} (t) = \int_{a}^{b} e^{i t x} f_{X} (x) d x = \frac{e ^{i t b} - e ^{i t z}}{( b - a ) i t} .

Let $X \sim Exp (λ)$ .

φ_{X} (t) = λ \int_{0}^{\infty} e^{i t x} e^{- λ x} d x = \frac{λ}{λ - i t} .

Let $X \sim n (0, 1)$ .

φ_{X} (t) = \frac{1}{2 π} \int_{- \infty}^{\infty} e^{i t x} e^{- x^{2} /2} d x = \frac{1}{2 π} \int_{- \infty}^{\infty} cos (t x) e^{- x^{2} /2} d x + i = 0 \int_{- \infty}^{\infty} sin (t x) e^{- x^{2} /2} d x = \frac{1}{2 π} \int_{- \infty}^{\infty} cos (t x) e^{- x^{2} /2} d x

From the dominated convergence theorem (one can take the dominating function $g$ to be $∣ x ∣ e^{- x^{2} /2}$ ), we have

\frac{d}{d t} φ_{X} (t) = \frac{- 1}{2 π} \int_{- \infty}^{\infty} sin (t x) x e^{- x^{2} /2} d x . = \frac{- 1}{2 π} ([- sin (t x) e^{- x^{2} /2}]_{- \infty}^{\infty} + t \int_{- \infty}^{\infty} cos (t x) e^{- x^{2} /2} d x) = \frac{- t}{2 π} \int_{- \infty}^{\infty} cos (t x) e^{- x^{2} /2} d x .

This yields a simple differential equation.

\frac{d}{d t} φ_{X} (t) \int \frac{1}{φ _{X} ( t )} d φ_{X} (t) φ_{X} (t) = - t φ_{X} (t) = - \int t d t = c e^{- t^{2} /2}

$φ_{X} (0) = 1$ tells us that $c = 1$ . Thus,

φ_{X} (t) = e^{- t^{2} /2} .

If $Y = μ + σ X$ , then $Y \sim n (μ, σ^{2})$ .

φ_{Y} (t) = E e^{i t (μ + σ X)} = e^{μ i t} E e^{i t σ X} = e^{μ i t} φ_{X} (σ t) = e^{μ i t} e^{- σ^{2} t^{2} /2}

Characteristic function of sum of independent random variables

Theorem 10(Theorem).

If $X$ and $Y$ are independent random variables, then
$φ_{X + Y} (t) = φ_{X} (t) φ_{Y} (t)$

Proof.

$φ_{X + Y} (t) = E e^{i t (X + Y)} = E e^{i tX} e^{i t Y} = E e^{i tX} E e^{i t Y} = φ_{X} (t) φ_{Y} (t) .$ □

Properties of characteristic functions

Property 0: $φ_{X} (0) = 1$ , $∣ φ_{X} (t) ∣ \leq 1$ .

Theorem 11(Property 1).

A characteristic function is uniformly continuous.

Proof.

Let $φ$ be the characteristic function of $X$ .
$∣ φ (t + h) - φ (t) ∣ = ∣ E e^{i tX} (e^{ih X} - 1) ∣ \leq E ∣ e^{i tX} (e^{ih X} - 1) ∣ = E (∣ e^{i tX} ∣∣ e^{ih X} - 1∣) = E ∣ e^{ih X} - 1∣$
From the dominated convergence theorem,
$h \to 0 lim E ∣ e^{ih X} - 1∣ = h \to 0 lim \int_{- \infty}^{\infty} ∣ e^{ih X} - 1∣ f_{X} (x) d x = \int_{- \infty}^{\infty} h \to 0 lim ∣ e^{ih X} - 1∣ f_{X} (x) d x = 0.$
Since $e^{ih X} \to 0$ as $h \to 0$ , given $ϵ$ , we can choose $δ$ such that $E ∣ e^{ih X} - 1∣ < ϵ$ if $∣ h ∣ < δ$ .□

Definition 12(Definition).

A function $φ : R \to C$ is called positive definite if for all $z_{1}, \dots, z_{k} \in C$ and $t_{1}, \dots, t_{k} \in R$ , $k \in N$ ,
$i, j = 1 \sum n z_{i} \overline{z_{j}} φ (t_{i} - t_{j}) \geq 0.$

Theorem 13(Property 2).

A characteristic function is positive definite.

Proof.

$i, j = 1 \sum n z_{i} \overline{z_{j}} φ (t_{i} - t_{j}) = i, j = 1 \sum n z_{i} \overline{z_{j}} E e^{i t_{i} X - i t_{j} X} = i, j = 1 \sum n z_{i} \overline{z_{j}} E e^{i t_{i} X} \overline{e^{i t_{j} X}} = E (i, j = 1 \sum n z_{i} \overline{z_{j}} e^{i t_{i} X} \overline{e^{i t_{j} X}}) = E i = 1 \sum n z_{i} e^{i t_{i} X}^{2} \geq 0.$ □

Bochner’s Theorem

Note that for any distribution function $F$ , there exists a random variable with distribution $F$ .

Bochner’s Theorem claims that the properties listed in the previous section completely characterize characteristic functions.

Theorem 14(Bochner's Theorem).

If $φ : R \to C$ satisfies

$φ (0) = 1$ , $∣ φ ∣ \leq 1$ ;

$φ$ is continuous;

$φ$ is positive definite;

Then there exists a distribution function $F$ such that if $X$ is a random variable with distribution $F$ , $φ_{X} = φ$ .

(continuity and positive definiteness together apparently imply uniform continuity.)

In other words, there exists a surjective map from the space of all distribution functions to the space of all functions satisfying the three listed properties (called characteristic functions from now on).

We will now prove that this map is injective.

Inverse theorem

Inverse theorem for integer valued random variables

Theorem 15(Theorem).

Let $X$ be an integer valued random variable. Let $f_{X}$ be the mass function of $X$ , and let $φ_{X}$ be the characteristic function of $X$ . Then,
$f_{X} (k) = \frac{1}{2 π} \int_{- π}^{π} e^{- i t k} φ_{X} (t) d t .$

Proof.

Compute:
$\frac{1}{2 π} \int_{- π}^{π} e^{- i t k} [j = - \infty \sum \infty e^{ij t} f_{X} (j)] d t$ $= \frac{1}{2 π} j = - \infty \sum \infty f_{X} (j) \int_{- π}^{π} e^{i t (j - k)} d t = f_{X} (k) + \frac{1}{2 π} j \neq = k \sum f_{X} (j) = 0 \int_{- π}^{π} e^{i t (j - k)} d t = f_{X} (k) .$ □

Justifying swapping the sum and integral

Now, we need to justify swapping the sum and the integral (see here for more). From Tonelli’s theorem, we have $\int \sum f_{n} = \sum \int f_{n}$ if $f_{n} \geq 0$ for all $n, x$ , without any further conditions needed. Then Fubini’s theorem says that for general $f_{n}$ , if $\sum \int ∣ f_{n} ∣ < \infty$ or $\int \sum ∣ f_{n} ∣ < \infty$ (by Tonelli the two are equivalent), then $\int \sum f_{n} = \sum \int f_{n}$ . This can also be proven using DCT: Consider the functions
$f_{n} (t) f (t) = j = - n \sum n e^{i t (j - k)} f_{X} (j) for n \in N, = j = - \infty \sum \infty e^{i t (j - k)} f_{X} (j) .$
Clearly, $f_{n} \to f$ pointwise. Now,
$∣ f_{n} ∣ \leq j = - n \sum n ∣ e^{- i t k} e^{ij t} f_{X} (j) ∣ = j = - n \sum n f_{X} (j) < 1.$
The constant function $1$ is integrable on the bounded interval $[- π, π]$ . Thus,
$\int_{- π}^{π} j = - \infty \sum \infty e^{i t (j - k)} f_{X} (j) d t = \int_{- π}^{π} n \to \infty lim j = - n \sum n e^{i t (j - k)} f_{X} (j) d t = n \to \infty lim \int_{- π}^{π} j = - n \sum n e^{i t (j - k)} f_{X} (j) d t = n \to \infty lim j = - n \sum n \int_{- π}^{π} e^{i t (j - k)} f_{X} (j) d t = j = - \infty \sum \infty \int_{- π}^{π} e^{i t (j - k)} f_{X} (j) d t .$

Inverse theorem for discrete random variables

Theorem 16(Theorem).

Let $X$ be a discrete random variable with density $f_{X}$ and characteristic function $φ_{X}$ . Then,
$f_{X} (x) = T \to \infty lim \frac{1}{2 T} \int_{- T}^{T} e^{- i t x} φ_{X} (t) d t .$

Proof.

Note that the support of $X$ is countable.
$T \to \infty lim \frac{1}{2 T} \int_{- T}^{T} e^{- i t x} φ_{X} (t) d t = T \to \infty lim \frac{1}{2 T} \int_{- T}^{T} e^{- i t x} y \in R \sum e^{i t y} f_{X} (y) d t = T \to \infty lim \frac{1}{2 T} y \in R \sum f_{X} (y) \int_{- T}^{T} e^{i t (y - x)} d t = f_{X} (x) + T \to \infty lim \frac{1}{2 T} y \neq = x \sum f_{X} (y) \int_{- T}^{T} e^{i t (y - x)} d t = f_{X} (x) + T \to \infty lim y \neq = x \sum ∣ \cdot ∣ \leq f_{X} f_{X} (y) \frac{sin ( T ( y - x ))}{T ( y - x )} = f_{X} (x) + y \neq = x \sum f_{X} (y) T \to \infty lim \frac{sin ( T ( y - x ))}{T ( y - x )} = f_{X} (x) .$ □

Inverse theorem for continuous random variables

Theorem 17(Theorem).

Let $X$ be a continuous random variable with continuous density $f$ and integrable characteristic function $φ$ ( $\int_{- \infty}^{\infty} ∣ φ (t) ∣ d t < \infty$ ). Then,
$f (x) = \frac{1}{2 π} \int_{- \infty}^{\infty} e^{- i t x} φ (t) d t .$

Proof.

Notice that
$ϵ \to 0 lim \int_{- \infty}^{\infty} ∣ \cdot ∣ \leq ∣ φ (t) ∣ e^{- ϵ^{2} t^{2} /2} e^{- i t x} φ (t) d t = \int_{- \infty}^{\infty} e^{- i t x} φ (t) d t .$
Let’s compute the limit on the left.
$ϵ \to 0 lim \int_{- \infty}^{\infty} e^{- ϵ^{2} t^{2} /2} e^{- i t x} (\int_{- \infty}^{\infty} e^{i y t} f (y) d y) d t = ϵ \to 0 lim \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} e^{- ϵ^{2} t^{2} /2} e^{- i t x} e^{i y t} f (y) d y d t$
To use Fubini’s Theorem to swap the integrals, we must show that the integrand is absolutely integrable.
$\iint_{R^{2}} ∣ e^{- ϵ^{2} t^{2} /2} e^{- i t x} e^{- i y t} f (y) ∣ d y d t = \iint_{R^{2}} e^{- ϵ^{2} t^{2} /2} f (y) d y d t = (\int_{- \infty}^{\infty} e^{- ϵ^{2} t^{2} /2} d t) (\int_{- \infty}^{\infty} f (y) d y) = \frac{2 π}{ϵ} .$
Now, we can swap those pesky integrals:
$ϵ \to 0 lim \int_{- \infty}^{\infty} f (y) \int_{- \infty}^{\infty} e^{- ϵ^{2} t^{2} /2} e^{i t (y - x)} d t d y .$
Substitute $u = ϵ t$ .
$ϵ \to 0 lim \frac{1}{ϵ} \int_{- \infty}^{\infty} f (y) \int_{- \infty}^{\infty} e^{- u^{2} /2} e^{i u (y - x) / ϵ} d u d y .$
If we dress the expression nicely, we’ll see that the inner integral is the characteristic function of the normal distribution evaluated at $(y - x) / ϵ$ , which we have already computed:
$ϵ \to 0 lim \frac{2 π}{ϵ} \int_{- \infty}^{\infty} f (y) \int_{- \infty}^{\infty} \frac{1}{2 π} e^{i (\frac{y - x}{ϵ}) u} e^{- u^{2} /2} d u d y = ϵ \to 0 lim \frac{2 π}{ϵ} \int_{- \infty}^{\infty} f (y) φ_{n (0, 1)} (\frac{y - x}{ϵ}) d y = ϵ \to 0 lim \frac{2 π}{ϵ} \int_{- \infty}^{\infty} f (y) e^{- (y - x)^{2} /2 ϵ^{2}} d y .$
Substitute $v = (y - x) / ϵ$ .
$ϵ \to 0 lim 2 π \int_{- \infty}^{\infty} f (ϵ v + x) e^{- v^{2} /2} d v .$
Passing the limit inside is tricky; since $f$ may not be bounded and $ϵ$ does not vanish on taking absolute value, using the DCT directly is difficult. Instead, we use the DCT on an compact interval $[- R, R]$ (where $f$ is bounded) to show that in $[- R, R]$ , we can take the limit inside. We then show that as we increase $R$ , the integral on $[- R, R]^{c}$ vanishes (requires justification I don’t have time for now).
$2 π f bounded; DCT applicable \int_{- R}^{R} ϵ \to 0 lim f (ϵ v + x) e^{- v^{2} /2} d v + \to 0 as R \to \infty ϵ \to 0 lim \int_{[- R, R]^{c}} f (ϵ v + x) e^{- v^{2} /2} d v .$
Thus, the expression becomes
$f (x) 2 π \int_{- \infty}^{\infty} e^{- v^{2} /2} d v = f (x) .$ □

Note that there does not exist a similar property for Riemann integrals, that is, being absolutely dominated by a Riemann integrable function does not imply Riemann integrability. Even if we assume Riemann integrability in the hypothesis, we cannot conclude that the limit is Riemann integrable (here’s an example). The closest analogue of the DCT in Riemann land does away with the dominating function and requires the sequence of functions to converge uniformly instead. ↩

NoNotes

Graph View

PROB_L10

Distributions of sums and quotients

Distribution of X+Y

Tools

The dominated convergence theorem

Fubini’s Theorem

Characteristic functions

Characteristic functions of common distributions

Characteristic function of sum of independent random variables

Properties of characteristic functions

Bochner’s Theorem

Inverse theorem

Inverse theorem for integer valued random variables

Inverse theorem for discrete random variables

Inverse theorem for continuous random variables

Table of Contents

Backlinks

NoNotes

Graph View

PROB_L10

Distributions of sums and quotients

Distribution of X+Y

Tools

The dominated convergence theorem

Fubini’s Theorem

Characteristic functions

Characteristic functions of common distributions

Characteristic function of sum of independent random variables

Properties of characteristic functions

Bochner’s Theorem

Inverse theorem

Inverse theorem for integer valued random variables

Inverse theorem for discrete random variables

Inverse theorem for continuous random variables

Footnotes

Table of Contents

Backlinks