Independent events

We say two events $A$ and $B$ are independent if $P(A \cap B) = P(A)P(B)$.
Events $A_1, A_2, \ldots, A_n$ are said to be independent if $P(A_1 \cap A_2 \cap \cdots \cap A_n) = P(A_1)P(A_2)\cdots P(A_n)$ and any subcollection of $\{A_1, \ldots, A_n\}$ containing at least two but fewer than $n$ events is itself mutually independent.

Note that if $A_1, A_2, \ldots, A_n$ are pairwise independent, they need not be independent as a collection. For example, let $\Omega = \{1, 2, 3, 4\}$ and let the probability function assign a probability of $1/4$ to each element in $\Omega$. Consider the events $A_1 = \{1, 2\}$, $A_2 = \{1, 3\}$, and $A_3 = \{1, 4\}$. Then $P(A_i) = 1/2$ for each $i$, and $P(A_i \cap A_j) = P(\{1\}) = 1/4 = P(A_i)P(A_j)$ for $i \neq j$. Thus, the $A_i$s are pairwise independent. But, $P(A_1 \cap A_2 \cap A_3) = P(\{1\}) = 1/4 \neq 1/8 = P(A_1)P(A_2)P(A_3)$.

The notion of independence is frequently used to construct probability spaces corresponding to repetitions of the same experiment, where the outcome of each iteration of the experiment is not influenced by the results of the other iterations. Let $(\Omega_i, P_i)$ be a discrete or finite probability space modeling the $i$th iteration of the experiment. The probability space for $n$ iterations of the experiment can be constructed like so: $\Omega = \Omega_1 \times \Omega_2 \times \cdots \times \Omega_n$, where $P$ is defined on the elementary events like so: $P(\{(\omega_1, \omega_2, \ldots, \omega_n)\}) = P_1(\{\omega_1\})P_2(\{\omega_2\})\cdots P_n(\{\omega_n\})$ (it is easy to verify that this definition satisfies the properties of a probability measure). Note that the event $A \subseteq \Omega_i$ occurring in the $i$th iteration would correspond to the event $\Omega_1 \times \cdots \times \Omega_{i-1} \times A \times \Omega_{i+1} \times \cdots \times \Omega_n$ in the new probability space. As we would expect, this construction makes events that belong entirely to different iterations independent.
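As a quick check of that last claim (a sketch using only the product formula above; the notation $\tilde{A}$, $\tilde{B}$ for the lifted events is introduced just for this computation), take $A \subseteq \Omega_i$ and $B \subseteq \Omega_j$ with $i \neq j$, and let $\tilde{A}$ and $\tilde{B}$ denote the corresponding events in the product space. Summing the elementary probabilities over the free coordinates,

$$P(\tilde{A} \cap \tilde{B}) = \sum_{\omega_i \in A} \sum_{\omega_j \in B} P_i(\{\omega_i\}) P_j(\{\omega_j\}) \prod_{k \neq i, j} \sum_{\omega_k \in \Omega_k} P_k(\{\omega_k\}) = P_i(A)\,P_j(B),$$

and the same computation with $B = \Omega_j$ (respectively $A = \Omega_i$) gives $P(\tilde{A}) = P_i(A)$ and $P(\tilde{B}) = P_j(B)$, so $P(\tilde{A} \cap \tilde{B}) = P(\tilde{A})P(\tilde{B})$.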

For example, let the experiment we wish to repeat be a coin toss, with the coin showing heads ($1$) with probability $p$ and tails ($0$) with probability $1 - p$. The probability space modeling a single coin toss is $(\{0, 1\}, P_0)$, where $P_0(\{1\}) = p$ and $P_0(\{0\}) = 1 - p$. The probability space modeling $n$ iterations of the experiment would have a sample space of $n$-tuples consisting of $0$s and $1$s. The probability of an elementary event $\{(\omega_1, \omega_2, \ldots, \omega_n)\}$ would be computed like so:

$$P(\{(\omega_1, \omega_2, \ldots, \omega_n)\}) = \prod_{i=1}^{n} P_0(\{\omega_i\}) = p^{k}(1-p)^{n-k}, \quad \text{where } k \text{ is the number of } 1\text{s in } (\omega_1, \ldots, \omega_n).$$

Going a little further, it is evident that all elementary events which produce the same total number of $1$s have the same probability. Let's associate with every elementary event $\omega$ in $\Omega$ a number $X(\omega)$ which counts the number of $1$s that appear in the event. $X$ is what is called a discrete random variable. One can say that the probability of getting $k$ $1$s is

$$P(X = k) = \binom{n}{k} p^{k}(1-p)^{n-k}.$$
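For a concrete instance (a fair coin, $p = 1/2$, is assumed here purely for illustration), take $n = 3$ and $k = 2$: the elementary events with exactly two $1$s are $(1,1,0)$, $(1,0,1)$ and $(0,1,1)$, each of probability $1/8$, so

$$P(X = 2) = \binom{3}{2}\left(\tfrac{1}{2}\right)^{2}\left(\tfrac{1}{2}\right)^{1} = \frac{3}{8}.$$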


Discrete random variables

Definition 1.

A discrete real valued random variable on a probability space $(\Omega, \mathcal{F}, P)$ is a function $X : \Omega \to \mathbb{R}$ assuming at most countably many values, such that $\{\omega \in \Omega : X(\omega) = x\} \in \mathcal{F}$ for all $x \in \mathbb{R}$.

Important

All random variables/vectors we will deal with before the midsem are going to be discrete ones.

$P(\{\omega \in \Omega : X(\omega) = x\})$ is usually shortened to $P(X = x)$. In the previous example, $\{X = k\}$ would be the event corresponding to getting $k$ ones.

Note that if $X$ is a random variable on $(\Omega, \mathcal{F}, P)$ and $g : \mathbb{R} \to \mathbb{R}$ is any function, then $g(X)$ is also a random variable, since $\{\omega : g(X(\omega)) = y\} = \bigcup_{x : g(x) = y} \{\omega : X(\omega) = x\}$, a countable union of events (the union being over the at most countably many possible values $x$ of $X$ with $g(x) = y$).
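For instance (an illustration, with $g$ chosen arbitrarily), if $X$ takes the values $-1, 0, 1$ and $g(x) = x^2$, then $Y = g(X)$ takes the values $0, 1$, and

$$P(Y = 1) = P(X = -1) + P(X = 1), \qquad P(Y = 0) = P(X = 0).$$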

Discrete density functions

Definition 2.

The real valued function $f$ defined by $f(x) = P(X = x)$ is called the discrete density function or discrete mass function of $X$. A number $x$ is called a possible value of $X$ if $f(x) > 0$.

Properties of the probability mass function that should be obvious:

  • $f(x) \geq 0$ for all $x$, and $f(x) \neq 0$ for at most countably many $x$.
  • $\sum_{x} f(x) = 1$, where the sum runs over the possible values of $X$.

Also, for any function $f$ satisfying the above properties, there exists a probability space and a random variable $X$ with mass function $f$ (take the trivial example $\Omega = \{x : f(x) > 0\}$, $P(\{x\}) = f(x)$ and $X(\omega) = \omega$ to show its existence). This result assures us that statements like “Let $X$ be a random variable with discrete density $f$” always make sense, even if we do not specify directly a probability space upon which $X$ is defined.
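As a small illustration of this trivial construction (the particular density is chosen arbitrarily), take the density of a fair six-sided die,

$$f(x) = \tfrac{1}{6} \text{ for } x \in \{1, 2, \ldots, 6\}, \qquad f(x) = 0 \text{ otherwise};$$

the construction gives $\Omega = \{1, \ldots, 6\}$, $P(\{x\}) = 1/6$ and $X(\omega) = \omega$, which indeed has mass function $f$.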

Binomial distribution

Consider $n$ independent repetitions of a simple success-failure experiment, like the coin tossing one discussed above, where each trial succeeds with probability $p$. Let $X$ denote the number of successes in the $n$ trials. Then, $X$ is a random variable that can only assume the values $0, 1, \ldots, n$. The probability density for such an experiment,

$$f(k) = \binom{n}{k} p^{k}(1-p)^{n-k}, \qquad k = 0, 1, \ldots, n,$$

is called the binomial density with parameters $n$ and $p$.
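As a quick sanity check that this is a valid density (it is nonnegative and sums to $1$ over its possible values), the binomial theorem gives

$$\sum_{k=0}^{n} \binom{n}{k} p^{k}(1-p)^{n-k} = \big(p + (1-p)\big)^{n} = 1.$$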

The outcome of performing $n$ Bernoulli trials with fixed parameter $p$ can be given by the random vector $(X_1, X_2, \ldots, X_n)$, with $X_i = 1$ and $X_i = 0$ signaling success and failure in the $i$th trial respectively. We know that the random variable $X = X_1 + X_2 + \cdots + X_n$ is binomially distributed with parameters $n$ and $p$, as shown above. Turning this around, we can say that any random variable that is binomially distributed with these same parameters can be thought of as the sum of $n$ independent Bernoulli random variables each having parameter $p$.
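To see the sum picture in the smallest nontrivial case (a check, not part of the general argument), take $n = 2$: the event $\{X_1 + X_2 = 1\}$ consists of the outcomes $(1, 0)$ and $(0, 1)$, so

$$P(X_1 + X_2 = 1) = p(1-p) + (1-p)p = \binom{2}{1} p(1-p),$$

matching the binomial density with parameters $2$ and $p$.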

The distribution function

What follows is valid for all probability spaces.

Definition 3.

If $X$ is a random variable on $(\Omega, \mathcal{F}, P)$, define its distribution function $F : \mathbb{R} \to [0, 1]$ by $F(x) = P(X \leq x)$.
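For instance (a Bernoulli random variable with parameter $p$ is used here just as an illustration), if $P(X = 1) = p$ and $P(X = 0) = 1 - p$, then

$$F(x) = \begin{cases} 0, & x < 0, \\ 1 - p, & 0 \leq x < 1, \\ 1, & x \geq 1, \end{cases}$$

a step function that jumps exactly at the possible values of $X$.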

Theorem 4 (Proposition).

  • $F$ is non decreasing.
  • $\lim_{x \to -\infty} F(x) = 0$.
  • $\lim_{x \to +\infty} F(x) = 1$.
  • $\lim_{h \downarrow 0} F(x + h) = F(x)$ for all $x$ (right continuity).

The fourth point is shown by considering any monotone decreasing sequence $(x_n)$ converging to $x$, and observing that

$$\lim_{n \to \infty} F(x_n) = \lim_{n \to \infty} P(X \leq x_n) = P\Big(\bigcap_{n} \{X \leq x_n\}\Big) = P(X \leq x) = F(x),$$

where the continuity properties of the probability measure along monotone sequences of events have been used. A closely related result is:

$$\lim_{h \downarrow 0} F(x - h) = P(X < x).$$

It follows that $P(X = x) = F(x) - \lim_{h \downarrow 0} F(x - h)$, the size of the jump of $F$ at $x$. This will be important when we discuss continuous random variables.
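As a concrete illustration (using a Bernoulli random variable with parameter $p$, chosen just for this example): $F(x) = 1 - p$ for $0 \leq x < 1$ and $F(x) = 1$ for $x \geq 1$, so the jump at $x = 1$ is

$$F(1) - \lim_{h \downarrow 0} F(1 - h) = 1 - (1 - p) = p = P(X = 1).$$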

Also, any function satisfying the four properties in the proposition above is called a distribution function.