Introduction to Probability
Lecture 7: Independence, Covariance and Correlation
Mateja Jamnik, Thomas Sauerwald
University of Cambridge, Department of Computer Science and Technology
email: {mateja.jamnik,thomas.sauerwald}@cl.cam.ac.uk
Independence of Random Variables
This definition covers the discrete and continuous case!
Definition of Independence
Two random variables X and Y are independent if for all values a, b:
P [ X ≤ a, Y ≤ b ] = P [ X ≤ a ] · P [ Y ≤ b ] .
For two discrete random variables, an equivalent definition is:
P [ X = a, Y = b ] = P [ X = a ] · P [ Y = b ] .
This is useless for continuous random variables, since there P [ X = a ] = 0 for every a.
Remark
Using the joint distribution function, the above is equivalent to: for all a, b,
F(a, b) = FX(a) · FY(b).
All these definitions extend in the natural way to more than two variables!
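As a brief aside (not part of the original slides), here is a minimal Python sketch of the discrete product rule: for two independent fair dice we estimate the joint pmf by simulation and compare it to the product of the empirical marginals. The sample size and all names are our own choices.

```python
import random
from itertools import product

random.seed(0)
n = 200_000

# Simulate two independent fair dice and count each outcome pair.
counts = {}
for _ in range(n):
    a, b = random.randint(1, 6), random.randint(1, 6)
    counts[(a, b)] = counts.get((a, b), 0) + 1

# Empirical marginal pmfs of X and Y.
px = {a: sum(counts.get((a, b), 0) for b in range(1, 7)) / n for a in range(1, 7)}
py = {b: sum(counts.get((a, b), 0) for a in range(1, 7)) / n for b in range(1, 7)}

# For independent X, Y the empirical joint pmf should be close to the
# product of the empirical marginals (up to sampling noise).
worst = max(abs(counts.get((a, b), 0) / n - px[a] * py[b])
            for a, b in product(range(1, 7), repeat=2))
print(f"max |P[X=a,Y=b] - P[X=a]*P[Y=b]| ≈ {worst:.4f}")
```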
Factorisation
Factorisation
The definition of independence of X and Y implies the following factorisation formula: for any “suitable” sets A and B,
P [ X ∈ A, Y ∈ B ] = P [ X ∈ A ] · P [ Y ∈ B ] .
For continuous distributions, differentiating both sides of the formula for the joint distribution gives:
fX,Y(x, y) = fX(x) · fY(y) .
Example
Let X and Y be two independent random variables. Let I = (a, b] be any interval and define U := 1X∈I and V := 1Y∈I. Prove U and V are independent.
Answer
Applying the factorisation formula with A = B = I gives
P [ U = 1, V = 1 ] = P [ X ∈ I, Y ∈ I ] = P [ X ∈ I ] · P [ Y ∈ I ] = P [ U = 1 ] · P [ V = 1 ] .
The same argument with the complement I^c in place of I (for X, for Y, or for both) covers the remaining three value pairs, hence U and V are independent.
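A minimal simulation sketch of this example (our own illustration; the interval I = (0.2, 0.6] and X, Y ~ Uniform(0, 1) are arbitrary choices): empirically, P [ U = 1, V = 1 ] matches P [ U = 1 ] · P [ V = 1 ].

```python
import random

random.seed(1)
n = 200_000
a, b = 0.2, 0.6                                # the interval I = (a, b]

hits_u = hits_v = hits_uv = 0
for _ in range(n):
    x, y = random.random(), random.random()    # independent X, Y ~ Uniform(0, 1)
    u = a < x <= b                             # U = 1_{X in I}
    v = a < y <= b                             # V = 1_{Y in I}
    hits_u += u
    hits_v += v
    hits_uv += u and v

p_u, p_v, p_uv = hits_u / n, hits_v / n, hits_uv / n
# Independence of U and V: P[U=1, V=1] should match P[U=1] * P[V=1].
print(f"P[U=1,V=1] ≈ {p_uv:.4f}   P[U=1]·P[V=1] ≈ {p_u * p_v:.4f}")
```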
Buffon’s Needle Problem (1/2)
[Figure: needle on a ruled table; source: Ross, Probability, 8th ed. Portrait: Georges-Louis Leclerc de Buffon, 1707–1788; source: Wikipedia.]
A table is ruled with equidistant, parallel lines a distance D apart.
A needle of length L is thrown randomly on the table.
What is the probability that the needle will intersect one of the two lines?
Let X be the distance from the middle point of the needle to the closest parallel line. The needle intersects a line if the hypotenuse of the triangle is less than L/2, i.e.,
X / cos(θ) < L/2  ⇔  X < (L/2) · cos(θ).
We assume that X ∈ [0, D/2] and θ ∈ [0, π/2] are independent and uniform.
This can be thought of as: 1. sample the middle point of the needle; 2. sample the angle.
Buffon’s Needle Problem (2/2)
Let us compute the probability that the needle intersects a line. Since fX(x) = 2/D on [0, D/2] and fθ(y) = 2/π on [0, π/2], we get:
P [ X < (L/2) · cos(θ) ] = ∫∫_{x < (L/2) cos(y)} fX,θ(x, y) dx dy
= ∫∫_{x < (L/2) cos(y)} fX(x) · fθ(y) dx dy
= (4/(πD)) · ∫_0^{π/2} ∫_0^{(L/2) cos(y)} dx dy
= (4/(πD)) · ∫_0^{π/2} (L/2) · cos(y) dy
= 2L/(πD).
This gives us a method to estimate π!
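A minimal Monte Carlo sketch of this estimator (our own illustration; D, L and the sample size are arbitrary, with L ≤ D): simulate throws using the intersection condition X < (L/2) · cos(θ), then invert p = 2L/(πD) to recover π.

```python
import math
import random

random.seed(2)
D, L = 2.0, 1.0                            # line spacing D, needle length L (L <= D)
n = 1_000_000

hits = 0
for _ in range(n):
    x = random.uniform(0, D / 2)           # distance of midpoint to closest line
    theta = random.uniform(0, math.pi / 2) # acute angle between needle and lines
    if x < (L / 2) * math.cos(theta):      # intersection condition from the slide
        hits += 1

p_hat = hits / n                 # estimates 2L / (pi * D)
pi_hat = 2 * L / (p_hat * D)     # invert the formula to estimate pi
print(f"estimated P ≈ {p_hat:.4f}   estimated pi ≈ {pi_hat:.4f}")
```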
Covariance
Definition of Covariance
Let X and Y be two random variables. The covariance is defined as:
Cov [ X , Y ] = E [ (X − E [ X ]) · (Y − E [ Y ]) ] .
Interpretation:
If Cov [ X , Y ] > 0 and X has a realisation larger (smaller) than E [ X ],
then Y will likely have a realisation larger (smaller) than E [ Y ].
If Cov [ X , Y ] < 0, then it is the other way around.
Alternative Formula
Using linearity of expectation, one obtains the equivalent formula:
Cov [ X , Y ] = E [ X · Y ] − E [ X ] · E [ Y ] .
Note that Cov [ X , X ] = V [ X ].
Two variables X , Y with Cov [ X , Y ] > 0 are positively correlated.
Two variables X , Y with Cov [ X , Y ] < 0 are negatively correlated.
Two variables X , Y with Cov [ X , Y ] = 0 are uncorrelated.
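As an illustration (not from the slides), a short Python sketch computing the covariance of a small, made-up discrete joint pmf with both the definition and the alternative formula; the two results agree.

```python
# A small joint pmf on {0,1} x {0,1}, chosen so that X and Y are
# positively correlated (values are illustrative, not from the slides).
joint = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.1, (1, 1): 0.4}

ex  = sum(x * p for (x, _), p in joint.items())          # E[X]
ey  = sum(y * p for (_, y), p in joint.items())          # E[Y]
exy = sum(x * y * p for (x, y), p in joint.items())      # E[X*Y]

# Definition: Cov[X,Y] = E[(X - E[X]) * (Y - E[Y])]
cov_def = sum((x - ex) * (y - ey) * p for (x, y), p in joint.items())
# Alternative formula: Cov[X,Y] = E[XY] - E[X] * E[Y]
cov_alt = exy - ex * ey

print(cov_def, cov_alt)   # both give 0.15: the two formulas agree
```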
Illustration of 3 Cases for Cov [ X , Y ]
[Figure: 500 outcomes of randomly generated pairs of RVs (X, Y) with different joint distributions. Source: textbook by Dekking et al.]
1. What is the covariance (positive, negative, neutral)?
2. Where is the covariance the largest (in magnitude)?
Independence implies Uncorrelated
Example
Let X and Y be two independent random variables. Then X and Y are
uncorrelated, i.e., Cov [ X , Y ] = 0.
Answer
We give a proof for the discrete case. By independence,
E [ X · Y ] = Σ_a Σ_b a · b · P [ X = a, Y = b ] = Σ_a Σ_b a · b · P [ X = a ] · P [ Y = b ]
= ( Σ_a a · P [ X = a ] ) · ( Σ_b b · P [ Y = b ] ) = E [ X ] · E [ Y ],
and hence Cov [ X , Y ] = E [ X · Y ] − E [ X ] · E [ Y ] = 0.
Uncorrelated may not imply Independence
Example
Find a (simple) example of two random variables X and Y which are uncorrelated but dependent.
Answer
Let X be uniformly sampled from {−1, 0, +1} and Y := 1X =0 .
⇒ X · Y = 0 (for all outcomes), and thus
E [ X · Y ] = 0.
Further, E [ X ] = 0 (and E [ Y ] = 1/3), and hence:
Cov [ X , Y ] = E [ X · Y ] − E [ X ] · E [ Y ] = 0.
On the other hand, P [ X = 0 ] = 1/3 and P [ Y = 0 ] = 2/3, and thus
Intro to Probability 9
Uncorrelated may not imply Independence
Example
Find a (simple) example of two random variables X and Y which are un-
correlated but dependent.
Answer
Let X be uniformly sampled from {−1, 0, +1} and Y := 1X =0 .
⇒ X · Y = 0 (for all outcomes), and thus
E [ X · Y ] = 0.
Further, E [ X ] = 0 (and E [ Y ] = 1/3), and hence:
Cov [ X , Y ] = E [ X · Y ] − E [ X ] · E [ Y ] = 0.
On the other hand, P [ X = 0 ] = 1/3 and P [ Y = 0 ] = 2/3, but X = 0 forces Y = 1, and thus
0 = P [ X = 0, Y = 0 ] ≠ P [ X = 0 ] · P [ Y = 0 ] = 2/9,
so X and Y are dependent.
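A short sketch verifying this example exactly with rational arithmetic (our own illustration): Cov [ X , Y ] = 0, yet P [ X = 0, Y = 0 ] ≠ P [ X = 0 ] · P [ Y = 0 ].

```python
from fractions import Fraction

# The slide's example: X uniform on {-1, 0, +1} and Y = 1_{X = 0}.
pmf = {x: Fraction(1, 3) for x in (-1, 0, 1)}
y = {x: int(x == 0) for x in pmf}               # Y as a function of X

ex  = sum(x * p for x, p in pmf.items())        # E[X]  = 0
ey  = sum(y[x] * p for x, p in pmf.items())     # E[Y]  = 1/3
exy = sum(x * y[x] * p for x, p in pmf.items()) # E[XY] = 0

print("Cov[X,Y] =", exy - ex * ey)              # 0  -> uncorrelated

# Dependence: P[X=0, Y=0] = 0, while P[X=0] * P[Y=0] = 1/3 * 2/3 = 2/9.
p_x0y0 = sum(p for x, p in pmf.items() if x == 0 and y[x] == 0)  # = 0
p_y0   = sum(p for x, p in pmf.items() if y[x] == 0)             # = 2/3
print("P[X=0,Y=0] =", p_x0y0, " P[X=0]*P[Y=0] =", pmf[0] * p_y0)
```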
Variance of Sums and Covariances
Variance of Sum Formula
For any two random variables X , Y ,
V [ X + Y ] = V [ X ] + V [ Y ] + 2 · Cov [ X , Y ] .
Hence if X and Y are uncorrelated variables,
V [ X + Y ] = V [ X ] + V [ Y ].
(This generalises the corresponding formula for the case where X and Y are even independent!)
For any random variables X1, X2, . . . , Xn:
V [ X1 + · · · + Xn ] = Σ_{i=1}^{n} V [ Xi ] + 2 · Σ_{i=1}^{n} Σ_{j=i+1}^{n} Cov [ Xi , Xj ].
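A quick numerical sketch of the variance-of-sum formula (our own illustration; the correlated pair Y = X + noise is an arbitrary choice): the empirical V [ X + Y ] matches V [ X ] + V [ Y ] + 2 · Cov [ X , Y ].

```python
import random

random.seed(3)
n = 200_000

# Sample a positively correlated pair: Y = X + noise, so Cov[X, Y] > 0.
xs, ys = [], []
for _ in range(n):
    x = random.gauss(0, 1)
    xs.append(x)
    ys.append(x + random.gauss(0, 1))

def mean(v):
    return sum(v) / len(v)

def var(v):
    m = mean(v)
    return sum((t - m) ** 2 for t in v) / len(v)

def cov(u, v):
    mu, mv = mean(u), mean(v)
    return sum((s - mu) * (t - mv) for s, t in zip(u, v)) / len(u)

# The identity V[X+Y] = V[X] + V[Y] + 2*Cov[X,Y] holds exactly for
# these empirical (population-style) moments.
lhs = var([x + y for x, y in zip(xs, ys)])
rhs = var(xs) + var(ys) + 2 * cov(xs, ys)
print(f"V[X+Y] ≈ {lhs:.3f}   V[X]+V[Y]+2·Cov ≈ {rhs:.3f}")
```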
Computing Variances of Sums of Uncorrelated Variables
Example
Recall the example where X ∈ {−1, 0, +1} uniformly and Y := 1X=0. Compute V [ X + Y ].
Answer
Since X and Y are uncorrelated, V [ X + Y ] = V [ X ] + V [ Y ]. Now
V [ X ] = E [ X² ] − E [ X ]² = 2/3 − 0 = 2/3 and V [ Y ] = E [ Y² ] − E [ Y ]² = 1/3 − (1/3)² = 2/9,
hence V [ X + Y ] = 2/3 + 2/9 = 8/9.
Correlation Coefficient: Normalising the Covariance
The definition of covariance is not scaling invariant:
If X increases by a factor of α, then Cov [ X , Y ] increases by a factor of α.
⇒ Even if X and Y both increase by a factor of α, Cov [ X , Y ] will change.
(Exercise: by how much?)
Correlation Coefficient
Let X and Y be two random variables. The correlation coefficient ρ(X , Y )
is defined as:
ρ(X, Y) = Cov [ X , Y ] / √( V [ X ] · V [ Y ] ).
If V [ X ] = 0 or V [ Y ] = 0, then it is defined as 0.
Properties:
1. The correlation coefficient is scaling-invariant, i.e.,
ρ(X , Y ) = ρ(α · X , β · Y ) for any α, β > 0.
2. For any two random variables X , Y , ρ(X , Y ) ∈ [−1, 1].
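A minimal sketch of the correlation coefficient and property 1 (our own illustration; the data-generating choices are arbitrary): ρ is unchanged when X and Y are rescaled by positive factors.

```python
import random

random.seed(4)
n = 100_000
xs = [random.gauss(0, 1) for _ in range(n)]
ys = [x + random.gauss(0, 2) for x in xs]    # a positively correlated pair

def mean(v):
    return sum(v) / len(v)

def cov(u, v):
    mu, mv = mean(u), mean(v)
    return sum((s - mu) * (t - mv) for s, t in zip(u, v)) / len(u)

def rho(u, v):
    vu, vv = cov(u, u), cov(v, v)            # V[U] = Cov[U, U]
    if vu == 0 or vv == 0:                   # degenerate case from the slide
        return 0.0
    return cov(u, v) / (vu * vv) ** 0.5

# Property 1 (scaling invariance): for alpha, beta > 0, rho is unchanged.
print(rho(xs, ys))
print(rho([5.0 * x for x in xs], [0.1 * y for y in ys]))   # same value
```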
Range of the Correlation Coefficient
Example
Verify that the correlation coefficient’s range satisfies ρ(X, Y) ∈ [−1, 1].
Answer
We will only prove ρ(X, Y) ≥ −1 (the other direction follows in an analogous way).
Let σX² and σY² denote the variances of X and Y, and σX and σY their standard deviations.
Then:
0 ≤ V [ X/σX + Y/σY ]
= V [ X/σX ] + V [ Y/σY ] + 2 · Cov [ X/σX , Y/σY ]
= V [ X ] / V [ X ] + V [ Y ] / V [ Y ] + 2 · Cov [ X , Y ] / (σX · σY)
= 2 · (1 + ρ(X, Y)).
Rearranging gives ρ(X, Y) ≥ −1.
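A final numerical sketch of the proof’s key identity (our own illustration with arbitrary data): empirically, V [ X/σX + Y/σY ] ≈ 2 · (1 + ρ(X, Y)) ≥ 0.

```python
import random
import statistics as st

random.seed(5)
n = 50_000
xs = [random.gauss(0, 1) for _ in range(n)]
ys = [-x + random.gauss(0, 0.3) for x in xs]   # strongly negatively correlated

mx, my = st.fmean(xs), st.fmean(ys)
cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys)) / n
sx, sy = st.pstdev(xs), st.pstdev(ys)          # population standard deviations
r = cov / (sx * sy)                            # correlation coefficient rho

# Key quantity of the proof: V[X/sx + Y/sy] = 2 * (1 + rho) >= 0.
zs = [a / sx + b / sy for a, b in zip(xs, ys)]
print(f"rho ≈ {r:.3f}   V[X/σX + Y/σY] ≈ {st.pvariance(zs):.3f}   "
      f"2(1+rho) ≈ {2 * (1 + r):.3f}")
```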