Chapter 06
Chapter 06
Unit Six
Probability Distributions
seid.belay@aastu.edu.et 19-May-25
2 Introduction
Probability distributions are fundamental concepts in statistics. They are
used both on a theoretical level and a practical level.
Used for
Confidence interval computations
Hypothesis testing
Modeling of data
Simulations …
seid.belay@aastu.edu.et 19-May-25
3 Random Variable (RV)
Given random experiment with sample space S. Then a random variable is
a real valued function that maps elements of S to real number.
X
S ՜ 𝑅 X is Called RV, X S = R, R is called the range of X
seid.belay@aastu.edu.et 19-May-25
6 Probability Density Function(pdf)
𝐛
Let X be continuous RV and 𝑓 be a function such that 𝐏 𝐚 < 𝐗 < 𝐛 = 𝐱𝐝 𝐱 𝐟 𝐚
𝑓𝑜𝑟 𝑎𝑙𝑙 𝑎 < 𝑏. The function f is called probability density function of the random
variable X IF IT SATISFIES .
i. 𝑓 𝑥 ≥ 0; 𝑎𝑛𝑑
∞
ii. −∞ 𝑓 𝑥 𝑑𝑥 = 1
2
2. Let X be a continuous random variable with pdf 𝑓 𝑥 = ቊ3𝑥 𝑖𝑓 0 < 𝑥 < 1
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
2 1
Find 𝑃(𝑋 < |𝑋 > )
3 3
seid.belay@aastu.edu.et 19-May-25
8 Distribution Function
Let X be a RV with pdf/or pmf/ f, then the function
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) is called Distribution Function/ or cumulative probability distribution function.
𝑓(𝑥𝑖) 𝑓𝑜𝑟 𝑝𝑚𝑓 𝑓
𝑥𝑖 ∈𝑋,𝑥𝑖 ≤𝑥
𝐹 𝑥 = 𝑃 𝑋 ≤𝑥 = 𝑥
න 𝑓 𝑥 𝑑𝑥 𝑓𝑜𝑟 𝑝𝑑𝑓 𝑓
−∞
P (a ≤ x ≤ b) = F(b) - F(a) for any real two constants a and b with a ≤ b, and
𝑑𝐹(𝑥)
𝑓 𝑥 = where the derivative exist
𝑑𝑥
seid.belay@aastu.edu.et 19-May-25
10 Introduction to Expectation: Mean, Variance
The expectation of a random variable X is often called mean and denoted by
E(X) which gives the long run average value of X.
seid.belay@aastu.edu.et 19-May-25
11 Examples
A game involves rolling a Korean die (4 faces). If a one, two, or three shows,
the player receives the face value of the die in dollars, but if a four shows,
the player is obligated to pay $4. What is the expected value of the game?
During her four years at college, Niki received A's in 30% of her courses, B's
in 60% of her courses, and C's in the remaining 10%. If A = 4, B = 3, and C =
2, find her grade point average.
seid.belay@aastu.edu.et 19-May-25
12 Example
The distribution of the amount of gravel (in tons) sold by a
particular construction supply company in a given week is a
3
1 − 𝑥 2 𝑓𝑜𝑟 0 ≤ 𝑥 ≤ 1
continuous RV X with pdf 𝑓 𝑥 = ൝2
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
𝑓(𝑥)
Find the expected amount of gravel sold.
1.5
1 3 3 1
Sol. 𝐄 𝐱 = 0 𝑥 × 1− 𝑥2 𝑑𝑥 = 𝑥 − 𝑥 3 𝑑𝑥
2 2 0
3 𝑥2
1 3 1 1𝑥4
= × = × − = 3/8
−
2 0 2
2 2 44
0 1 x
The expected amount of gravel sold in a given week is 3/8 tons.
seid.belay@aastu.edu.et 19-May-25
13 Example
Example: The time it takes to repair a personal computer is a random variable
whose density, in hours, is given by
1
𝑓 𝑥 = ቐ2 𝑖𝑓 0 < 𝑥 < 2
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
The cost of the repair depends on the time it takes and is equal to 40 + 30√𝑥
when the time is x. Compute the expected cost to repair a personal
computer.
seid.belay@aastu.edu.et 19-May-25
14 Properties of expectation
For random variables X, and Y, and constants k, a, and b
If X = K, E(X) = K
E(aX + b) = a E(X) + b
E(X ± Y) = E(X) ± E(Y).
𝐄(𝐗𝐘) = 𝐄(𝐗)𝐄(𝐘) if X and Y are independent
seid.belay@aastu.edu.et 19-May-25
Variance
15 Let X is a random variable. The variance of X, denoted by 𝑉(𝑋) 𝑜𝑟 𝑉𝑎𝑟(𝑋) or 𝛿𝑥2 , defined
2.
as: 𝑉 𝑋 = 𝐸 𝑋 – 𝐸 𝑋 The positive square root of 𝑉(𝑋) is called the
𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑋 and denoted by 𝜎𝑥 .
3 𝑥 3 5
𝑥 1 3 1 1
P .3 .6 .1 1 = 2× − = × − =1/5
3 5 0 2 3 5
X*p .9 2.4 .5 3.8 1
Therefore, Var X = 𝐸 𝑥2 − 𝐸 𝑥 2 = −
𝐱 𝟐 ×P 2.7 9.6 2.5 14.6 5
3 2 19
E(X) =3.8 E(𝐱 𝟐 ) =14.6 Var(x) =0.16 = 320 ≈ 0.06
seid.belay@aastu.edu.et 8
Standard deviation = 𝑉𝑎𝑟(𝑋) = 0.06 ≈ 0.243
19-May-25
16 Properties of variance
For a constant k, random variables X and Y with finite variance,
Var(X + K) = Var(X)
Var(KX) = K2 Var(X)
𝑽𝒂𝒓 𝑿 ± 𝒀 = 𝑽𝒂𝒓 𝑿 + 𝑽𝒂𝒓(𝒀) When X and Y are independent
For any real a , Var(X) = E[(X – a)2] – [E(X) – a]2.
Example:
1. If X is RV Satisfying E[(X – 1)2] = 10, E[(X – 2)2] = 6.
Find (a) E(X) (b) Var(X) (c) Standard deviation of X?
3𝑥 2 𝑖𝑓 0 < 𝑥 < 1
2. Let X be a continuous random variable with pdf 𝑓 𝑥 = ቊ
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
2
If 𝑌 = 𝑋 + 3 find 𝑉𝑎𝑟 (𝑌)
seid.belay@aastu.edu.et 19-May-25
17 Moments and Moment Generating Functions(MGF)
Let X be a random variable with pmf/ or pdf f: 𝒙𝒏 𝒇 𝒙 𝒊 𝒇𝒐𝒓 𝒑𝒎𝒇 𝒇
The n-th moment about origin is X is defined as: 𝝁′ 𝒏 = 𝑬 𝑿𝒏 = 𝒊=𝟏
∞
න 𝒙𝒏 𝒇 𝒙 𝒅𝒙 𝒇𝒐𝒓 𝒑𝒅𝒇 𝒇
−∞
Moment Generating Function of X is Denoted by 𝑴𝑿 (𝒕). Defined as: 𝑴𝑿 𝒕 = 𝑬 𝒆𝒕𝒙
𝒕𝟐 𝒙𝟐𝒊 𝒕𝒏 𝒙𝒏
σ𝒊=𝟏 𝒆𝒕𝒙𝒊 𝒇 𝒙𝒊 = σ𝒊=𝟏 𝟏 + 𝒕𝒙𝒊 + + ⋯+ 𝒊
+ ⋯ 𝒇 𝒙𝒊 , 𝒇 𝒊𝒔 𝒑𝒎𝒇
𝟐! 𝒏!
𝑬 𝒆𝒕𝒙 =
∞ ∞ 𝒕𝟐 𝒙𝟐 𝒕𝒏 𝒙𝒏
−∞ 𝒆𝒕𝒙 𝒇 𝒙 𝒅𝒙 = −∞ 𝟏 + 𝒕𝒙 +
𝟐!
+ ⋯+
𝒏!
+ ⋯ 𝑓 𝑥 𝑑𝑥 , 𝒇 𝒊𝒔 𝒑𝒅𝒇
𝑡 2 𝝁′ 2 𝑡 𝑛 𝝁′ 𝑛
=1+ 𝑡𝝁′ 1 + 2! + ⋯ + 𝑛! + ⋯
𝒅𝒏 𝑴𝑿
✓ | = 𝝁′ 𝑛 = 𝑬 𝑿𝒏
𝒅𝒕𝒏 𝒕=𝟎
seid.belay@aastu.edu.et 19-May-25
19 Bernoulli & Binomial Distributions
Bernoulli experiment:
Consists of a random experiment with two possible outcomes; (a success or a failure)
The probability of a success, denoted by p, probability of failure=1-p .
X=The number of successes
o 𝑷 𝑿 = 𝒙 = 𝒑𝒙 𝟏 − 𝒑 𝟏−𝒙 ; 𝑥 = 0, 1
X 1 0 o E(x) = P, Var(x) = p(1-p)
P(X=x) p 1-p o 𝑴𝑿 𝒕 = 𝟏 − 𝒑 + 𝒑𝒆𝒕
Called Bernoulli Distribution Mean=p, Variance= p(1-p)
Binomial Distribution
Random experiment with two outcomes - a success with probability p, or a failure with 1-p prob.
Consists of n repeated independent trials; p remains constant from trial to trial.
X=The number of successes in n trials, called binomial variable & take values 0,1, …, n
𝒏 𝒙 𝒏−𝒙
Binomial Distribution o 𝑷 𝑿=𝒙 =
𝒙
𝒑 𝟏−𝒑 ; 𝑥 = 0, 1, … , 𝑛
❖ Example 2: Nine percent of men and 0.25% of women cannot distinguish between the
colors red and green. This is the type of color blindness that causes problems with traffic
signals. If six men are randomly selected for a study of traffic signal perceptions, find the
probability that exactly two of them cannot distinguish between red and green.
seid.belay@aastu.edu.et 19-May-25
21 Binomial …
❖ Example 3: The probability that a student passes statistics exam is 0.8. If there are
50 students in a class, what is the probability that
seid.belay@aastu.edu.et 19-May-25
22 Poisson Distribution
Developed by the French mathematician Simeon Denis Poisson in 1837.
A Poisson variable Counts number of events occurring in time interval/space and satisfies:
The number of successes in two disjoint time intervals is independent
The probability of a success during a small time interval is proportional to the entire length of
the time interval
- number of typing errors on a page
Applications:
- birth defects and genetic mutations
- number of car accidents …
Probability Distribution of Poison RV:
𝒆−𝝀 𝝀𝒊
𝑷 𝑿=𝒊 = ; 𝒊 = 𝟎, 𝟏, 𝟐, 𝟑, …
𝒊!
𝒕
Mean = E(x)= 𝝀 Variance =𝑬 𝒙𝟐 − 𝑬 𝒙 𝟐 =𝝀 𝑴𝑿 𝒕 = 𝒆𝝀(𝒆 −𝟏)
Example: A life insurance salesman sells on the average 3 life insurance policies per week. Use
Poisson's law to calculate the probability that in a given week he will sell
a) Some policies b) at least 2 at most 5 policies c) Assuming that there are 5 working days per
week,seid.belay@aastu.edu.et
what is the probability that in a given day he will sell one policy? 19-May-25
23 Poisson …
For a Poisson Variable X:
Your friend sends you text messages at a daily Poisson rate of 𝝀𝟏 =3 and your dad
independently sends you text massages at a daily rate of 𝝀𝟐 =1. suppose that you
don’t receive messages from other source. What is the probability that you will
receive exactly 5 messages tomorrow?
seid.belay@aastu.edu.et 19-May-25
24 Exercise
Let’s say we have two independent random Poisson variables for requests received at a
web server in a minute: X = # requests from humans/minute, X ∼ Poi(5) and Y = # requests
from bots/minute, Y ∼ Poi(3).
b) Given that we have 10 requests in a given minute, what is the probability that 7 of the
requests are human requests?
seid.belay@aastu.edu.et 19-May-25
25 Poisson Approximation To Binomial
For large n, calculating probabilities for binomial distribution is Nasty.
The Good Thing:
In a binomial distribution if 𝑛 ՜ ∞, 𝑎𝑛𝑑 𝑝 ՜ 0, 𝑛𝑝 ՜ 𝑎 𝑐𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝜆
𝑥 𝑛 −𝑥
𝑛 𝑥 𝑛−𝑥
𝑛 × 𝑛 − 1 … (𝑛 − 𝑥 + 1) 𝜆 𝜆 𝜆
lim 𝑝 1−𝑝 = lim 1− 1−
𝑛՜∞ 𝑥 𝑛՜∞ 𝑥! 𝑛 𝑛 𝑛
x=terms
𝐵𝑖𝑛𝑜. 𝐷𝑖𝑠𝑡.
𝜆𝑥 𝑛× 𝑛−1 …(𝑛−𝑥+1) 𝜆 𝑛 𝜆 −𝑥
= × lim × lim 1− × lim 1 − =
𝑥! 𝑛՜∞ 𝑛𝑥 𝑛՜∞ 𝑛 𝑛՜∞ 𝑛
𝜆𝑥 𝑒 −𝜆
≅1 ≅ 𝑒 −𝜆 ≅1
≅
𝑥!
x=terms
𝑃𝑜𝑖𝑠. 𝐷𝑖𝑠𝑡.
Example2: A 747 plane has 360 seats. An airline has found that 1% of people with
tickets do not turn up. Find the probability that more than 4 people do not turn up for a
flight.
seid.belay@aastu.edu.et 19-May-25
27 Gaussian Distribution / Normal Distribution
The most important probability distribution in statistics – It fits many phenomenon
Important in hypothesis testing, regression, inference
It is symmetric, bell shaped, asymptotic to x-axis.
Mean = Mode = Median
For normal random variable X with mean 𝜇 and variance 𝜎 2
𝒙−𝝁 𝟐 𝟏
𝟏 − 𝝁𝒕+𝟐𝒕𝟐 𝝈𝟐
PDF: 𝒇 𝒙 = 𝒆 𝟐𝝈𝟐 , 𝑥 ∈ (−∞, ∞) With MGF 𝑴𝑿 𝒕 = 𝒆
𝝈 𝟐𝝅
seid.belay@aastu.edu.et 19-May-25
28 Standard Normal Distribution
A special normal distribution where the mean is 0 and the
standard deviation is 1.
𝑧 2
1 −2
Pdf: 𝑓 𝑧 = 𝑒 , 𝑧 ∈ (−∞, ∞)
2𝜋
𝑋− 𝜇
Every Normal distribution can be transformed to it with 𝑍 = 𝜎
seid.belay@aastu.edu.et 19-May-25
30 Example (Cont’d)
Find the probability that diameter (d)
i. Exceeds 17.5cm
ii. Falls between 11cmand 14cm Z
iii. Falls between 12cm and 18cm
iv. The third quartile value q
Sol.
17.5−15
o 𝑃 𝑑 > 17.5 = 𝑃 𝑧 >
3
= 𝑃(𝑍 > 0.83) Z
11−15 14−15
o 𝑃 11 < 𝑑 < 14 = 𝑃 <𝑧<
3 3
seid.belay@aastu.edu.et 19-May-25 d
31 Example
The score of students in an exam is normally distributed with mean 75 and standard deviation of
10. If the professor decided to assign grade A for the top 10% scorers, what is the minimum score to
get Grade A?
The probability that a river flow exceeds 2,000 cubic meters per second is 12.3%. The coefficient of
variation of these flows is 20%. Assuming a normal distribution, calculate
a) the mean of the flow.
b) the standard deviation of the flow.
c) the probability that the flow will be between 1300 and 1900 m3/s
The Scholastic Aptitude Test mathematics test scores across the population of high school seniors
follow a normal distribution with mean 500 and standard deviation 100. If five seniors are randomly
chosen, find the probability that
a) all scored below 600 and
b) exactly three of them scored above 640
seid.belay@aastu.edu.et 19-May-25
32 Normal Approximation to Binomial
Consider Tossing of a coin balanced 20 times
and consider the number of heads observed(x)
Considering the binomial,
20
P(X=10) = 0.510 1 − 0.5 20−10 = 0.176197
10
The shape of the distribution very close to normal
shape
b) Find the probability that next year’s precipitation will exceed that of the following
year by more than 3 inches. Assume that the precipitation totals for the next 2 years
are independent.
seid.belay@aastu.edu.et 19-May-25
35 Example
History suggests that scores on the Math portion of the Standard Achievement Test
(SAT) are normally distributed with a mean of 529 and a variance of 5732. History
also suggests that scores on the Verbal portion of the SAT are normally distributed
with a mean of 474 and a variance of 6368. Select two students at random.
Let X denote the first student's Math score, and let Y denote the second student's
Verbal score. What is P(X>Y)?
0.6915
seid.belay@aastu.edu.et 19-May-25
36 Exponential Distribution
A random variable X is said to be
exponentially distributed with rate 𝜆 if its pdf is
−𝝀𝒙
𝒇 𝒙; 𝝀 = ቊ 𝝀𝒆 𝒙≥𝟎
𝟎 𝒐𝒕𝒉𝒆𝒓𝒘𝒊𝒔𝒆
Its Mean and Variance respectively:
1 1
𝑎𝑛𝑑
𝜆 𝜆2
Example: The time (in hours) required to repair a machine is an exponentially distributed
random variable with parameter λ = 1.
(a) What is the probability that a repair time exceeds 2 hours?
(b) What is the conditional probability that a repair takes at least 3 hours, given that its
duration exceeds 2 hours?
seid.belay@aastu.edu.et 19-May-25
37 T-Distribution
It’s a distribution very similar to standard
normal distribution
− 𝝊+𝟏
𝒙𝟐 𝟐
𝟏+
Its pdf:
𝝊
𝑓 𝑥 = ; 𝑥 𝑖𝑠 𝑟𝑒𝑎𝑙
𝐵 0.5,0.5𝜐 𝝊
𝟏
Where 𝑩 𝜶, 𝜷 = 𝜶𝒕 𝟎−𝟏 𝟏 − 𝒕 𝜷−𝟏 𝒅𝒕 is beta
function
Mean= Mode =Median = 0
𝜈
Variance = ;𝜈 >2
𝜈−2
Example: If T has a t-distribution with 8 degrees
of freedom, find
(a) P{T ≥ 1.108}
(b) P{T ≤ 1.860}, and
(c) P{-1.108 < T < 1.108}
seid.belay@aastu.edu.et 19-May-25
= 2
22 Γ
2
2
2
∞
Where Γ 𝑎 = 0 𝑡 𝑎−1 𝑒 −𝑡 𝑑𝑡 𝑖𝑠 𝑔𝑎𝑚𝑚𝑎 𝑓𝑢𝑛𝑐𝑡𝑖𝑜𝑛
Mean = 𝜈 = Degree of Freedom = n-1
Variance = 2 𝜈
𝑃 𝝌𝟐 > 𝝌𝟐𝟎.𝟏,𝟗 = 𝟎. 𝟏, 𝒘𝒉𝒆𝒓𝒆 𝝌𝟐𝟎.𝟏,𝟗 = 𝟏𝟒. 𝟔𝟖𝟒
𝑃 𝝌𝟐 > 𝝌𝟐𝟎.𝟗𝟕𝟓,𝟔 = 𝟎. 𝟗𝟕𝟓 where 𝝌𝟐𝟎.𝟗𝟕𝟓,𝟔 = 𝟏. 𝟐𝟑𝟕
Use Case: seid.belay@aastu.edu.et 19-May-25
seid.belay@aastu.edu.et 19-May-25
49
seid.belay@aastu.edu.et 19-May-25