Chapter 1
Conditional probability
1.1
Total Probability theorem
If the events \(A_1, \dots, A_n\) partition the sample space, then
\[ P(B) = P(A_1)P(B \mid A_1) + \dots + P(A_n)P(B \mid A_n) \]
1.2
Bayes Rule
A prior probability is the probability assigned before we have any evidence. A posterior probability is
the probability after the evidence is taken into account. P(B|A) is the probability of B given that event A has happened.
This doesn't mean that P(A) = 1. Also, conditioning doesn't change the prior probability.
Bayes Rule in the discrete case
\[ P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)} \]
Bayes Rule in the continuous case
\[ f_{X|Y}(x \mid y) = \frac{f_{Y|X}(y \mid x)\,f_X(x)}{f_Y(y)} \]
Bayes rule also works for a mix of Discrete and Continuous random variables
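The discrete form of Bayes rule, combined with the total probability theorem, can be sketched numerically. The events and all the numbers below are hypothetical (two machines A1 and A2, and B = "a part is defective"); they are only there to make the arithmetic concrete.

```python
# Hypothetical setup: the priors P(A_i) and likelihoods P(B | A_i) are assumed values.
priors = {"A1": 0.6, "A2": 0.4}
likelihoods = {"A1": 0.01, "A2": 0.05}


def total_probability(priors, likelihoods):
    """P(B) = sum over i of P(A_i) * P(B | A_i)."""
    return sum(priors[a] * likelihoods[a] for a in priors)


def posterior(priors, likelihoods):
    """P(A_i | B) = P(B | A_i) * P(A_i) / P(B) for every A_i."""
    p_b = total_probability(priors, likelihoods)
    return {a: priors[a] * likelihoods[a] / p_b for a in priors}


p_b = total_probability(priors, likelihoods)
post = posterior(priors, likelihoods)
print(p_b)   # probability of the evidence
print(post)  # posterior probabilities, which sum to 1
```

Note that the priors are not modified by the calculation; the posterior is a separate distribution.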
1.3
Joint and Marginal PDF
The marginal PDFs of X and Y can be calculated from the joint PDF, using
the formulas
\[ f_X(x) = \int_{-\infty}^{\infty} f_{X,Y}(x, y)\,dy \]
\[ f_Y(y) = \int_{-\infty}^{\infty} f_{X,Y}(x, y)\,dx \]
Chapter 2
Continuous Random
Variables
2.1
Expected Value
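For a single outcome, the expected value is the probability of that outcome times the value of the outcome. If the probability of winning a lottery is 0.01 and the
prize is 100 dollars, then the expected winnings are 100 * 0.01 = 1 dollar.
For a continuous random variable X with PDF f,
\[ E[X] = \int_{-\infty}^{\infty} x f(x)\,dx \]
Equivalently, in terms of tail probabilities,
\[ E[X] = \int_{0}^{\infty} P(X > y)\,dy - \int_{-\infty}^{0} P(X < y)\,dy \]
Expected value of a function of a Random Variable
\[ E[g(X)] = \int_{-\infty}^{\infty} g(x) f(x)\,dx \]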
Linearity of Expectations
E[aX + b] = aE[X] + b
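The formulas of this section can be checked numerically. The sketch below assumes an exponential random variable with a hypothetical rate of 0.5 (so the mean is 2) and uses a simple midpoint rule; the helper name `midpoint_integral` and the truncation point 80 are choices made here, not part of the notes.

```python
import math


def midpoint_integral(func, lower, upper, steps=200_000):
    """Midpoint-rule approximation of the integral of func over [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


rate = 0.5      # assumed rate of the exponential distribution
upper = 80.0    # truncation point; the tail beyond it is negligible


def pdf(x):
    return rate * math.exp(-rate * x)


def tail(y):
    """P(X > y) for the exponential distribution."""
    return math.exp(-rate * y)


# E[X] from the definition and from the tail formula (X is non-negative,
# so the second integral in the tail formula vanishes).
mean_from_pdf = midpoint_integral(lambda x: x * pdf(x), 0.0, upper)
mean_from_tail = midpoint_integral(tail, 0.0, upper)

# Linearity: E[3X + 1] computed as E[g(X)] with g(x) = 3x + 1.
mean_of_linear = midpoint_integral(lambda x: (3.0 * x + 1.0) * pdf(x), 0.0, upper)

print(mean_from_pdf, mean_from_tail, mean_of_linear)
```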
2.2
Variance
Variance is the expected value of the function \(g(X) = (X - E[X])^2\).
\[ \mathrm{Var}(X) = E[(X - E[X])^2] \]
\[ \mathrm{Var}(X) = E[X^2] - (E[X])^2 \]
\[ \mathrm{Var}(aX + b) = a^2\,\mathrm{Var}(X) \]
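The three variance formulas can be verified on a small example. A discrete PMF is used here only because it keeps the arithmetic exact; the values and probabilities are hypothetical.

```python
# Hypothetical PMF: value -> probability (illustrative numbers only).
pmf = {1.0: 0.2, 2.0: 0.5, 4.0: 0.3}


def expect(func, pmf):
    """E[func(X)] for a discrete random variable with the given PMF."""
    return sum(func(value) * prob for value, prob in pmf.items())


mean = expect(lambda v: v, pmf)
var_definition = expect(lambda v: (v - mean) ** 2, pmf)   # E[(X - E[X])^2]
var_shortcut = expect(lambda v: v ** 2, pmf) - mean ** 2  # E[X^2] - (E[X])^2

# Var(aX + b) computed directly from the definition, with assumed a = 3, b = 1.
scale, shift = 3.0, 1.0
mean_transformed = expect(lambda v: scale * v + shift, pmf)
var_transformed = expect(lambda v: (scale * v + shift - mean_transformed) ** 2, pmf)

print(var_definition, var_shortcut, var_transformed)
```

The shift b drops out of the variance, and the scale a enters squared.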
Chapter 3
Further topics on RVs
3.1
Derived Distributions
We know the distribution of X, and we want the distribution of Y = g(X). A general method of solving this type of
problem is
1. Find the CDF of Y: \(F_Y(y) = P(Y \le y)\)
2. Substitute Y with g(X)
3. Rewrite \(F_Y\) in terms of \(F_X\)
4. Differentiate to get the PDF \(f_Y\)
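The four steps above can be sketched in code. The example distribution is an assumption made for illustration: X is Uniform(0, 1) and Y = X^2, so that F_Y(y) = F_X(sqrt(y)) = sqrt(y) and the PDF is 1 / (2 sqrt(y)) on (0, 1). The derivative in step 4 is taken numerically.

```python
import math


def cdf_x(x):
    """CDF of X ~ Uniform(0, 1) (assumed example distribution)."""
    return min(max(x, 0.0), 1.0)


def cdf_y(y):
    """Steps 1 to 3: F_Y(y) = P(X^2 <= y) = P(X <= sqrt(y)) = F_X(sqrt(y))."""
    if y <= 0.0:
        return 0.0
    return cdf_x(math.sqrt(y))


def pdf_y(y, step=1e-6):
    """Step 4: differentiate F_Y numerically with a central difference."""
    return (cdf_y(y + step) - cdf_y(y - step)) / (2.0 * step)


y = 0.25
numeric_pdf = pdf_y(y)
closed_form_pdf = 1.0 / (2.0 * math.sqrt(y))
print(numeric_pdf, closed_form_pdf)
```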
3.2
Convolution : Sum of two Random variables
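Z = X + Y
E[Z] = E[X] + E[Y] always holds by linearity, whether or not X and Y are independent.
When X and Y are independent, the PDF of Z is the convolution of the two PDFs:
\[ f_Z(z) = \int_{-\infty}^{\infty} f_X(x)\,f_Y(z - x)\,dx \]
i.e. take all the (x, y) pairs whose sum is z. This formula is valid for independent RVs.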
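As a minimal sketch of the convolution formula, assume X and Y are independent Uniform(0, 1) random variables (an example chosen here, not taken from the notes). The density of the sum is then the triangle z on [0, 1] and 2 - z on [1, 2], which the numerical integral reproduces.

```python
def uniform_pdf(x):
    """PDF of Uniform(0, 1) (assumed example distribution)."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0


def convolve_at(z, pdf_x, pdf_y, lower=0.0, upper=1.0, steps=10_000):
    """Approximate f_Z(z) = integral of pdf_x(x) * pdf_y(z - x) dx by the midpoint rule.

    The integration range [lower, upper] should cover the support of pdf_x.
    """
    width = (upper - lower) / steps
    total = 0.0
    for i in range(steps):
        x = lower + (i + 0.5) * width
        total += pdf_x(x) * pdf_y(z - x)
    return total * width


density_at_half = convolve_at(0.5, uniform_pdf, uniform_pdf)
density_at_one = convolve_at(1.0, uniform_pdf, uniform_pdf)
density_at_one_and_half = convolve_at(1.5, uniform_pdf, uniform_pdf)
print(density_at_half, density_at_one, density_at_one_and_half)
```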
Sum of two normal distributions
A = X + Y, where X and Y are independent normal. Then A is also normal with
E[A] = E[X] + E[Y] and var(A) = var(X) + var(Y). Do not use this formula
when X and Y are not independent.
3.3
Conditional expectations
E[X|Y=y] is the expected value of X, when Y = y. Due to the condition, E[X|Y]
is a function of Y.
Law of iterated Expectations: E[E[X|Y]] = E[X]
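The law of iterated expectations can be checked on a small joint PMF. The table below is hypothetical; the point is only that averaging the conditional means E[X|Y=y] with weights P(Y=y) gives back E[X].

```python
# Hypothetical joint PMF P(X = x, Y = y); the numbers are illustrative only.
joint_pmf = {(0, 0): 0.1, (1, 0): 0.3, (0, 1): 0.4, (1, 1): 0.2}


def marginal_y(joint):
    """P(Y = y), obtained by summing the joint PMF over x."""
    pmf = {}
    for (_, y), prob in joint.items():
        pmf[y] = pmf.get(y, 0.0) + prob
    return pmf


def conditional_mean_x(joint, y_value):
    """E[X | Y = y_value]: a number for each y, hence a function of Y."""
    p_y = sum(prob for (_, y), prob in joint.items() if y == y_value)
    return sum(x * prob for (x, y), prob in joint.items() if y == y_value) / p_y


pmf_y = marginal_y(joint_pmf)
mean_x = sum(x * prob for (x, _), prob in joint_pmf.items())
iterated_mean = sum(p_y * conditional_mean_x(joint_pmf, y) for y, p_y in pmf_y.items())
print(mean_x, iterated_mean)
```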
3.4
Covariance and correlation
Covariance
Cov(X,Y) = E[(X-E[X])(Y-E[Y])]
Cov(X,Y) = E[XY] - E[X]E[Y]
Properties of Covariance
If X and Y are independent then Cov(X,Y)=0. The converse is not always true.
Cov(X,X) = Var(X)
Cov(aX + b , Y) = aCov(X,Y)
Cov(X , Y + Z) = Cov(X,Y) + Cov(X,Z)
Variance of a sum of RVs
\[ \mathrm{Var}\left(\sum_{i=1}^{n} X_i\right) = \sum_{j=1}^{n} \mathrm{Var}(X_j) + \sum_{(i,j),\, i \neq j} \mathrm{Cov}(X_i, X_j) \]
Variance of a sum = sum of the variances + sum of the covariances over all ordered pairs (i, j) with \(i \neq j\).
If the Xi are independent, then every covariance is 0 and the variance of the sum = sum of the variances.
Correlation coefficient: \(\rho(X, Y) = \frac{\mathrm{Cov}(X, Y)}{\sigma_X \sigma_Y}\)
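For two variables, the variance-of-a-sum formula reads Var(X + Y) = Var(X) + Var(Y) + 2 Cov(X, Y), because the ordered pairs (1, 2) and (2, 1) each contribute one covariance term. The sketch below checks this, and computes the correlation coefficient, on a hypothetical joint PMF.

```python
import math

# Hypothetical joint PMF P(X = x, Y = y); the numbers are illustrative only.
joint_pmf = {(0, 0): 0.1, (1, 0): 0.3, (0, 1): 0.4, (1, 1): 0.2}


def expect(func, joint):
    """E[func(X, Y)] under the joint PMF."""
    return sum(func(x, y) * prob for (x, y), prob in joint.items())


mean_x = expect(lambda x, y: x, joint_pmf)
mean_y = expect(lambda x, y: y, joint_pmf)
var_x = expect(lambda x, y: (x - mean_x) ** 2, joint_pmf)
var_y = expect(lambda x, y: (y - mean_y) ** 2, joint_pmf)
cov_xy = expect(lambda x, y: (x - mean_x) * (y - mean_y), joint_pmf)

# Variance of the sum, computed directly and through the formula.
var_sum_direct = expect(lambda x, y: (x + y - mean_x - mean_y) ** 2, joint_pmf)
var_sum_formula = var_x + var_y + 2.0 * cov_xy

rho = cov_xy / math.sqrt(var_x * var_y)
print(cov_xy, var_sum_direct, var_sum_formula, rho)
```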
3.5
The Conditional Variance
Law of total variance
var(X) = E[var(X|Y)] + var(E[X|Y])
The inner terms var(X|Y) and E[X|Y] are random variables that are functions of Y. So, they have an expected value
as well as a variance.
How are these useful? I know how to deal with the sum of a fixed number of random variables.
But what about the sum of a random number of random variables, i.e. when the
number of random variables is itself a random variable?
Example. I go shopping.
N: Number of shops visited
Xi: Money spent in shop i
Y = X1 + ... + XN. Note that N is itself a RV. Assume the Xi are independent of each other and of N, and all have the same distribution as X.
Find E[Y], i.e. the expected value of the total money spent. I know how
to deal with this if I know the number of shops that were visited. So, I will
condition on that.
E[Y |N = n] = E[X1 + ... + Xn |N = n]
= E[X1 + ... + Xn ]
= nE[X]
By the Total Expectation Theorem
\[ E[Y] = \sum_n p_n\,E[Y \mid N = n] = \sum_n p_n\,n\,E[X] = E[X] \sum_n p_n\,n = E[X]\,E[N] \]
where \(p_n = P(N = n)\).
By the Law of Iterated Expectations
E[Y ] = E[E[Y |N ]]
E[Y ] = E[N E[X]]
= E[X]E[N ]
This makes intuitive sense. The expected amount of money spent is equal to the
expected amount spent in each store multiplied by the expected number of stores.
Now I find the variance of the sum of a random number of random variables.
By the Law of Total Variance
var(Y) = E[var(Y|N)] + var(E[Y|N])
Now, I find the two terms.
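\[ E[\mathrm{var}(Y \mid N)] = E[N\,\mathrm{var}(X)] = \mathrm{var}(X)\,E[N] \]
\[ \mathrm{var}(E[Y \mid N]) = \mathrm{var}(N\,E[X]) = (E[X])^2\,\mathrm{var}(N) \]
\[ \mathrm{var}(Y) = \mathrm{var}(X)\,E[N] + (E[X])^2\,\mathrm{var}(N) \]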
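Both results for the random sum can be checked by brute force. The sketch below assumes hypothetical PMFs for N and X, with the Xi independent of each other and of N, builds the exact PMF of Y by enumerating every case, and compares its mean and variance with E[X]E[N] and var(X)E[N] + (E[X])^2 var(N).

```python
from itertools import product

# Hypothetical distributions (illustrative numbers only).
pmf_n = {1: 0.5, 2: 0.3, 3: 0.2}   # P(N = n): number of shops visited
pmf_x = {10.0: 0.4, 30.0: 0.6}     # P(X = x): money spent in one shop


def mean(pmf):
    return sum(value * prob for value, prob in pmf.items())


def variance(pmf):
    centre = mean(pmf)
    return sum((value - centre) ** 2 * prob for value, prob in pmf.items())


def pmf_of_random_sum(pmf_n, pmf_x):
    """Exact PMF of Y = X1 + ... + XN, enumerating every n and every spend pattern."""
    pmf_y = {}
    for n, prob_n in pmf_n.items():
        for spends in product(pmf_x, repeat=n):
            prob = prob_n
            for spend in spends:
                prob *= pmf_x[spend]
            total = sum(spends)
            pmf_y[total] = pmf_y.get(total, 0.0) + prob
    return pmf_y


pmf_y = pmf_of_random_sum(pmf_n, pmf_x)
mean_formula = mean(pmf_x) * mean(pmf_n)
var_formula = variance(pmf_x) * mean(pmf_n) + mean(pmf_x) ** 2 * variance(pmf_n)
print(mean(pmf_y), mean_formula)
print(variance(pmf_y), var_formula)
```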
Chapter 4
Bayesian Statistical
Inference
In Bayesian inference (BI), to find an unknown quantity/model we assume it to be a random variable
\(\Theta\). This random variable has a prior probability distribution \(p_\Theta(\theta)\). We observe
some data x, and use Bayes rule to derive a posterior probability distribution
\(p_{\Theta|X}(\theta \mid x)\). This captures all the information that x can give about \(\Theta\).
Summary of Bayesian inference
1. We start with a prior distribution \(p_\Theta\) or \(f_\Theta\) for the unknown RV \(\Theta\).
2. We have a model \(p_{X|\Theta}\) or \(f_{X|\Theta}\), i.e. what X looks like when \(\Theta = \theta\) is true. X is the
thing that we can observe.
3. To get the posterior distribution of \(\Theta\), use Bayes rule.
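The three steps of the summary can be sketched for a coin with unknown bias. Everything concrete here is an assumption made for illustration: the bias takes one of three candidate values with a uniform prior, the model is Binomial, and the observed data are 8 heads in 10 tosses.

```python
from math import comb

# Step 1: prior over the unknown bias (hypothetical candidate values).
prior = {0.25: 1 / 3, 0.5: 1 / 3, 0.75: 1 / 3}


def likelihood(theta, heads, tosses):
    """Step 2: the model P(X = heads | Theta = theta), Binomial in this sketch."""
    return comb(tosses, heads) * theta ** heads * (1.0 - theta) ** (tosses - heads)


def posterior(prior, heads, tosses):
    """Step 3: Bayes rule; the denominator comes from the total probability theorem."""
    unnormalised = {theta: p * likelihood(theta, heads, tosses) for theta, p in prior.items()}
    evidence = sum(unnormalised.values())
    return {theta: weight / evidence for theta, weight in unnormalised.items()}


post = posterior(prior, heads=8, tosses=10)
for theta, prob in post.items():
    print(theta, round(prob, 4))
```

With mostly heads observed, the posterior shifts its weight toward the largest candidate bias.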
Chapter 5
Problem solving tips
1. Deductibles: Suppose my deductible is $100. Then if the total bill is less
than $100, the insurance company will pay nothing. If the bill is $200,
the company will pay $200 - $100 = $100.
2. \(E[X^2]\) can be easily found by rearranging the formula \(\mathrm{Var}(X) = E[X^2] - (E[X])^2\).
Otherwise, use the formula \(\sum_x x^2\,p(x)\) or \(\int x^2 f(x)\,dx\).
3. Many complex problems come down to the Binomial in the end. Be careful
and do not miss the \(\binom{n}{k}\) term.
4. While changing the variable in an integration, I often mess up and forget
to change the limits. Do not be lazy: change the limits when
you change the variable.
5. Only IID trials can be modelled as Binomial, i.e. all the trials
should be independent and have the same success probability. In some problems, like the unknown bias of a
coin, the trials are only conditionally independent (given the bias). Using this condition, the
problem can be solved using the law of iterated expectation.
6. Wishful thinking. If knowing a random variable would help me solve the
problem, then condition on that random variable and then think
about the problem. Example: I need to find P(X < Y),
where X and Y are independent Random Variables. If I can find
P(X < y | Y = y), then I weight it by \(f_Y(y)\) and add this
up for all values of Y, or integrate in the case of a continuous RV, i.e.
\(P(X < Y) = \int P(X < y \mid Y = y)\,f_Y(y)\,dy\).
7. Maximum values. If Y = max(X1, X2, X3), what is P(Y > y)? If at
least one of the Xi > y, then Y will also be greater than y. So, this can
be modelled as \(P(\{X_1 > y\} \cup \{X_2 > y\} \cup \{X_3 > y\})\). Look at the next point to find \(P(Y \le y)\).
8. Another similar problem is in page 239. I need to find out the Expected
value of the largest claim made. The PDF of the claim fX is given.
Find the CDF \(F_X\). Now, \(F_Y(y) = P(Y < y) = P(\max(X_1, X_2, X_3) < y)\).
Suppose the maximum Y is at most 10; then none of the Xi can be
greater than 10. So, this means \(P(\{X_1 < y\} \cap \{X_2 < y\} \cap \{X_3 < y\})\), which for independent claims is the product of the three CDFs.
This gives me the CDF of the largest claim made. Differentiating it gives me the PDF
of the largest claim made, and then I can find the expected value. Actually,
the same method can be used to solve the previous problem.
9. For problems involving deductibles, create a new RV for the actual payment.
Example: the PDF for the loss is
\[ f(x) = \begin{cases} \frac{1}{x^2}, & x > 1 \\ 0, & \text{otherwise} \end{cases} \]
and the deductible is 2. Define Y as the retained loss of the policyholder. If the actual loss is between 1 and 2, then the policyholder
has to pay for everything. If the actual loss is greater than 2, then
the policyholder pays only the deductible, and the insurance company pays the actual loss minus the deductible:
\[ Y = \begin{cases} X, & 1 < X \le 2 \\ 2, & X > 2 \end{cases} \]
For the insurance company, define another random variable Z, representing what the company pays. The company pays nothing if X is less
than the deductible:
\[ Z = \begin{cases} 0, & 1 < X \le 2 \\ X - 2, & X > 2 \end{cases} \]
Wrong method: E[loss] - deductible doesn't give E[payment]. It
doesn't account for the case when the loss is less than the deductible and
the payment (by the insurance company) is zero.
10. PDFs with the term |x| can often be solved faster by using symmetry. If
the PDF is symmetric around a point, then that point is the expected
value (when the expected value exists). If a part of the PDF is symmetric around zero, then that part
contributes nothing to E[X] and can be ignored. Convince yourself of the symmetry by drawing the function.
11. Percentile. Given the PDF, what is the 95th percentile? It is asking for the
value x for which F(x) = 0.95, i.e. \(P(X \le x) = 0.95\).
12. How to find the median? Find the CDF F(x) and solve F(x) = 0.5.
13. The survival function is the complement of the CDF, i.e. \(S(x) = 1 - F(x) = P(X > x)\).
14. The concept of derived distributions is sometimes useful in calculating
payments, when the relationship between the RV of payment and the RV of
loss is given. For example:
The time, T, that a manufacturing system is out of operation has CDF
\[ F(t) = \begin{cases} 1 - \left(\frac{2}{t}\right)^2, & t > 2 \\ 0, & \text{otherwise} \end{cases} \]
The resulting cost to the company is \(Y = T^2\).
Determine the density function of Y, for y > 4.
(a) Write \(F_Y(y) = P(Y \le y)\)
(b) Replace Y with T using g(.):
\[ P(T^2 \le y) = P(T \le \sqrt{y}) = 1 - \left(\frac{2}{\sqrt{y}}\right)^2 = 1 - \frac{4}{y} \]
(c) \(f_Y(y) = F_Y'(y) = \frac{4}{y^2}\)
15. It is always useful to write the payment random variable before doing any
calculations. For example, \(f(y) = 2y^{-3}\) for y > 1, where y is the loss
value, and it is given that the payment is capped at 10. So, the RV for the
payment will be
\[ \text{Payment} = \begin{cases} 0, & Y \le 1 \\ Y, & 1 < Y < 10 \\ 10, & Y > 10 \end{cases} \]
Now that I've divided this
up into cases, I can easily calculate the Expected Value.
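Tips 6, 8 and 15 can be checked numerically. The sketch below is built on assumptions: for tip 6 it takes independent exponential X and Y with hypothetical rates 2 and 3; for tip 8 it takes three independent Uniform(0, 1) claims; for tip 15 it reads the loss density as 2y^(-3) for y > 1 with the payment capped at 10. The helper `integrate` is a plain midpoint rule written for this example.

```python
import math


def integrate(func, lower, upper, steps=200_000):
    """Midpoint-rule approximation of the integral of func over [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


# Tip 6: P(X < Y) by conditioning on Y (assumed exponential rates).
rate_x, rate_y = 2.0, 3.0


def prob_x_below(y):
    """P(X < y | Y = y) = F_X(y), because X and Y are independent."""
    return 1.0 - math.exp(-rate_x * y)


def pdf_y(y):
    return rate_y * math.exp(-rate_y * y)


# The upper limit 40 truncates a negligible tail.
prob_x_less_y = integrate(lambda y: prob_x_below(y) * pdf_y(y), 0.0, 40.0)

# Tip 8: expected value of the largest of three independent Uniform(0, 1) claims.
claims = 3


def pdf_max(y):
    """Derivative of F_X(y)^3 on (0, 1): 3 * F_X(y)^2 * f_X(y), with F_X(y) = y."""
    return claims * y ** (claims - 1) * 1.0


expected_max = integrate(lambda y: y * pdf_max(y), 0.0, 1.0)

# Tip 15: expected payment when the loss density is 2 / y^3 and the cap is 10.
cap = 10.0
prob_above_cap = cap ** -2  # P(Y > 10), the integral of 2 / y^3 from 10 to infinity
expected_payment = integrate(lambda y: y * 2.0 * y ** -3, 1.0, cap) + cap * prob_above_cap

print(prob_x_less_y, expected_max, expected_payment)
```

For the exponential case the result agrees with the closed form rate_x / (rate_x + rate_y).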
Chapter 6
Integration
1. \(\int_0^{\infty} x^t e^{-x}\,dx = t!\) for a non-negative integer t.
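This identity is the Gamma function evaluated at t + 1. A quick numerical check, assuming the integral is read as the integral of x^t e^(-x) over x from 0 to infinity and truncating it at 100 (a choice made here), compares the result with `math.factorial`:

```python
import math


def gamma_integral(t, upper=100.0, steps=100_000):
    """Midpoint-rule approximation of the integral of x^t * e^(-x) over [0, upper]."""
    width = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * width
        total += x ** t * math.exp(-x)
    return total * width


for t in range(6):
    print(t, gamma_integral(t), math.factorial(t))
```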