BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI
Work Integrated Learning Programmes Division
First Semester 2022-2023
Comprehensive exam
(EC-3 Regular)
Course No. : AAOC ZC111
Course Title : Probability & Statistics
Nature of Exam : Open Book
Weightage : 50%
No. of Pages : 4
No. of Questions : 6
Duration : 2.5 Hours
Date of Exam : 21/05/2023 (AN)
Note to Students:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a
fresh page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.
4. Use of a scientific calculator is allowed.
Q.1(a) Let the joint pdf of X and Y be
$$f(x, y) = \begin{cases} k x^2 y, & \text{if } x^2 \le y \le 1 \\ 0, & \text{otherwise.} \end{cases}$$
(a) Find the value of k.
(b) Hence, compute P (X > Y ).
(c) Calculate the marginal distributions for X and Y, and check if X and Y are
independent. [2+3+5]
Q.1(b) Let the joint pdf of X and Y be
$$f(x, y) = \begin{cases} k x^2 y, & \text{if } 0 < x < y < 1 \\ 0, & \text{otherwise.} \end{cases}$$
(a) Find the value of k.
(b) Calculate the marginal distributions for X and Y .
(c) Find the covariance between 2X + 3 and 3Y + 5. [2+3+5]
Q.1(c) Let the joint pdf of X and Y be
$$f(x, y) = \begin{cases} k x e^{-x(y+1)}, & \text{if } x > 0,\ y > 0 \\ 0, & \text{otherwise.} \end{cases}$$
(a) Find the value of k.
(b) Calculate the marginal distribution for Y .
(c) Find E[X|Y = y]. [4+3+3]
Q.2(a) The world-famous cell phone company NOKIA needs to set a minimum working
hours guarantee on its new high-quality model NOKIA-8110-4G. If the cell phone
wears out within the guaranteed hours, the company will replace it once for free. Tests
reveal that the mean working hours are 67,900 with a standard deviation of 2,500 hours
and that the distribution of working hours closely follows the normal distribution. With
this information, compute the minimum guaranteed working hours for NOKIA-8110-4G
so that no more than 4% of the cell phones will have to be replaced by the company.
It is given that Φ(−1.75) = 0.04. [5]
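The arithmetic behind Q.2(a) can be sketched as follows. This is only an illustrative check, assuming the guarantee is set at the 4th percentile of the normal distribution, so that the guaranteed hours equal mean + z × (standard deviation) with z = −1.75 as given. The function name `guarantee_hours` and the constant names are hypothetical helpers, not part of the question.

```python
from statistics import NormalDist

# Figures stated in the question.
MEAN_HOURS = 67_900.0
SD_HOURS = 2_500.0
MAX_REPLACED = 0.04  # at most 4% of phones may fail before the guarantee


def guarantee_hours(mean: float, sd: float, z: float) -> float:
    """Return the cut-off x such that P(X < x) = Phi(z) for X ~ N(mean, sd^2)."""
    return mean + z * sd


# Using the table value supplied in the question, Phi(-1.75) = 0.04.
table_answer = guarantee_hours(MEAN_HOURS, SD_HOURS, -1.75)

# Cross-check with the exact 4th-percentile z-score (assumption: an exact
# quantile is acceptable as a sanity check on the rounded table value).
exact_z = NormalDist().inv_cdf(MAX_REPLACED)
exact_answer = guarantee_hours(MEAN_HOURS, SD_HOURS, exact_z)

print(f"guarantee (table z) = {table_answer:.0f} hours")
print(f"guarantee (exact z) = {exact_answer:.1f} hours")
```

With the table value the cut-off is 67,900 − 1.75 × 2,500 = 63,525 hours; the exact quantile differs only by a couple of hours.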
Q.2(b) Older people often have a hard time finding work. According to the results of a recent
survey, 46 out of 200 workers aged 55 plus could not find their desired job.
(a) What is the point estimate of the proportion of older people who could not find
their suitable job?
(b) Develop a 95% confidence interval for the population proportion. [2+3]
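A minimal sketch of the computation in Q.2(b), assuming the usual large-sample (Wald) interval p̂ ± z·sqrt(p̂(1 − p̂)/n) is the intended method. The helper name `wald_interval` is hypothetical, and other interval constructions (for example Wilson) would give slightly different limits.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(successes: int, n: int, confidence: float):
    """Point estimate and large-sample (Wald) confidence interval for a proportion."""
    p_hat = successes / n
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, p_hat - margin, p_hat + margin


# Survey figures from the question: 46 of 200 workers.
p_hat, low, high = wald_interval(46, 200, 0.95)
print(f"point estimate = {p_hat:.2f}")
print(f"95% CI = ({low:.4f}, {high:.4f})")
```

Calling the same helper with `confidence=0.99` gives the wider interval asked for in the 99% variant of the question.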
Q.2(c) Older people often have a hard time finding work. According to the results of a recent
survey, 46 out of 200 workers aged 55 plus could not find their desired job.
(a) What is the point estimate of the proportion of older people who could not find
their suitable job?
(b) Develop a 99% confidence interval for the population proportion. [2+3]
Q.3(a) A research team from Coal India Limited (CIL) is studying the sulfur content
of the two major coal seams in a geographical region of Jharkhand. The sample
data collected through a completely randomized design are provided below:
Seam 1 Seam 2
1.51 1.69
1.92 0.90
1.08 1.41
2.04 1.01
2.14 1.41
At α = 0.05, is there any significant difference in the mean sulfur content of the coal
seams? Explain in detail by conducting an appropriate hypothesis test. [10]
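One possible approach to Q.3(a) is a pooled two-sample t-test, sketched below. It assumes independent samples, roughly normal populations, and equal population variances; the question does not state these, so they should be listed as assumptions in an answer. The helper `pooled_t` is hypothetical, and the critical value 2.306 is the standard two-sided 5% table value for 8 degrees of freedom.

```python
from math import sqrt
from statistics import mean, variance

# Sulfur content data from the table in the question.
seam1 = [1.51, 1.92, 1.08, 2.04, 2.14]
seam2 = [1.69, 0.90, 1.41, 1.01, 1.41]


def pooled_t(a, b):
    """Pooled-variance t statistic and degrees of freedom for H0: mu_a = mu_b."""
    n1, n2 = len(a), len(b)
    df = n1 + n2 - 2
    # Pooled estimate of the common variance (equal-variance assumption).
    sp2 = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / df
    se = sqrt(sp2 * (1 / n1 + 1 / n2))
    return (mean(a) - mean(b)) / se, df


t_stat, df = pooled_t(seam1, seam2)

# Two-sided critical value t_{0.025, 8} from standard t tables.
T_CRIT = 2.306
reject = abs(t_stat) > T_CRIT

print(f"t = {t_stat:.3f} with {df} degrees of freedom")
print("reject H0" if reject else "fail to reject H0")
```

Under these assumptions the statistic is about 1.86, which is below 2.306, so the data do not show a significant difference at the 5% level.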
Q.3(b) A recent study claims that playing music in class enhances concentration and
consequently helps students absorb more information. To test this, a school decided
to try it on a small group of randomly selected students from two different
classes. Five students (all of the same age) were randomly selected from each
classroom. Each classroom was provided with a different
environment for students to study. Classroom A had music playing constantly in
the background, and classroom B was a regular class with no music. After
one month, the school conducted a test for both groups and collected their test
scores (out of 10). The obtained test scores were as follows:
Class A Class B
7 4
9 5
5 6
8 2
7 7
At α = 0.05, is there any difference in the mean scores obtained by Class A and
Class B? Explain in detail by conducting an appropriate hypothesis test. [10]
Q.3(c) A recent study claims that playing music in class enhances concentration and
consequently helps students absorb more information. To test this, a school decided
to try it on a small group of randomly selected students from two different
classes. Five students (all of the same age) were randomly selected from each
classroom. Each classroom was provided with a different
environment for students to study. Classroom A had music playing constantly in
the background, and classroom B was a regular class with no music. After
one month, the school conducted a test for both groups and collected their test
scores (out of 10). The obtained test scores were as follows:
Class A Class B
7 6
9 2
5 3
8 4
7 5
At α = 0.01, is there any difference in the mean scores obtained by Class A and
Class B? Explain in detail by conducting an appropriate hypothesis test. [10]
Q.4(a) A tube light manufacturer claims that 80% of the tube lights it manufactures
have a lifetime of more than 2 years. The consumer forum thinks the percentage is lower.
(a) Identify the population parameter being tested and state the null and alternative
hypotheses.
(b) A survey conducted by the consumer forum to challenge the company’s claim
indicates that, out of 150 randomly selected tube lights, 110 had a lifetime of more than
2 years. Does this evidence lead to the rejection of the null hypothesis at the 5% level of
significance?
(c) Determine the probability of a Type II error at the 5% level of significance if the
consumer forum thinks that 70% of tube lights have a lifetime of more than 2 years. [2+4+4]
Q.4(b) A tube light manufacturer claims that 80% of the tube lights it manufactures
have a lifetime of more than 2 years. The consumer forum thinks the percentage is lower.
(a) Identify the population parameter being tested and state the null and alternative
hypotheses.
(b) A survey conducted by the consumer forum to challenge the company’s claim
indicates that, out of 150 randomly selected tube lights, 110 had a lifetime of more than
2 years. Does this evidence lead to the rejection of the null hypothesis at the 5% level of
significance?
(c) Determine the probability of a Type II error at the 5% level of significance if the
consumer forum thinks that 70% of tube lights have a lifetime of more than 2 years. [2+4+4]
Q.4(c) A random sample of 100 recorded deaths in India during a one-year period showed
an average life span of 71.6 years. Assuming a population standard deviation of 8.9
years, does this seem to indicate that the mean life span during that year was greater
than 70 years? Use a 0.05 level of significance. [10]
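The test in Q.4(c) can be sketched as a one-sided z-test, assuming H0: μ = 70 against H1: μ > 70 with the population standard deviation treated as known, as the question states. The helper name `upper_tail_z_test` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def upper_tail_z_test(sample_mean, mu0, sigma, n, alpha):
    """z statistic, critical value and p-value for H0: mu = mu0 vs H1: mu > mu0."""
    z = (sample_mean - mu0) / (sigma / sqrt(n))
    z_crit = NormalDist().inv_cdf(1 - alpha)  # about 1.645 for alpha = 0.05
    p_value = 1 - NormalDist().cdf(z)
    return z, z_crit, p_value


# Figures from the question: n = 100, sample mean 71.6, sigma = 8.9.
z, z_crit, p_value = upper_tail_z_test(71.6, 70.0, 8.9, 100, 0.05)
reject = z > z_crit

print(f"z = {z:.3f}, critical value = {z_crit:.3f}, p-value = {p_value:.4f}")
print("reject H0" if reject else "fail to reject H0")
```

Here z = 1.6/0.89 ≈ 1.80 exceeds the critical value of about 1.645, so under these assumptions the sample supports a mean life span greater than 70 years at the 0.05 level.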
Q.5(a) Find the maximum likelihood estimate of the parameter α of a population having
probability density function
$$f(x) = \begin{cases} \dfrac{2}{\alpha^2}(\alpha - x), & \text{if } 0 \le x \le \alpha \\ 0, & \text{otherwise,} \end{cases}$$
for a sample of size 1, x being the sample value. Is it an unbiased estimate? [10]
Q.5(b) Let X be a discrete random variable with the following probability mass function
X −2 −1 0 1 2
2(1+θ)
f (x) 3θ 1−5θ
3
θ
3 3
θ
3
where 0 < θ < 0.2 is a parameter. Based on the 8 independent observations {−2, 0,
1, 2, 1, 1, 0, 2}, find the maximum likelihood estimate of θ. [10]
Q.5(c) Let X be a discrete random variable with the following probability mass function
X −2 −1 0 1 2
2(1+θ)
f (x) 3θ 1−5θ
3
θ
3 3
θ
3
where 0 < θ < 0.2 is a parameter. Based on the 8 independent observations {−2, 0,
1, 2, −1, 1, 0, 2}, find the maximum likelihood estimate of θ. [10]
Q.6(a) Write the estimated linear regression equation for the following data, where x is
the independent variable and y is the dependent variable. [5]
x 2 4 5 6 7 8
y 14 33 40 60 75 85
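A minimal sketch of the least-squares fit for Q.6(a), assuming the intended model is the simple linear regression ŷ = a + b·x with b = Sxy/Sxx and a = ȳ − b·x̄. The function name `least_squares` is a hypothetical helper.

```python
from statistics import mean

# Data from the table in the question.
x = [2, 4, 5, 6, 7, 8]
y = [14, 33, 40, 60, 75, 85]


def least_squares(xs, ys):
    """Return (intercept, slope) of the ordinary least-squares line."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sums of cross-products and squares about the means.
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    sxx = sum((xi - x_bar) ** 2 for xi in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


intercept, slope = least_squares(x, y)
print(f"y_hat = {intercept:.4f} + {slope:.4f} x")
```

For these data Sxy = 863/3 and Sxx = 70/3, giving a slope of about 12.33 and an intercept of about −14.59. The same helper can be applied to any other pair of x and y lists of equal length.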
Q.6(b) One of the most challenging problems confronting the river pollution control field
is presented by the tanning industry. Tannery wastes are chemically complex, often
characterized by high values of chemical oxygen demand, volatile solids and other
pollution measures. Consider the following experimental data which were obtained
from 6 samples of chemically treated waste in a study conducted near Kanpur.
x 3 7 11 15 18 27
y 5 11 21 16 16 28
Find the equation of the estimated linear regression line. [5]
Q.6(c) One of the most challenging problems confronting the river pollution control field
is presented by the tanning industry. Tannery wastes are chemically complex, often
characterized by high values of chemical oxygen demand, volatile solids and other
pollution measures. Consider the following experimental data which were obtained
from 6 samples of chemically treated waste in a study conducted near Kanpur.
x 3 7 11 15 18 27
y 5 11 28 19 13 28
Find the equation of the estimated linear regression line. [5]