Week 1: Assignment1: Assignment Submitted On 2022-01-28, 20:30 IST
Week 1: Assignment1: Assignment Submitted On 2022-01-28, 20:30 IST
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week 1 : Assignment1
The due date for submitting this assignment has passed.
How does an Due on 2022-02-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-01-28, 20:30 IST
1) State True or false: 1 point
Week 0 () Statement: Data can be generated by machines but not by humans.
Week 1 ()
True
False
Introduction to
data analytics Yes, the answer is correct.
Score: 1
(unit?
unit=17&lesson=18) Accepted Answers:
False
Python
Fundamentals - 2) Which one of the following is not a classification of Data Analytics? 1 point
I (unit?
unit=17&lesson=19)
Diagnostic analytics
Deceptive analytics
Python
Fundamentals -
Predictive analytics
II (unit?
Prescriptive analytics
unit=17&lesson=20)
Yes, the answer is correct.
Central Score: 1
Week 3 ()
Accepted Answers:
A is correct B is false
Week 4 ()
5) For getting 4rd, 5th and 7th row of a datafile “df”in Python programming, we can write: 1 point
Week 5 ()
df.loc[[3,4,6]]
Week 6 ()
df.loc[[4,5,7]]
df.iloc[3,4,6]
Week 7 ()
None of these
Accepted Answers:
Week 9 ()
df.loc[[3,4,6]]
Skewness
Week 11 ()
Kurtosis
Week 12 ()
Range
percentile
Download
Yes, the answer is correct.
Videos ()
Score: 1
Accepted Answers:
Weekly percentile
Feedback ()
7) State the following true or false? Statement: Bimodal Data sets contains two modes. 1 point
Text
True
Transcripts ()
False
Y h i
Yes, the answer is correct.
Books () Score: 1
Accepted Answers:
Live sessions True
- Solve
8) Bar Charts are used for : 1 point
sample
problems
Continuous data
with us ()
Categorical data
Both of these
None of these
Accepted Answers:
Categorical data
Ordinal data
Interval data
Nominal data median applicable for ordinal data and interval but
mean not applicable for both ordiinal and nominal
None of these
Accepted Answers:
Nominal data
True
False
Accepted Answers:
True
Assignment 1 Solution
Q1 B
Data can be generated by machine, humans, or their interaction
Q2 B
Deceptive analytics is not a classification of data Analytics
Q3 A
Q4 C
We cannot pass negative values as an input with loc
Q5 A
The location index in python starts from zero not 1.
Q6 D
Percentile is not a measure of dispersion.
Q7 A
Q8 B
Q9 C
Q10 A
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week2 : Assignment 2
The due date for submitting this assignment has passed.
How does an Due on 2022-02-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-01-28, 20:31 IST
1) A college plans to interview 8 students for possible offers of graduate assistant-ships. 1 point
Week 0 () The college has three assistant-ships available. How many groups of three can the college select?
Week 1 ()
126
56 8c3
Week 2 ()
136
Introduction to
130
Probability- I
Yes, the answer is correct.
(unit? Score: 1
Score: 1
Distributions - II
Accepted Answers:
(unit?
a sample point
unit=25&lesson=29)
Probability 3) Two events having nonzero probabilities 1 point
Distributions -
III (unit?
can be both mutually exclusive and independent
unit=25&lesson=30)
cannot be both mutually exclusive and independent
Quiz: Week2 :
are always mutually exclusive
Assignment 2
are always independent
(assessment?
Yes, the answer is correct.
name=125)
Score: 1
Assignment Accepted Answers:
solution week 2 cannot be both mutually exclusive and independent
(unit?
4) Ten individuals are candidates for positions of president, vice president of an 1 point
unit=25&lesson=135)
organization. How many possibilities of selections exist?
Week 3 ()
90
Week 4 ()
100
10c1*9c1
120
Week 5 ()
130
Accepted Answers:
Week 7 () 90
Mean 1 and standard distribution 0
Week 9 ()
Mean 0.5 and standard distribution 0.5
std nrml dist mean=0,std dist-1
Week 10 ()
Mean 0 and standard distribution 1
Mean 1 and standard distribution 1
Week 11 ()
Yes, the answer is correct.
Score: 1
Accepted Answers:
0.9772
True
False
Accepted Answers:
False
9) For a binomial experiment with p = 0.5 and a sample size of 100. The expected value 1 point
of this distribution is?
0.50
0.30
np
100
50
Accepted Answers:
50
True
False
Accepted Answers:
False
Assignment 2 Solution
Q1 B
P= 8C3 = 8x7x6! / (5!x 3!)= 56
Q2 B
Q3 B
Q4 A
P = 10C1 x 9C1 = 10x9 =90
Q5 C
Q6 B
z = x - mu / sigma = 241.25 - 200/ 25 = 1.65
from z table, the area between infinity and z score is 0.0495
Q7 B
z = x - mu / sigma = 250- 200/ 25 = 2
from z table, the area between - infinity and z score is 0.9772
Q8 B
Q9 D
Expected value = np = 0.5 x100 = 50
Q10 B
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week3 - Assignment 3
The due date for submitting this assignment has passed.
How does an Due on 2022-02-16, 23:59 IST.
NPTEL online As per our records you have not submitted this assignment.
course work?
() 1) Sate True or False: 1 point
Statement: The specific value of a random variable is called estimator
Week 0 ()
True estimate
Week 1 ()
False
Accepted Answers:
Week 3 () False
2) If the true proportion of customers who are below 20 years is P=0.35, what is the 1 point
Python Demo
probability that a sample size 100 yields a sample proportion between 0.3 to 0.4
for Distributions
(unit?
0.961
unit=32&lesson=33)
0.827
Sampling and
0.706
Sampling
Distribution
0.53
(unit?
No, the answer is incorrect.
unit=32&lesson=34) Score: 0
Distribution of
Accepted Answers:
Sample Means,
0.706
population, and
3) Stratified random sampling is a method of selecting a sample in which 1 point
variance (unit?
unit=32&lesson=35)
the sample is first divided into strata, and then random samples are taken from each stratum
Confidence
various strata are selected from the sample
interval
the population is first divided into strata, and then random samples are drawn from each
estimation:
stratum
Single
population - I
None of these alternatives is correct
(unit?
No, the answer is incorrect.
unit=32&lesson=36) Score: 0
Accepted Answers:
Confidence
the population is first divided into strata, and then random samples are drawn from each stratum
interval
estimation: 4) Sate True or False: 1 point
Single
Statement: A population is a set of all items or individual of interest
population - II
(unit?
True
unit=32&lesson=37)
False
Quiz: Week3 -
No, the answer is incorrect.
Assignment 3
Score: 0
(assessment?
Accepted Answers:
name=124)
True
Solution for
5) A question paper contains 90 multiple choice questions. There are 4 alternative 1 point
week 3 (unit?
unit=32&lesson=136)
answers (A, B, C or D) out of which only one is correct. Mr X answers these questions randomly (i.e.
without preparation). What is the probability that X gets a score of at least 10 marks?
Week 4 ()
0.9997
Week 5 ()
0.7894
0
Week 6 ()
0.001
Score: 0
Accepted Answers:
Week 8 () 0.9997
Week 10 ()
0.065
Week 11 ()
0.075
0.085
Week 12 ()
0.095
Text
0.0668
Transcripts ()
0.544
Books ()
0.082
Live sessions
0.205
- Solve No, the answer is incorrect.
sample Score: 0
8) A car distributor in city Y experiences on an average 2.5 car sales per day. Find the 1 point
probability that on a randomly selected day, they will sell no car:
0.0668
0.544
0.082
0.205
No, the answer is incorrect.
Score: 0
Accepted Answers:
0.082
9) A random sample of 100 people shows that 25 of them are females and rest are males. 1 point
Form a 95% confidence interval for the true proportion of females. The lower limit of this interval will
be:
0.150
0.145
0.165
0.175
Accepted Answers:
0.165
10) A random sample of 100 people shows that 25 of them are females and rest are males. 1 point
Form a 95% confidence interval for the true proportion of females. The upper limit of this interval will
be:
0.150
0.165
0.465
0.335
Accepted Answers:
0.335
Assignment 3 solution
A1 B
The specific value of a random variable is called estimate
A2 C
A3 C
A4 A
A5 A
A6 B
A7 A
A8 C
A9 C
A10 D
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week4 : Assignment 4
The due date for submitting this assignment has passed.
How does an Due on 2022-02-23, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-02-23, 22:21 IST
1) If we have a sample size of 20 and population standard deviation is known, we will use: 1 point
Week 0 ()
t- test for hypothesis testing
Week 1 ()
z-test for hypothesis testing
both t and z test
Week 2 ()
F-test
Score: 1
Testing- III
Accepted Answers:
(unit?
Two tail test
unit=39&lesson=42)
Errors in 3) The quality-control manager at a Li-BATTERY factory needs to determine whether the 1 point
Hypothesis mean life of a large shipment of Li-Battery is equal to the specified value of 375 hours. The process
Testing (unit? standard deviation is known to be 100 hours. A random sample of 64 batteries indicates a sample
unit=39&lesson=43) mean life of 350 hours.
State the null and alternative hypotheses
Hypothesis
Testing: Two
Mu = 375
sample test- I
(unit?
Mu ≤ 375
unit=39&lesson=44)
Mu = 350
Important Data
Mu ≥ 350
Sets (unit?
Yes, the answer is correct.
unit=39&lesson=45) Score: 1
Week 8 ()
Accepted Answers:
Yes, there is
Week 9 ()
5) For one-tailed test, the test statistic z is determined to be zero. The p-value for this test 1 point
Week 10 () is:
Week 11 ()
Zero
-0.5
Week 12 ()
+0.5
1.0
Download
Videos () Yes, the answer is correct.
Score: 1
Accepted Answers:
Weekly
+0.5
Feedback ()
6) The error of rejecting a true null hypothesis is: 1 point
Text
Transcripts ()
a Type I error
a Type II error
Books ()
is the same as Beta
committed when not enough information is available
Y h i
Yes, the answer is correct.
Live sessions Score: 1
- Solve Accepted Answers:
sample a Type I error
problems
7) The mean cost of a hotel room in a city is said to be $168 per night. A random sample 1 point
with us ()
of 25 hotels resulted in X-bar = $172.50 and sample standard deviation s = 15.40. Calculate the t
statistic.
2
-2
1.46
-1.46
Accepted Answers:
1.46
no conclusions can be drawn from the test
the alternative hypothesis is true
the data must have been accumulated incorrectly
the sample size has been too small
Accepted Answers:
the alternative hypothesis is true
1 - the level of significance
the critical value
the confidence level
the level of significance
Yes, the answer is correct.
Score: 1
Accepted Answers:
the level of significance
will always be rejected at the 1% level
will always be accepted at the 1% level
will never be tested at the 1% level
May be rejected or not rejected at the 1% level
A dA
Accepted Answers:
May be rejected or not rejected at the 1% level
Assignment 4 Solution
A1 B
A2 C
A3 A
A4 A
A5 C
A6 A
A7 C
A8 B
A9 D
A10 D
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week5 : Asignment 5
The due date for submitting this assignment has passed.
How does an Due on 2022-03-02, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-01, 20:06 IST
1) In the analysis of variance procedure (ANOVA) the term "factor" refers to: 1 point
Week 0 ()
the dependent variable
Week 1 ()
the independent variable
different levels of a treatment
Week 2 ()
the critical value of F
Week 5 () 2) In a problem of ANOVA, involving 3 treatments and 10 observations per treatment, 1 point
SSE = 500. The MSE for this situation is:
Hypothesis
Testing: Two
130.2
sample test- II
48.8
(unit?
18.52 MSE=SSE/DOF
unit=47&lesson=48)
30.0
Hypothesis
Testing: Two Yes, the answer is correct.
sample test- III Score: 1
Analysis(Tukey’s
test) (unit? Accepted Answers:
unit=47&lesson=52) MSTR/MSE
Important Data 4) An ANOVA procedure is applied to data obtained from 7 samples where each sample 1 point
files (unit? contains 10 observations. The degrees of freedom for the critical value of F are:
unit=47&lesson=53)
7 numerator and 20 denominator degrees of freedom
Quiz: Week5 : NU = 6-1
Asignment 5
5 numerator and 20 denominator degrees of freedom DU=70-7
(assessment?
6 numerator and 63 denominator degrees of freedom
name=121)
7 numerator and 63 denominator degrees of freedom
Solution For Yes, the answer is correct.
Week 5 (unit? Score: 1
Week 8 ()
120
SSE=SST-SSTR
80
Week 9 ()
220
Accepted Answers:
Week 11 () 120
Week 12 () 6) The critical F value with 8 numerator and 29 denominator degrees of freedom at alpha 1 point
= 0.01 is
Download
2.18
Videos ()
3.20
Weekly
3.53
Feedback ()
3.94
Accepted Answers:
t distribution with 48 degrees of freedom
True
False
Accepted Answers:
True
9) Mean marks obtained by male and female students of school ABCD in first unit test are 1 point
shown as below.
Male Female
Sample Size 64 36
Sample Mean Marks 44 41
Population Variance 128 72
The standard error for the difference between the two means is
4.0
7.46
4.24
2.0
Yes, the answer is correct.
Score: 1
Accepted Answers:
2.0
10) Mean marks obtained by male and female students of school ABCD in first unit test are 1 point
shown as below.
Male Female
Sample Size 64 36
Sample Mean Marks 44 41
Population Variance 128 72
If you are interested in testing whether or not the average marks of males is significantly greater
than that of females, the test statistic is:
2.0
1.5
1.96
1.645
Accepted Answers:
1.5
Assignment 5 Solution
ANSWER KEY
A1 B
A2 C
MSE = SSE/DOF =500/(30-3) = 18.52
A3 B
A4 C
NUMERATOR DOF = C-1 =6
DENOMINATOR DOF =N-C = 70 - 7 = 63
A5 B
SSE = SST-SSTR = 200 – 80 = 120
A6 B (USE F TABLE)
A7 D
DOF for two sample t test = n1+n2 -2 = 15 +35 -2 = 48
A8 A
Only z test is possible in case of two proportions.
A9 D
SE = sigma/√n = √((s12 / n1)+( s22 / n2)) = √(2+2) = 2
A10 B
t = (mean 1 – mean2)/ SE = 3/2 = 1.5
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week6 : Assignment 6
The due date for submitting this assignment has passed.
How does an Due on 2022-03-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-08, 19:28 IST
1) Sate True or False: 1 point
Week 0 () Statement: In regression analysis the error term is normally distributed
Week 1 ()
True
False
Week 2 ()
Yes, the answer is correct.
Score: 1
unit=55&lesson=56)
Accepted Answers:
Two Way estimated regression equation
ANOVA (unit?
unit=55&lesson=57) 3) State True or False:
Statement: A completely randomized design (CRD) is useful when 1 point
the experimental units are heterogeneous
Linear
True
Regression - I
False
(unit?
unit=55&lesson=58) Yes, the answer is correct.
Score: 1
Linear
SSE = SST
Regression - III
SSE = 1
(unit?
unit=55&lesson=60)
SSR = SSE
SSR = SST
Important Data
files (unit? Yes, the answer is correct.
unit=55&lesson=61) Score: 1
Accepted Answers:
Quiz: Week6 : SSR = SST
Assignment 6
(assessment? 5) In a completely randomized design, a random sample of Salesmen would be 1 point
name=122) assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
Solution for
week 6 (unit? only moderate or even low surge to another. A study measuring the efficiency of the salesmen
unit=55&lesson=150) resulted in proposals for modification and redesign of the salesmen’s work schedule. After
consideration of several schedules for the work, three specific alternatives are selected as having
Week 7 () the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
alternatives differ in terms of their effect on the efficiency of the salesmen?
Week 9 () 2 74 74 74
3 70 71 75
Week 10 () 4 73 72 77
5 76 73 76
Week 11 () 6 73 73 73
Week 12 ()
Accept the null hypothesis
Download
Reject the null hypothesis
Videos ()
Can’t state any conclusion
None of these
Weekly
Feedback () Yes, the answer is correct.
Score: 1
Accepted Answers:
Text
Accept the null hypothesis
Transcripts ()
6) In a completely randomized design, a random sample of Salesmen would be 1 point
Books () assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
only moderate or even low surge to another. A study measuring the efficiency of the salesmen
resulted in proposals for modification and redesign of the salesmen’s work schedule. After
Live sessions
consideration of several schedules for the work, three specific alternatives are selected as having
- Solve
the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
sample
alternatives differ in terms of their effect on the efficiency of the salesmen?
problems
with us ()
2 74 74 74
3 70 71 75
4 73 72 77
5 76 73 76
6 73 73 73
Accept the null hypothesis
Reject the null hypothesis
Can’t state any conclusion
None of these
Accepted Answers:
Reject the null hypothesis
2 74 74 74
3 70 71 75
4 73 72 77
5 76 73 76
6 73 73 73
1.955
9.555
6.855
3.588
Accepted Answers:
3.588
1 75 76 78
2 74 74 74
3 70 71 75
4 73 72
77
5 76 73 76
6 73 73 73
1.955
9.555
6.855
3.588
Accepted Answers:
1.955
True
False
larger than SST
smaller than SST
equal to 1
equal to zero
Accepted Answers:
larger than SST
Assignment 6 Solution
A1 A
In regression analysis the error term is normally distributed
A2 C
Since the error term is not present it is an estimated regression equation
A3 B
A4 D
If r square is 1, it means there is no error term
A5 A
Solution of Problem 5 to 8
A6 B
A7 D
A8 A
A9 A
A10 A
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week7 : Assignment 7
The due date for submitting this assignment has passed.
How does an Due on 2022-03-16, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-16, 19:09 IST
1) Which of the following
is not an assumption for simple linear regression? 1 point
Week 0 ()
Normally distributed variables
Week 1 ()
Multicollinearity
Linear relationship
Week 2 ()
Normally distributed residuals
Accepted Answers:
Week 4 ()
Multicollinearity
T-statistic squared
Week 6 ()
Square root of SSE
Week 7 ()
Square root of SST
Square root of MSE
Estimation,
Prediction of Yes, the answer is correct.
Score: 1
Regression
Model Residual Accepted Answers:
Analysis (unit? Square root of MSE
unit=63&lesson=64)
3) Which of the following
is true about multiple regression model? 1 point
Estimation,
It has only one independent variable
Prediction of
It has more than one dependent variable
Regression
Model Residual
It has more than one independent variable
Analysis - II
It has at least 2 dependent variables
(unit?
Yes, the answer is correct.
unit=63&lesson=65)
Score: 1
MULTIPLE Accepted Answers:
REGRESSION It has more than one independent variable
MODEL - I
4) In a multiple regression
model, the error term ɛ is assumed to 1 point
(unit?
unit=63&lesson=66)
Have a mean of 1
MULTIPLE
Have a variance of 0
REGRESSION
Have a standard deviation of 1
MODEL-II
(unit?
Be normally distributed
unit=63&lesson=67) Yes, the answer is correct.
Score: 1
Categorical
Accepted Answers:
variable
Be normally distributed
regression
(unit? 5) For a multiple
regression model with 2 independent variables, R.sq = 0.9041 point
unit=63&lesson=68) and adjusted R. sq
= 0.88, determine the number of observations (n)
Important data
6
Files (unit?
unit=63&lesson=69)
7
9
Quiz: Week7 :
Assignment 7
10
(assessment? No, the answer is incorrect.
name=133) Score: 0
Accepted Answers:
Assignment
10
solution - Week
7 (unit? 6) If the R.sq value is small
for a model with a large number of independent 1 point
unit=63&lesson=153) variables, the adjusted
coefficient of determination _______________
Week 8 ()
Can be positive
Can be negative
Week 9 ()
is 0
Week 10 ()
Can't say
Accepted Answers:
Week 12 () Can be negative
Accepted Answers:
Books () Mean of residuals is always 0
Accepted Answers:
By its slope
9) To check whether a
significant relationship exists between the dependent 1 point
and set of all
independent variables, _____ is used. It is the test for overall
significance.
F-test
R.sq test
a correlation test
t-test
Accepted Answers:
F-test
AUC-ROC
Accuracy
Logloss
Mean-Squared-Error
Accepted Answers:
Mean-Squared-Error
NPTEL – Data Analytics with Python
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week8 : Assignment 8
The due date for submitting this assignment has passed.
How does an Due on 2022-03-23, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-23, 11:22 IST
1) For categorical data
with ‘n’ categories, the number of dummy variables 1 point
Week 0 () will be________
Week 1 ()
n
n - 1
Week 2 ()
n + 1
2n
Week 3 ()
Yes, the answer is correct.
Score: 1
Week 4 ()
Accepted Answers:
n-1
Week 5 ()
2) In the estimation of
regression parameters 1 point
Week 6 ()
The likelihood function
is a function of only 𝜎
Week 7 ()
The values of 𝛽0..𝛽n
and 𝜎 should be such that, they maximize the likelihood
function.
Week 8 ()
Both a. and b.
None of these
Maximum
Likelihood No, the answer is incorrect.
Score: 0
Estimation- I
(unit?
Accepted Answers:
unit=71&lesson=72)
Maximum
The values of 𝛽0..𝛽n
and 𝜎 should be such that, they maximize the likelihood
Likelihood
function.
Estimation-II 3) In logistic regression,
the null hypothesis tested is: 1 point
(unit?
unit=71&lesson=73)
H0: β = 0
LOGISTIC
H0: β ≠ 0
REGRESSION-
H0: μ = 0
I (unit?
H0: μ ≠ 0
unit=71&lesson=74)
Yes, the answer is correct.
LOGISTIC Score: 1
Week 10 ()
In logistic regression, the dependent variable must be continuous data
In logistic regression, the dependent variable must be categorical data
Week 11 ()
In logistic regression, both dependent and independent variables must be categorical data
Week 12 ()
None of these
True
Text
False
Transcripts () Yes, the answer is correct.
Score: 1
Odds will be 1
None of these
Accepted Answers:
Odds will be 1
That there are a greater number of explained vs. unexplained observations.
That the statistical model fits the data well.
That as the predictor variable increases, the likelihood of the outcome occurring decreases.
That the statistical model is a poor fit of the data
No, the answer is incorrect.
Score: 0
Accepted Answers:
That the statistical model is a poor fit of the data
(– ∞ , ∞)
(0,1)
(0 , ∞)
(- ∞, 0 )
Accepted Answers:
(– ∞ , ∞)
NPTEL: Data Analytics with Python
Week 8 – Assignment solutions
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week9 : Assignment 9
The due date for submitting this assignment has passed.
How does an Due on 2022-03-30, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-30, 11:46 IST
1) State true or
false: Statement: there is no difference
between, y = β0 + 1 point
Week 0 () β1x + 𝜖
and E(y) = β0 + β1x , both are
regression equations
Week 1 ()
True
False
Week 2 () No, the answer is incorrect.
Score: 0
Week 5 ()
Sensitivity in ROC analysis is called True Positive Rate(tpr)
Specificity in ROC analysis is not called True Negative Rate (tnr)
Week 6 ()
Specificity in ROC analysis is called True Positive Rate(tpr)
Week 7 ()
Sensitivity in ROC analysis is called True Negative Rate (tnr)
Week 8 ()
Accepted Answers:
Week 9 () Sensitivity in ROC analysis is called True Positive Rate(tpr)
unit=79&lesson=81)
Accepted Answers:
Performance of Sensitivity decreases
Logistic Model-
III (unit? 4) Sensitivity in ROC
analysis is defined as: 1 point
unit=79&lesson=82)
FP / (FP+TN)
Regression
FN/(TP+FN)
Analysis Model
Building - I
TN / (TN+FP)
(unit?
TP / (TP+FN)
unit=79&lesson=83)
Yes, the answer is correct.
Score: 1
Regression
Analysis Model Accepted Answers:
Building TP / (TP+FN)
(Interaction)- II
5) In ROC analysis, a
classifier is called ‘good’ if it has ______ 1 point
(unit?
unit=79&lesson=84)
Low TPR and Low FPR
Important data
Low TPR and High FPR
files (unit?
High TPR and Low FPR
unit=79&lesson=85)
High TPR and High FPR
Quiz: Week9 :
Yes, the answer is correct.
Assignment 9
Score: 1
(assessment?
Accepted Answers:
name=142)
High TPR and Low FPR
Week 10 () 6) For the given confusion
matrix, compute the sensitivity 1 point
Accepted Answers:
Text
0.8
Transcripts ()
7) State true or False:
Precision is inversely proportional to sensitivity 1 point
Books ()
True
False
Y h i
Yes, the answer is correct.
Live sessions Score: 1
- Solve Accepted Answers:
sample False
problems
8) State True or False:
Standardization of features is not required before 1 point
with us ()
training a Logistic
regression model
True
False
Accepted Answers:
True
Linear Regression errors values have to be normally distributed but in the case of Logistic
Regression it is not the case
Logistic Regression errors values have to be normally distributed but in the case of Linear
Regression it is not the case
Both Linear Regression
and Logistic Regression error values have to be
normally distributed
Both Linear Regression
and Logistic Regression error values have not to be
normally distributed
Yes, the answer is correct.
Score: 1
Accepted Answers:
Linear Regression errors values have to be normally distributed but in the case of Logistic
Regression it is not the case
The dependent variable is continuous
The dependent variable is divided into two equal subcategories
The dependent variable consists of two categories
There is no dependent variable
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week10 : Assignment 10
The due date for submitting this assignment has passed.
How does an Due on 2022-04-06, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-04-06, 11:51 IST
1) Sampling distribution
for the goodness of fit test is the 1 point
Week 0 ()
Poisson distribution
Week 1 ()
t distribution
normal distribution
Week 2 ()
chi-square distribution
Accepted Answers:
Week 4 ()
chi-square distribution
lower-tail test
Week 6 ()
upper-tail test
Week 7 ()
middle test
None of these
Week 8 ()
Yes, the answer is correct.
Score: 1
Test of
Independence - Accepted Answers:
II (unit? True
unit=87&lesson=89)
4) Statistical test
conducted to determine whether to reject or not reject a 1 point
Chi-Square hypothesized
probability distribution for a population is known as a
Goodness of Fit ________
Test (unit?
contingency test
unit=87&lesson=90)
probability test
Cluster
goodness of fit test
analysis:
Introduction- I
None of these
(unit?
No, the answer is incorrect.
unit=87&lesson=91) Score: 0
Download
True
Videos ()
False
Accepted Answers:
chi2_contingency
4.2 and 3
3.15 and 2
3.61 and 5
None of these
Accepted Answers:
3.61 and 5
True
False
Accepted Answers:
False
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week11 : Assignment 11
The due date for submitting this assignment has passed.
How does an Due on 2022-04-13, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-04-13, 21:19 IST
1) ________ is used for
calculating distance measures in clustering using 1 point
Week 0 () python
Week 1 ()
distance_matrix
spatial_matrix
Week 2 ()
scipy_matrix
distance.matrix
Week 3 ()
Yes, the answer is correct.
Score: 1
Week 4 ()
Accepted Answers:
distance_matrix
Week 5 ()
2) The formula for dissimilarity computation between two objects for categorical variables 1 point
Week 6 () is –
A dA
Accepted Answers:
Week 11 () D(i,j) = p-m / p
Clustering 3) Select the correct option for a data set with 7 objects and an interval-scaled variable ‘f’ 1 point
analysis: Part we have the following measurements:
III (unit? f = (1, 2, 3, 4, 5, 8, 50) containing one outlying value.
unit=95&lesson=96)
Std deviation (std_f) and mean absolute deviation (s_f) are equally affected
Cluster
analysis: Part
Mean absolute deviation (s_f) is more affected by the outlier
IV (unit?
Std deviation (std_f) is more affected by the outlier
unit=95&lesson=97)
None of these
Cluster
Yes, the answer is correct.
analysis: Part V Score: 1
(unit?
Accepted Answers:
unit=95&lesson=98)
Std deviation (std_f) is more affected by the outlier
K- Means
4) Which of the following is true for K-means clustering? 1 point
Clustering
(unit?
It comes under the partitioning method
unit=95&lesson=99)
The number of clusters is predefined for this method
Hierarchical
Cluster similarity is measure in regard to the mean value of the objects in a cluster
method of
clustering -I
All of the above
(unit? Yes, the answer is correct.
unit=95&lesson=100) Score: 1
Accepted Answers:
Important data
All of the above
files (unit?
unit=95&lesson=101) 5) Which of the following can act as possible termination conditions in K-Means? 1 point
Quiz: Week11 :
1. For a fixed number of iterations.
Assignment 11 2. Assignment of observations to clusters does not change between iterations. Except for cases
(assessment? with a bad local minimum.
name=144) 3. Centroids do not change between successive iterations.
4. Terminate when Residual Sum of Squares (RSS) falls below a threshold.
Week 12 ()
1,3 and 4
Download
1,2,3 and 4
Videos ()
2 and 3
None of these
Weekly
Feedback () Yes, the answer is correct.
Score: 1
6) In the figure below, if you draw a horizontal line on y-axis for y=2. What will be the 1 point
Books () number of clusters formed?
Accepted Answers:
2
Partitional
Naive Bayes
Hierarchical
None of these
Accepted Answers:
Hierarchical
True
False
Yes, the answer is correct.
Score: 1
Accepted Answers:
True
True
False
Yes, the answer is correct.
Score: 1
Accepted Answers:
False
True
False
Yes, the answer is correct.
Score: 1
Accepted Answers:
True
X
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
pranithavoggu@gmail.com
NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)
Course
outline Week12 : Assignment 12
The due date for submitting this assignment has passed.
How does an Due on 2022-04-20, 23:59 IST.
NPTEL online As per our records you have not submitted this assignment.
course work?
() 1) Which clustering
algorithm works well when the shape of the clusters is 1 point
hyper-spherical?
Week 0 ()
K means
Agglomerative Hierarchical Clustering
Week 1 ()
Divisive Hierarchical Clustering
Week 2 ()
All of these
Accepted Answers:
Week 4 () K means
2) In decision tree, an
internal node represents 1 point
Week 5 ()
A test on an attribute
Week 6 ()
An outcome of the test
Entire sample population
Week 7 ()
Holds a class label
Week 8 () No, the answer is incorrect.
Score: 0
Week 11 ()
CART is an unsupervised learning technique
CART is a supervised technique
Week 12 ()
CART adopts a greedy approach
Both b. and c.
Hierarchical
method of No, the answer is incorrect.
clustering- II Score: 0
unit=103&lesson=106)
Accepted Answers:
Attribute DecisionTreeClassifier
selection
Measures in 5) State True or False:
Gini Index enforces the resulting tree to have multiway 1 point
CART : II (unit? splits
unit=103&lesson=107)
True
Classification
False
and Regression
Trees (CART) - No, the answer is incorrect.
Score: 0
III (unit?
unit=103&lesson=108)
Accepted Answers:
False
Important data
files (unit? 6) In a decision tree,
______ node represents the entire population 1 point
unit=103&lesson=109)
Root
Python code
Internal
files (unit?
Child
unit=103&lesson=110)
None of the above
Quiz: Week12
: Assignment No, the answer is incorrect.
Score: 0
12
(assessment? Accepted Answers:
name=159) Root
Information gain
Weekly
Gini Index
Feedback ()
Entropy
None of the above
N h i i
No, the answer is incorrect.
Text Score: 0
Transcripts () Accepted Answers:
Entropy
Books ()
8) In a decision tree diagram, ________ node holds a class label 1 point
Live sessions
Root
- Solve
Internal
sample
problems
Child
with us ()
none of the above
Accepted Answers:
Child
True
False
Accepted Answers:
False
10) State True or False: LabelEncoder() is used to normalize and transform non-numerical 1 point
labels to numerical labels
True
False
Accepted Answers:
True