KEMBAR78
Week 1: Assignment1: Assignment Submitted On 2022-01-28, 20:30 IST | PDF | Type I And Type Ii Errors | Statistical Significance
0% found this document useful (0 votes)
2K views53 pages

Week 1: Assignment1: Assignment Submitted On 2022-01-28, 20:30 IST

The document describes an NPTEL online course on data analytics with Python. It provides the outline of the course which spans 12 weeks. It also includes the solutions to an assignment from Week 1 of the course. The assignment had 10 multiple choice questions testing concepts covered in Week 1 like data types, scales of measurement, NumPy indexing etc. All questions were answered correctly.

Uploaded by

Gan Esh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views53 pages

Week 1: Assignment1: Assignment Submitted On 2022-01-28, 20:30 IST

The document describes an NPTEL online course on data analytics with Python. It provides the outline of the course which spans 12 weeks. It also includes the solutions to an assignment from Week 1 of the course. The assignment had 10 multiple choice questions testing concepts covered in Week 1 like data types, scales of measurement, NumPy indexing etc. All questions were answered correctly.

Uploaded by

Gan Esh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 53

X

(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week 1 : Assignment1
The due date for submitting this assignment has passed.
How does an Due on 2022-02-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-01-28, 20:30 IST
1) State True or false: 1 point
Week 0 () Statement: Data can be generated by machines but not by humans.

Week 1 ()
True

False
Introduction to
data analytics Yes, the answer is correct.
Score: 1

(unit?
unit=17&lesson=18) Accepted Answers:
False
Python
Fundamentals - 2) Which one of the following is not a classification of Data Analytics? 1 point
I (unit?
unit=17&lesson=19)
Diagnostic analytics

Deceptive analytics
Python
Fundamentals -
Predictive analytics
II (unit?
Prescriptive analytics
unit=17&lesson=20)
Yes, the answer is correct.
Central Score: 1

Tendency and Accepted Answers:


Dispersion - I Deceptive analytics
(unit?
unit=17&lesson=21) 3) State True or false: 1 point
Central Statement: Nominal scale is the lowest level of measurement and ratio scale is the highest level of
Tendency and measurement.
Dispersion - II
(unit?
True
unit=17&lesson=22)
False
Important Data Yes, the answer is correct.
Files (unit? Score: 1

unit=17&lesson=23) Accepted Answers:


True
Quiz: Week 1 :
Assignment1 4) Consider the following statements-  Statement A : With iloc, we can pass in the negative 1 point
(assessment? value.  Statement B : With loc, we can pass in the negative value. 
name=119)

A and B are correct 
Solution for
week 1 (unit?
Both are false 
unit=17&lesson=134)
A is correct B is false

B is correct A is false 
Week 2 ()
Yes, the answer is correct.
Score: 1

Week 3 ()
Accepted Answers:
A is correct B is false
Week 4 ()
5) For getting 4rd, 5th and 7th row of a datafile “df”in Python programming, we can write: 1 point
Week 5 ()

df.loc[[3,4,6]]  
Week 6 ()
df.loc[[4,5,7]]

df.iloc[3,4,6]
Week 7 ()

None of these

Week 8 () No, the answer is incorrect.


Score: 0

Accepted Answers:
Week 9 ()
df.loc[[3,4,6]]  

Week 10 () 6) Which of the following is not a measure of dispersion? 1 point


Skewness 
Week 11 ()

Kurtosis
Week 12 ()
Range

percentile
Download
Yes, the answer is correct.
Videos ()

Score: 1
Accepted Answers:
Weekly percentile
Feedback ()
7) State the following true or false? Statement: Bimodal Data sets contains two modes. 1 point
Text

True 
Transcripts ()

False

Y h i
Yes, the answer is correct.
Books () Score: 1
Accepted Answers:
Live sessions True 
- Solve
8) Bar Charts are used for : 1 point
sample
problems
Continuous data
with us ()

Categorical data 
 
Both of these

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
Categorical data 

9) Median is not applicable to: 1 point


Ordinal data

Interval data 

Nominal data median applicable for ordinal data and interval but
mean not applicable for both ordiinal and nominal

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
Nominal data

10) State true or false:


Statement: Arithmetic mean is not applicable for ordinal or nominal 1 point
data


True

False

Yes, the answer is correct.


Score: 1

Accepted Answers:
True
Assignment 1 Solution

Q1 B
Data can be generated by machine, humans, or their interaction
Q2 B
Deceptive analytics is not a classification of data Analytics
Q3 A
Q4 C
We cannot pass negative values as an input with loc
Q5 A
The location index in python starts from zero not 1.
Q6 D
Percentile is not a measure of dispersion.
Q7 A
Q8 B
Q9 C
Q10 A
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week2 : Assignment 2
The due date for submitting this assignment has passed.
How does an Due on 2022-02-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-01-28, 20:31 IST
1) A college plans to interview 8 students for possible offers of graduate assistant-ships. 1 point
Week 0 () The college has three assistant-ships available. How many groups of three can the college select?

Week 1 ()
126

56 8c3
Week 2 ()

136

Introduction to
130
Probability- I
Yes, the answer is correct.
(unit? Score: 1

unit=25&lesson=26) Accepted Answers:


Introduction to
56
Probability- II
2) Which of the following is each individual outcome of an experiment called? 1 point
(unit?
unit=25&lesson=27)
the sample space
Probability
a sample point
Distributions - I
an experiment
(unit?
unit=25&lesson=28)

an individual

Yes, the answer is correct.


Probability

Score: 1
Distributions - II
Accepted Answers:
(unit?
a sample point
unit=25&lesson=29)
Probability 3) Two events having nonzero probabilities 1 point
Distributions -
III (unit?
can be both mutually exclusive and independent
unit=25&lesson=30)
cannot be both mutually exclusive and independent

Quiz: Week2 :
are always mutually exclusive
Assignment 2
are always independent
(assessment?
Yes, the answer is correct.
name=125)

Score: 1
Assignment Accepted Answers:
solution week 2 cannot be both mutually exclusive and independent
(unit?
4) Ten individuals are candidates for positions of president, vice president of an 1 point
unit=25&lesson=135)
organization. How many possibilities of selections exist?
Week 3 ()

90

Week 4 ()
100
10c1*9c1

120
Week 5 ()
130

Yes, the answer is correct.


Week 6 () Score: 1

Accepted Answers:
Week 7 () 90

Week 8 () 5) A standard normal distribution has: 1 point


Mean 1 and standard distribution 0
Week 9 ()

Mean 0.5 and standard distribution 0.5
std nrml dist mean=0,std dist-1
Week 10 ()
Mean 0 and standard distribution 1

Mean 1 and standard distribution 1
Week 11 ()
Yes, the answer is correct.
Score: 1

Week 12 () Accepted Answers:


Mean 0 and standard distribution 1
Download
6) The weight of cows in a farm is normally distributed with a mean of 200 pounds and a 1 point
Videos ()
standard deviation of 25 pounds.
The probability of a cow weighing more than 241.25 pounds is:
Weekly
Feedback ()

0.4505

0.0495
Text z=x-mu/sigma
Transcripts ()
0.9505

0.9010
Books ()
No, the answer is incorrect.
Score: 0

Live sessions Accepted Answers:


- Solve 0.0495
sample
problems 7) The weight of cows in a farm is normally distributed with a mean of 200 pounds and a 1 point
with us () standard deviation of 25 pounds.
what is the probability of a cow weighing less than 250 pounds?
 

0.4772

0.9772

0.0528

0.5000

Yes, the answer is correct.


Score: 1

Accepted Answers:
0.9772

8) State True or False: 1 point


Statement: Binomial and normal Distributions are discrete probability distributions


True

False

No, the answer is incorrect.


Score: 0

Accepted Answers:
False

9) For a binomial experiment with p = 0.5 and a sample size of 100. The expected value 1 point
of this distribution is? 


0.50

0.30
np

100

50

Yes, the answer is correct.


Score: 1

Accepted Answers:
50

10) State true or false 1 point


Statement: All mutually exclusive events are independent events


True

False

Yes, the answer is correct.


Score: 1

Accepted Answers:
False
Assignment 2 Solution

Q1 B
P= 8C3 = 8x7x6! / (5!x 3!)= 56
Q2 B
Q3 B
Q4 A
P = 10C1 x 9C1 = 10x9 =90
Q5 C
Q6 B
z = x - mu / sigma = 241.25 - 200/ 25 = 1.65
from z table, the area between infinity and z score is 0.0495
Q7 B
z = x - mu / sigma = 250- 200/ 25 = 2
from z table, the area between - infinity and z score is 0.9772
Q8 B
Q9 D
Expected value = np = 0.5 x100 = 50
Q10 B
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week3 - Assignment 3
The due date for submitting this assignment has passed.
How does an Due on 2022-02-16, 23:59 IST.
NPTEL online As per our records you have not submitted this assignment.
course work?
() 1) Sate True or False: 1 point
Statement: The specific value of a random variable is called estimator
Week 0 ()

True estimate
Week 1 ()
False

No, the answer is incorrect.


Week 2 () Score: 0

Accepted Answers:
Week 3 () False

2) If the true proportion of customers who are below 20 years is P=0.35, what is the 1 point
Python Demo
probability that a sample size 100 yields a sample proportion between 0.3 to 0.4
for Distributions

(unit?

0.961
unit=32&lesson=33)

0.827
Sampling and

0.706
Sampling
Distribution
0.53
(unit?
No, the answer is incorrect.
unit=32&lesson=34) Score: 0

Distribution of
Accepted Answers:
Sample Means,
0.706
population, and
3) Stratified random sampling is a method of selecting a sample in which 1 point
variance (unit?

unit=32&lesson=35)
the sample is first divided into strata, and then random samples are taken from each stratum
Confidence
various strata are selected from the sample
interval

the population is first divided into strata, and then random samples are drawn from each
estimation:
stratum
Single
population - I
None of these alternatives is correct
(unit?
No, the answer is incorrect.
unit=32&lesson=36) Score: 0

Accepted Answers:
Confidence
the population is first divided into strata, and then random samples are drawn from each stratum
interval
estimation: 4) Sate True or False: 1 point
Single
Statement: A population is a set of all items or individual of interest
population - II
(unit?

True
unit=32&lesson=37)

False
Quiz: Week3 -
No, the answer is incorrect.
Assignment 3

Score: 0
(assessment?
Accepted Answers:
name=124)
True
Solution for
5) A question paper contains 90 multiple choice questions. There are 4 alternative 1 point
week 3 (unit?
unit=32&lesson=136)
answers (A, B, C or D) out of which only one is correct. Mr X answers these questions randomly (i.e.
without preparation). What is the probability that X gets a score of at least 10 marks?

Week 4 ()

0.9997
Week 5 ()
0.7894

0
Week 6 ()
0.001

No, the answer is incorrect.


Week 7 ()

Score: 0
Accepted Answers:
Week 8 () 0.9997

Week 9 () 6) On an average 5 % items supplied by manufacturer X are defectives. If a batch of 10 1 point


items is inspected: what is the probability that 2 items are defective

Week 10 ()

0.065

Week 11 ()
0.075

0.085
Week 12 ()
0.095

No, the answer is incorrect.


Download Score: 0

Videos () Accepted Answers:


0.075
Weekly
Feedback () 7) A car distributor in city Y experiences on an average 2.5 car sales per day. Find the 1 point
probability that on a randomly selected day, they will sell  5 car:

Text

0.0668
Transcripts ()

0.544
Books ()

0.082
Live sessions
0.205
- Solve No, the answer is incorrect.
sample Score: 0

problems Accepted Answers:


with us () 0.0668

  8) A car distributor in city Y experiences on an average 2.5 car sales per day. Find the 1 point
probability that on a randomly selected day, they will sell  no car:


0.0668

0.544

0.082

0.205
No, the answer is incorrect.
Score: 0

Accepted Answers:
0.082

9) A random sample of 100 people shows that 25 of them are females and rest are males. 1 point
Form a 95% confidence interval for the true proportion of females. The lower limit of this interval will
be:


0.150

0.145

0.165

0.175

No, the answer is incorrect.


Score: 0

Accepted Answers:
0.165

10) A random sample of 100 people shows that 25 of them are females and rest are males. 1 point
Form a 95% confidence interval for the true proportion of females. The upper limit of this interval will
be:


0.150

0.165

0.465

0.335

No, the answer is incorrect.


Score: 0

Accepted Answers:
0.335
Assignment 3 solution

A1 B
The specific value of a random variable is called estimate
A2 C

A3 C
A4 A

A5 A

A6 B

A7 A
A8 C
A9 C

A10 D
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week4 : Assignment 4
The due date for submitting this assignment has passed.
How does an Due on 2022-02-23, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-02-23, 22:21 IST
1) If we have a sample size of 20 and population standard deviation is known, we will use: 1 point
Week 0 ()

t- test for hypothesis testing
Week 1 ()
z-test for hypothesis testing

both t and z test
Week 2 ()

F-test

Week 3 () Yes, the answer is correct.


Score: 1

Week 4 () Accepted Answers:


z-test for hypothesis testing
Hypothesis
2) Null hypothesis, Ho: mu1- mu2 = 0 is a: 1 point
Testing- I (unit?
unit=39&lesson=40)

Upper tail test
Hypothesis
Lower tail test
Testing- II

Two tail test
(unit?
unit=39&lesson=41)
F-test

Yes, the answer is correct.


Hypothesis

Score: 1
Testing- III
Accepted Answers:
(unit?
Two tail test
unit=39&lesson=42)
Errors in 3) The quality-control manager at a Li-BATTERY factory needs to determine whether the 1 point
Hypothesis mean life of a large shipment of Li-Battery is equal to the specified value of 375 hours. The process
Testing (unit? standard deviation is known to be 100 hours. A random sample of 64 batteries indicates a sample
unit=39&lesson=43) mean life of 350 hours. 
State the null and alternative hypotheses
Hypothesis
Testing: Two

Mu = 375
sample test- I
(unit?
Mu ≤ 375
unit=39&lesson=44)
Mu = 350
Important Data
Mu ≥ 350
Sets (unit?
Yes, the answer is correct.
unit=39&lesson=45) Score: 1

Quiz: Week4 : Accepted Answers:


Assignment 4 Mu = 375
(assessment?
4) The quality-control manager at a Li-BATTERY factory needs to determine whether the 1 point
name=123)
mean life of a large shipment of Li-Battery is equal to the specified value of 375 hours. The process
Solution For standard deviation is known to be 100 hours. A random sample of 64 batteries indicates a sample
Week 4 (unit? mean life of 350 hours. 
unit=39&lesson=146) At the alpha = 0.05 level of significance is there any evidence that the mean life is different from 375
hours?
Week 5 ()

Yes, there is
Week 6 ()

No, there is not

None of these
Week 7 ()
No, the answer is incorrect.
Score: 0

Week 8 ()
Accepted Answers:
Yes, there is
Week 9 ()
5) For one-tailed test, the test statistic z is determined to be zero. The p-value for this test 1 point
Week 10 () is:

Week 11 ()
Zero

-0.5
Week 12 ()
+0.5

1.0
Download
Videos () Yes, the answer is correct.
Score: 1

Accepted Answers:
Weekly
+0.5
Feedback ()
6) The error of rejecting a true null hypothesis is: 1 point
Text
Transcripts ()
a Type I error

a Type II error
Books ()
is the same as Beta

committed when not enough information is available
Y h i
Yes, the answer is correct.
Live sessions Score: 1
- Solve Accepted Answers:
sample a Type I error
problems
7) The mean cost of a hotel room in a city is said to be $168 per night. A random sample 1 point
with us ()
of 25 hotels resulted in X-bar = $172.50 and sample standard deviation s = 15.40. Calculate the t
  statistic.


2

-2

1.46

-1.46

Yes, the answer is correct.


Score: 1

Accepted Answers:
1.46

8) In hypothesis testing if the null hypothesis is rejected, 1 point


no conclusions can be drawn from the test

the alternative hypothesis is true

the data must have been accumulated incorrectly

the sample size has been too small

Yes, the answer is correct.


Score: 1

Accepted Answers:
the alternative hypothesis is true

9) In the hypothesis testing procedure, alpha is 1 point


1 - the level of significance

the critical value

the confidence level

the level of significance
Yes, the answer is correct.
Score: 1

Accepted Answers:
the level of significance

10) If a hypothesis is rejected at the 5% level of significance, it 1 point


will always be rejected at the 1% level

will always be accepted at the 1% level

will never be tested at the 1% level

May be rejected or not rejected at the 1% level

No, the answer is incorrect.


Score: 0

A dA
Accepted Answers:
May be rejected or not rejected at the 1% level
Assignment 4 Solution

A1 B
A2 C
A3 A

A4 A
A5 C
A6 A
A7 C

A8 B
A9 D
A10 D
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week5 : Asignment 5
The due date for submitting this assignment has passed.
How does an Due on 2022-03-02, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-01, 20:06 IST
1) In the analysis of variance procedure (ANOVA) the term "factor" refers to: 1 point
Week 0 ()

the dependent variable
Week 1 ()
the independent variable

different levels of a treatment
Week 2 ()

the critical value of F

Week 3 () Yes, the answer is correct.


Score: 1

Week 4 () Accepted Answers:


the independent variable

Week 5 () 2) In a problem of ANOVA, involving 3 treatments and 10 observations per treatment, 1 point
SSE = 500. The MSE for this situation is:
Hypothesis
Testing: Two
130.2
sample test- II

48.8
(unit?

18.52 MSE=SSE/DOF
unit=47&lesson=48)

30.0
Hypothesis
Testing: Two Yes, the answer is correct.
sample test- III Score: 1

(unit? Accepted Answers:


unit=47&lesson=49) 18.52
ANOVA - I 3) The ‘F’ ratio in a completely randomized ANOVA is the ratio of 1 point
(unit?
unit=47&lesson=50)
MST/MSE

MSTR/MSE
ANOVA - II
(unit?
MSE/MSTR
unit=47&lesson=51)
MSE/MST

Post Hoc Yes, the answer is correct.


Score: 1

Analysis(Tukey’s
test) (unit? Accepted Answers:
unit=47&lesson=52) MSTR/MSE

Important Data 4) An ANOVA procedure is applied to data obtained from 7 samples where each sample 1 point
files (unit? contains 10 observations. The degrees of freedom for the critical value of F are:
unit=47&lesson=53)

7 numerator and 20 denominator degrees of freedom
Quiz: Week5 : NU = 6-1
Asignment 5
5 numerator and 20 denominator degrees of freedom DU=70-7
(assessment?
6 numerator and 63 denominator degrees of freedom
name=121)

7 numerator and 63 denominator degrees of freedom
Solution For Yes, the answer is correct.
Week 5 (unit? Score: 1

unit=47&lesson=147) Accepted Answers:


6 numerator and 63 denominator degrees of freedom
Week 6 ()
5) In an ANOVA problem if SST = 200 and SSTR = 80, then SSE is 1 point
Week 7 ()

280

Week 8 ()
120
SSE=SST-SSTR

80
Week 9 ()
220

Yes, the answer is correct.


Week 10 () Score: 1

Accepted Answers:
Week 11 () 120

Week 12 () 6) The critical F value with 8 numerator and 29 denominator degrees of freedom at alpha 1 point
= 0.01 is

Download

2.18
Videos ()

3.20

Weekly
3.53
Feedback ()
3.94

Yes, the answer is correct.


Text Score: 1

Transcripts () Accepted Answers:


3.20
Books ()
7) Two Independent simple random samples are taken to test the difference between the 1 point
means of two populations. The standard deviations are not known, but are assumed to be equal.
The sample sizes are n1 = 15 and n2 = 35. The correct distribution to use is the:
Live sessions
- Solve
t distribution with 51 degrees of freedom
sample

z distribution with 50 degrees of freedom
problems
with us ()
z distribution with 49 degrees of freedom N1+N2-2

t distribution with 48 degrees of freedom
 
Yes, the answer is correct.
Score: 1

Accepted Answers:
t distribution with 48 degrees of freedom

8) Stare true or false: 1 point


Statement: The sampling distribution of two populations P bar1 -P bar2   is approximated by a
normal distribution


True

False

Yes, the answer is correct.


Score: 1

Accepted Answers:
True

9) Mean marks obtained by male and female students of school ABCD in first unit test are 1 point
shown as below.
Male Female
Sample Size 64 36
Sample Mean Marks 44 41
Population Variance 128 72

The standard error for the difference between the two means is


4.0

7.46

4.24

2.0
Yes, the answer is correct.
Score: 1

Accepted Answers:
2.0

10) Mean marks obtained by male and female students of school ABCD in first unit test are 1 point
shown as below.
Male Female
Sample Size 64 36
Sample Mean Marks 44 41
Population Variance 128 72

If you are interested in testing whether or not the average marks of males is significantly greater
than that of females, the test statistic is:


2.0

1.5

1.96

1.645

Yes, the answer is correct.


Score: 1

Accepted Answers:
1.5
Assignment 5 Solution

ANSWER KEY
A1 B
A2 C
MSE = SSE/DOF =500/(30-3) = 18.52
A3 B
A4 C
NUMERATOR DOF = C-1 =6
DENOMINATOR DOF =N-C = 70 - 7 = 63
A5 B
SSE = SST-SSTR = 200 – 80 = 120
A6 B (USE F TABLE)
A7 D
DOF for two sample t test = n1+n2 -2 = 15 +35 -2 = 48
A8 A
Only z test is possible in case of two proportions.
A9 D
SE = sigma/√n = √((s12 / n1)+( s22 / n2)) = √(2+2) = 2

A10 B
t = (mean 1 – mean2)/ SE = 3/2 = 1.5
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week6 : Assignment 6
The due date for submitting this assignment has passed.
How does an Due on 2022-03-09, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-08, 19:28 IST
1) Sate True or False: 1 point
Week 0 () Statement: In regression analysis the error term is normally distributed

Week 1 ()
True

False
Week 2 ()
Yes, the answer is correct.
Score: 1

Week 3 () Accepted Answers:


True
Week 4 ()
2) In model developed from sample data having the form of "yhat = b0 +b1 x" is known as 1 point
Week 5 ()

regression equation

correlation equation
Week 6 () NO ERROR TERM

estimated regression equation
Randomize
regression model
block design
(RBD) (unit? Yes, the answer is correct.
Score: 1

unit=55&lesson=56)
Accepted Answers:
Two Way estimated regression equation
ANOVA (unit?
unit=55&lesson=57) 3) State True or False:
Statement: A completely randomized design (CRD) is useful when 1 point
the experimental units are heterogeneous
Linear
True
Regression - I

False
(unit?
unit=55&lesson=58) Yes, the answer is correct.
Score: 1

Linear Accepted Answers:


Regression - II False
(unit?
unit=55&lesson=59) 4) In a regression and correlation analysis if r2 = 1, then 1 point

Linear
SSE = SST
Regression - III

SSE = 1
(unit?
unit=55&lesson=60)
SSR = SSE

SSR = SST
Important Data
files (unit? Yes, the answer is correct.
unit=55&lesson=61) Score: 1

Accepted Answers:
Quiz: Week6 : SSR = SST
Assignment 6
(assessment? 5) In a completely randomized design, a random sample of Salesmen would   be 1 point
name=122) assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
Solution for
week 6 (unit? only moderate or even low surge to another. A study measuring the efficiency of the salesmen
unit=55&lesson=150) resulted in proposals for modification and redesign of the salesmen’s work schedule. After
consideration of several schedules for the work, three specific alternatives are selected as having
Week 7 () the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
alternatives differ in terms of their effect on the efficiency of the salesmen?

Week 8 () Salesman Schedule1 Schedule2 Schedule3


1 75 76  78

Week 9 () 2 74 74 74

3 70 71 75

Week 10 () 4 73 72 77

5 76 73 76

Week 11 () 6 73 73 73

After performing one way ANOVA on the above problem we will:


Week 12 ()

Accept the null hypothesis
Download
Reject the null hypothesis
Videos ()
Can’t state any conclusion

None of these
Weekly
Feedback () Yes, the answer is correct.
Score: 1

Accepted Answers:
Text
Accept the null hypothesis
Transcripts ()
6) In a completely randomized design, a random sample of Salesmen would   be 1 point
Books () assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
only moderate or even low surge to another. A study measuring the efficiency of the salesmen
resulted in proposals for modification and redesign of the salesmen’s work schedule. After
Live sessions
consideration of several schedules for the work, three specific alternatives are selected as having
- Solve
the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
sample
alternatives differ in terms of their effect on the efficiency of the salesmen?
problems

with us ()

Salesman Schedule1  Schedule2  Schedule3


  1 75 76  78

2 74 74 74

3 70 71 75

4 73 72 77

5 76 73 76

6 73 73 73

After performing one way RBD on this problem we will



Accept the null hypothesis

Reject the null hypothesis

Can’t state any conclusion

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
Reject the null hypothesis

7) In a completely randomized design, a random sample of Salesmen would   be 1 point


assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
only moderate or even low surge to another. A study measuring the efficiency of the salesmen
resulted in proposals for modification and redesign of the salesmen’s work schedule. After
consideration of several schedules for the work, three specific alternatives are selected as having
the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
alternatives differ in terms of their effect on the efficiency of the salesmen?

Salesman Schedule1  Schedule2  Schedule3


1 75 76  78

2 74 74 74

3 70 71 75

4 73 72 77

5 76 73 76

6 73 73 73

The value of MSE when this problem is solved by ANOVA is:



1.955

9.555

6.855

3.588

Yes, the answer is correct.


Score: 1

Accepted Answers:
3.588

8) In a completely randomized design, a random sample of Salesmen would   be 1 point


assigned to each shop alternatively. However, salesmen are believed to differ substantially in their
ability to handle number of customers. What is high surge of customers to one salesman might be
only moderate or even low surge to another. A study measuring the efficiency of the salesmen
resulted in proposals for modification and redesign of the salesmen’s work schedule. After
consideration of several schedules for the work, three specific alternatives are selected as having
the best potential for increasing the efficiency of the salesmen. Check to what extent does the three
alternatives differ in terms of their effect on the efficiency of the salesmen?

Salesman Schedule1 Schedule2 Schedule3


1 75 76 78

2 74 74 74

3 70 71 75
4 73 72
77

5 76 73 76

6 73 73 73

The value of MSE when this problem is solved by RBD is:



1.955

9.555

6.855

3.588

Yes, the answer is correct.


Score: 1

Accepted Answers:
1.955

9) State True of false 1 point


Statement: The variance of error, is same for all values of the independent variable 


True

False

Yes, the answer is correct.


Score: 1
Accepted Answers:
True

10) SSE can never be 1 point


larger than SST

smaller than SST

equal to 1

equal to zero

Yes, the answer is correct.


Score: 1

Accepted Answers:
larger than SST
Assignment 6 Solution

A1 A
In regression analysis the error term is normally distributed
A2 C
Since the error term is not present it is an estimated regression equation
A3 B
A4 D
If r square is 1, it means there is no error term
A5 A
Solution of Problem 5 to 8

A6 B
A7 D
A8 A
A9 A
A10 A
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week7 : Assignment 7
The due date for submitting this assignment has passed.
How does an Due on 2022-03-16, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-16, 19:09 IST
1) Which of the following
is not an assumption for simple linear regression? 1 point
Week 0 ()

Normally distributed variables
Week 1 ()
Multicollinearity

Linear relationship
Week 2 ()

Normally distributed residuals

Week 3 () No, the answer is incorrect.


Score: 0

Accepted Answers:
Week 4 ()
Multicollinearity

Week 5 () 2) Which of the following


is called Standard Error? 1 point


T-statistic squared
Week 6 ()

Square root of SSE
Week 7 ()
Square root of SST

Square root of MSE
Estimation,
Prediction of Yes, the answer is correct.
Score: 1

Regression
Model Residual Accepted Answers:
Analysis (unit? Square root of MSE
unit=63&lesson=64)
3) Which of the following
is true about multiple regression model? 1 point
Estimation,
It has only one independent variable
Prediction of

It has more than one dependent variable
Regression
Model Residual
It has more than one independent variable
Analysis - II
It has at least 2 dependent variables
(unit?
Yes, the answer is correct.
unit=63&lesson=65)

Score: 1
MULTIPLE Accepted Answers:
REGRESSION It has more than one independent variable
MODEL - I
4) In a multiple regression
model, the error term ɛ is assumed to 1 point
(unit?
unit=63&lesson=66)

Have a mean of 1
MULTIPLE
Have a variance of 0
REGRESSION

Have a standard deviation of 1
MODEL-II
(unit?
Be normally distributed
unit=63&lesson=67) Yes, the answer is correct.
Score: 1

Categorical
Accepted Answers:
variable
Be normally distributed
regression
(unit? 5) For a multiple
regression model with 2 independent variables, R.sq = 0.9041 point
unit=63&lesson=68) and adjusted R. sq
= 0.88, determine the number of observations (n)
Important data

6
Files (unit?
unit=63&lesson=69)
7

9
Quiz: Week7 :
Assignment 7
10
(assessment? No, the answer is incorrect.
name=133) Score: 0

Accepted Answers:
Assignment
10
solution - Week
7 (unit? 6) If the R.sq value is small
for a model with a large number of independent 1 point
unit=63&lesson=153) variables, the adjusted
coefficient of determination _______________
Week 8 ()
Can be positive

Can be negative
Week 9 ()

is 0

Week 10 ()
Can't say

Yes, the answer is correct.


Week 11 () Score: 1

Accepted Answers:
Week 12 () Can be negative

7) Which one of the


statements is true regarding residuals in regression 1 point
Download analysis?
Videos ()

Mean of residuals is always 0
Weekly
Mean of residuals is always < 0
Feedback ()
Mean of residuals is always > 0

There is no such rule for residuals
Text
Yes, the answer is correct.
Transcripts () Score: 1

Accepted Answers:
Books () Mean of residuals is always 0

Live sessions 8) In a simple linear


regression model (one independent variable), if we 1 point
- Solve
change the input variable by
1 unit, how much will the output variable
change?
sample
problems
By 1
with us ()

No change
 
By its slope

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
By its slope

9) To check whether a
significant relationship exists between the dependent 1 point
and set of all
independent variables, _____ is used. It is the test for overall
significance.


F-test

R.sq test

a correlation test

t-test

Yes, the answer is correct.


Score: 1

Accepted Answers:
F-test

10) Which of the following


evaluation metrics is used to evaluate a regression 1 point
model?


AUC-ROC

Accuracy

Logloss

Mean-Squared-Error

Yes, the answer is correct.


Score: 1

Accepted Answers:
Mean-Squared-Error
NPTEL – Data Analytics with Python

Week 7 – Assignment Solution

1. Multicollinearity is not an assumption for simple linear regression


2. Sq. root of MSE is called Standard Error
3. A multiple regression model has more than one independent variable
4. The error term is assumed to be normally distributed in multiple regression model
5. The formula for adj. R.sq is as follows:

6. The adjusted Coefficient of Determination can be negative


7. The mean of residuals in regression analysis is always = 0
8. Y = mx + b. Unit variation in x will cause y to change by its slope (dy/dx = m)
9. F-test is used to check significant relationship between dependent and independent
variables
10. Mean Squared Error is used as evaluation metric for regression model
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week8 : Assignment 8
The due date for submitting this assignment has passed.
How does an Due on 2022-03-23, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-23, 11:22 IST
1) For categorical data
with ‘n’ categories, the number of dummy variables 1 point
Week 0 () will be________

Week 1 ()
n

n - 1
Week 2 ()
n + 1

2n
Week 3 ()
Yes, the answer is correct.
Score: 1

Week 4 ()
Accepted Answers:
n-1
Week 5 ()
2) In the estimation of
regression parameters  1 point
Week 6 ()

The likelihood function
is a function of only 𝜎

Week 7 ()
The values of 𝛽0..𝛽n
and 𝜎 should be such that, they maximize the likelihood
function.
Week 8 ()
Both a. and b.

None of these
Maximum
Likelihood No, the answer is incorrect.
Score: 0

Estimation- I
(unit?
Accepted Answers:
unit=71&lesson=72)
Maximum
The values of 𝛽0..𝛽n
and 𝜎 should be such that, they maximize the likelihood
Likelihood
function.
Estimation-II 3) In logistic regression,
the null hypothesis tested is: 1 point
(unit?
unit=71&lesson=73)
H0: β = 0

LOGISTIC

H0: β ≠ 0

REGRESSION-
H0: μ = 0
I (unit?

H0: μ ≠ 0
unit=71&lesson=74)
Yes, the answer is correct.
LOGISTIC Score: 1

REGRESSION- Accepted Answers:


II (unit? H0: β = 0
unit=71&lesson=75)
4) In logistic regression, 1 point
Linear
Regression
The graph doesn’t follow S shape curve
Model Vs

The dependent variable is categorical
Logistic
Regression
The estimated value of the dependent variable is not probability
Model (unit?
None of these
unit=71&lesson=76)
Yes, the answer is correct.
Important data Score: 1

files (unit? Accepted Answers:


unit=71&lesson=77) The dependent variable is categorical

Quiz: Week8 : 5) State true or false: G


statistic is used to check the individual significance of 1 point
Assignment 8 the independent
variables
(assessment?
name=141)
True

False
Assignment
solution - Week Yes, the answer is correct.
8 (unit? Score: 1

unit=71&lesson=154) Accepted Answers:


False
Week 9 ()
6) Choose the correct
statement 1 point

Week 10 ()
In logistic regression, the dependent variable must be continuous data

In logistic regression, the dependent variable must be categorical data
Week 11 ()

In logistic regression, both dependent and independent variables must be categorical data

Week 12 ()
None of these

No, the answer is incorrect.


Download Score: 0

Videos () Accepted Answers:


In logistic regression, the dependent variable must be categorical data
Weekly
7) State True or False: The
Method of Least Squares can be applied to 1 point
Feedback () models with any probability
distribution.


True
Text
False
Transcripts () Yes, the answer is correct.
Score: 1

Books () Accepted Answers:


False
Live sessions 8) Suppose you have been
given a fair coin and you want to find out the odds 1 point
- Solve of getting heads. Which of
the following option is true for such a case?
sample
problems
Odds will be 0
with us ()
Odds will be 0.5

 
Odds will be 1

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
Odds will be 1

9) Large values of the


log-likelihood statistic indicate: 1 point


That there are a greater number of explained vs. unexplained observations.

That the statistical model fits the data well.

That as the predictor variable increases, the likelihood of the outcome occurring decreases.

That the statistical model is a poor fit of the data
No, the answer is incorrect.
Score: 0

Accepted Answers:
That the statistical model is a poor fit of the data

10) The logit function(given


as l(x)) is the log of odds function. What could be 1 point
the range of logit function
in the domain x=[0,1]?


(– ∞ , ∞)

(0,1)

(0 , ∞)

(- ∞, 0 )

Yes, the answer is correct.


Score: 1

Accepted Answers:
(– ∞ , ∞)
NPTEL: Data Analytics with Python
Week 8 – Assignment solutions

1. The number of dummy variables will be n -1


2. We wish to maximize the likelihood function when estimating regression parameters
3. The null hypothesis is H0: β = 0
4. The dependent variable is categorical
5. G-statistic is not used to check individual significance of independent variables
6. In logistic regression, the dependent variable must be categorical
7. The method of least squares cannot be used for models with any probability
distribution
8. Odds are defined as the ratio of the probability of success and the probability of
failure. So in case of fair coin probability of success is 1/2 and the probability of
failure is 1/2 so odd would be 1
9. Large values of log-likelihood indicate poor fit of the data
10. The range of logit function for given domain is (– ∞ , ∞)
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week9 : Assignment 9
The due date for submitting this assignment has passed.
How does an Due on 2022-03-30, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-03-30, 11:46 IST
1) State true or
false:  Statement: there is no difference
between,  y = β0 + 1 point
Week 0 () β1x + 𝜖
and  E(y) = β0 + β1x , both are
regression equations

Week 1 ()
True

False
Week 2 () No, the answer is incorrect.
Score: 0

Week 3 () Accepted Answers:


False
Week 4 ()
2) Which of the following
statements is correct 1 point

Week 5 ()
Sensitivity in ROC analysis is called True Positive Rate(tpr)

Specificity in ROC analysis is not called True Negative Rate (tnr)
Week 6 ()

Specificity in ROC analysis is called True Positive Rate(tpr) 

Week 7 ()
Sensitivity in ROC analysis is called True Negative Rate (tnr) 

Yes, the answer is correct.


Score: 1

Week 8 ()
Accepted Answers:
Week 9 () Sensitivity in ROC analysis is called True Positive Rate(tpr)

3) In ROC analysis when the


Threshold value is Higher: 1 point
Confusion
matrix and
Specificity decreases
ROC- I (unit?
Sensitivity decreases
unit=79&lesson=80)

Both a. and b.
Confusion

None of these
Matrix and
ROC-II (unit? Yes, the answer is correct.
Score: 1

unit=79&lesson=81)
Accepted Answers:
Performance of Sensitivity decreases
Logistic Model-
III (unit? 4) Sensitivity in ROC
analysis is defined as: 1 point
unit=79&lesson=82)

FP / (FP+TN)
Regression

FN/(TP+FN)
Analysis Model
Building - I
TN / (TN+FP) 
(unit?
TP / (TP+FN) 
unit=79&lesson=83)
Yes, the answer is correct.
Score: 1

Regression
Analysis Model Accepted Answers:
Building TP / (TP+FN) 
(Interaction)- II
5) In ROC analysis, a
classifier is called ‘good’ if it has ______ 1 point
(unit?
unit=79&lesson=84)

Low TPR and Low FPR
Important data
Low TPR and High FPR
files (unit?

High TPR and Low FPR
unit=79&lesson=85)

High TPR and High FPR
Quiz: Week9 :
Yes, the answer is correct.
Assignment 9

Score: 1
(assessment?
Accepted Answers:
name=142)
High TPR and Low FPR
Week 10 () 6) For the given confusion
matrix, compute the sensitivity 1 point

Week 11 ()                             True Positive     True Negative


Predicted Positive                 8                      3
Predicted Negative         2                      7
Week 12 ()

0.73
Download
0.7
Videos ()
0.78

0.8
Weekly
Feedback () Yes, the answer is correct.
Score: 1

Accepted Answers:
Text
0.8
Transcripts ()
7) State true or False:
Precision is inversely proportional to sensitivity 1 point
Books ()

True

False

Y h i
Yes, the answer is correct.
Live sessions Score: 1
- Solve Accepted Answers:
sample False
problems
8) State True or False:
Standardization of features is not required before 1 point
with us ()
training a Logistic
regression model
 

True

False

Yes, the answer is correct.


Score: 1

Accepted Answers:
True

9) Which of the following


option is true? 1 point


Linear Regression errors values have to be normally distributed but in the case of Logistic
Regression it is not the case

Logistic Regression errors values have to be normally distributed but in the case of Linear
Regression it is not the case

Both Linear Regression
and Logistic Regression error values have to be
normally distributed

Both Linear Regression
and Logistic Regression error values have not to be
normally distributed
Yes, the answer is correct.
Score: 1

Accepted Answers:
Linear Regression errors values have to be normally distributed but in the case of Logistic
Regression it is not the case

10) In binary logistic


regression, 1 point


The dependent variable is continuous

The dependent variable is divided into two equal subcategories

The dependent variable consists of two categories

There is no dependent variable

Yes, the answer is correct.


Score: 1
Accepted Answers:
The dependent variable consists of two categories
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week10 : Assignment 10
The due date for submitting this assignment has passed.
How does an Due on 2022-04-06, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-04-06, 11:51 IST
1) Sampling distribution
for the goodness of fit test is the  1 point
Week 0 ()

Poisson distribution
Week 1 ()
t distribution 

normal distribution 
Week 2 ()

chi-square distribution

Week 3 () Yes, the answer is correct.


Score: 1

Accepted Answers:
Week 4 ()
chi-square distribution

Week 5 () 2) The goodness of fit test


is always conducted as a  1 point


lower-tail test 
Week 6 ()

upper-tail test
Week 7 ()
middle test 

None of these 
Week 8 ()
Yes, the answer is correct.
Score: 1

Week 9 () Accepted Answers:


upper-tail test
Week 10 ()
Chi - Square 3) State True or False:
Statement: Null hypothesis for the chi-square test of 1 point
Test of independence assumes that
all the proportions are equal. 
Independence -

True
I (unit?
unit=87&lesson=88)
False

Chi-Square Yes, the answer is correct.


Score: 1

Test of
Independence - Accepted Answers:
II (unit? True
unit=87&lesson=89)
4) Statistical test
conducted to determine whether to reject or not reject a 1 point
Chi-Square hypothesized
probability distribution for a population is known as a
Goodness of Fit ________
Test (unit?

contingency test 
unit=87&lesson=90)


probability test 
Cluster

goodness of fit test 
analysis:
Introduction- I
None of these 
(unit?
No, the answer is incorrect.
unit=87&lesson=91) Score: 0

Clustering Accepted Answers:


analysis: part II goodness of fit test 
(unit? 5) What is the minimum no.
of variables/ features required to perform 1 point
unit=87&lesson=92) clustering?
Important data

0
files (unit?
unit=87&lesson=93)
1

2
Quiz: Week10
: Assignment
3
10
Yes, the answer is correct.
(assessment? Score: 1

name=143) Accepted Answers:


1
Week 11 ()
6) State True or False: The chi-square test of independence is used to analyze the 1 point
Week 12 () frequencies of two variables with multiple categories and determine whether they are independent

Download
True
Videos ()
False

Yes, the answer is correct.


Weekly Score: 1

Feedback () Accepted Answers:


True
Text
7) State True or False:
Minkowski distance is a generalization of both 1 point
Transcripts ()
Euclidean and Manhattan metrics
Books ()
True

False
Y h i
Yes, the answer is correct.
Live sessions Score: 1
- Solve Accepted Answers:
sample True
problems
8) In order to perform the
chi-square test to check independence in python 1 point
with us ()
using scipy, we import
_________ 
 

chi2_test

chi_square_independence

chi2_contingency

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
chi2_contingency

9) Let x1 = (1,2) and x2 =


(3,5) be the co-ordinates for two objects. The 1 point
Euclidean and Manhattan distance
between these two objects is
__________ respectively


4.2 and 3

3.15 and 2

3.61 and 5

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
3.61 and 5

10) State true or false:


Discriminant Analysis does not require the grouping 1 point
variable to be known at the
beginning


True

False

Yes, the answer is correct.


Score: 1

Accepted Answers:
False
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week11 : Assignment 11
The due date for submitting this assignment has passed.
How does an Due on 2022-04-13, 23:59 IST.
NPTEL online
course work?
() Assignment submitted on
2022-04-13, 21:19 IST
1) ________ is used for
calculating distance measures in clustering using 1 point
Week 0 () python

Week 1 ()
distance_matrix

spatial_matrix
Week 2 ()
scipy_matrix

distance.matrix
Week 3 ()
Yes, the answer is correct.
Score: 1

Week 4 ()
Accepted Answers:
distance_matrix
Week 5 ()
2) The formula for dissimilarity computation between two objects for categorical variables 1 point
Week 6 () is – 

Here p is a categorical variable and m denotes the number of matches.


Week 7 ()

D(i,j) = p-m / p
Week 8 ()
D(i,j) = p-m / m

D(i,j) = m-p / p
Week 9 ()

D(i,j) = m-p / m

Week 10 () Yes, the answer is correct.


Score: 1

A dA
Accepted Answers:
Week 11 () D(i,j) = p-m / p

Clustering 3) Select the correct option for a data set with 7 objects and an interval-scaled variable ‘f’ 1 point
analysis: Part we have the following measurements: 
III (unit? f = (1, 2, 3, 4, 5, 8, 50) containing one outlying value.
unit=95&lesson=96)

Std deviation (std_f) and mean absolute deviation (s_f) are equally affected
Cluster
analysis: Part
Mean absolute deviation (s_f) is more affected by the outlier
IV (unit?
Std deviation (std_f) is more affected by the outlier
unit=95&lesson=97)

None of these
Cluster
Yes, the answer is correct.
analysis: Part V Score: 1

(unit?
Accepted Answers:
unit=95&lesson=98)
Std deviation (std_f) is more affected by the outlier
K- Means
4) Which of the following is true for K-means clustering? 1 point
Clustering
(unit?
It comes under the partitioning method
unit=95&lesson=99)

The number of clusters is predefined for this method
Hierarchical

Cluster similarity is measure in regard to the mean value of the objects in a cluster
method of
clustering -I
All of the above
(unit? Yes, the answer is correct.
unit=95&lesson=100) Score: 1

Accepted Answers:
Important data
All of the above
files (unit?
unit=95&lesson=101) 5) Which of the following can act as possible termination conditions in K-Means? 1 point
Quiz: Week11 :
1. For a fixed number of iterations.
Assignment 11 2. Assignment of observations to clusters does not change between iterations. Except for cases
(assessment? with a bad local minimum.
name=144) 3. Centroids do not change between successive iterations.
4. Terminate when Residual Sum of Squares (RSS) falls below a threshold.
Week 12 ()

1,3 and 4
Download
1,2,3 and 4
Videos ()

2 and 3

None of these
Weekly
Feedback () Yes, the answer is correct.
Score: 1

Text Accepted Answers:


Transcripts () 1,2,3 and 4

6) In the figure below, if you draw a horizontal line on y-axis for y=2. What will be the 1 point
Books () number of clusters formed?

Live sessions (Image link: https://drive.google.com/file/d/1uLZZgkWh7SwYlPg9S5PSwHc3kqGAlJbI/view)


- Solve
sample
1
problems
2
with us ()

3
 
4

Yes, the answer is correct.


Score: 1

Accepted Answers:
2

7) Which of the following clustering requires merging approach? 1 point


Partitional

Naive Bayes

Hierarchical

None of these

Yes, the answer is correct.


Score: 1

Accepted Answers:
Hierarchical

8) State True or False:


Hierarchical clustering should primarily be used for 1 point
exploration


True

False
Yes, the answer is correct.
Score: 1

Accepted Answers:
True

9) State True or False: For


finding dissimilarity between two clusters in 1 point
hierarchical clustering,
average-link is the only metric used


True

False
Yes, the answer is correct.
Score: 1

Accepted Answers:
False

10) Hierarchical clustering


can either be an agglomerative or divisive algorithm 1 point


True

False
Yes, the answer is correct.
Score: 1

Accepted Answers:
True
X


(https://swayam.gov.in)

(https://swayam.gov.in/nc_details/NPTEL)

pranithavoggu@gmail.com 

NPTEL (https://swayam.gov.in/explorer?ncCode=NPTEL)
»
Data Analytics with Python (course)

Course
outline Week12 : Assignment 12
The due date for submitting this assignment has passed.
How does an Due on 2022-04-20, 23:59 IST.
NPTEL online As per our records you have not submitted this assignment.
course work?
() 1) Which clustering
algorithm works well when the shape of the clusters is 1 point
hyper-spherical?
Week 0 ()
K means

Agglomerative Hierarchical Clustering
Week 1 ()

Divisive Hierarchical Clustering
Week 2 ()
All of these

No, the answer is incorrect.


Week 3 () Score: 0

Accepted Answers:
Week 4 () K means

2) In decision tree, an
internal node represents  1 point
Week 5 ()

A test on an attribute
Week 6 ()
An outcome of the test

Entire sample population
Week 7 ()

Holds a class label
Week 8 () No, the answer is incorrect.
Score: 0

Week 9 () Accepted Answers:


A test on an attribute

Week 10 () 3) Choose the correct


statement about the CART model  1 point

Week 11 ()
CART is an unsupervised learning technique

CART is a supervised technique
Week 12 ()
CART adopts a greedy approach

Both b. and c.
Hierarchical
method of No, the answer is incorrect.
clustering- II Score: 0

(unit? Accepted Answers:


unit=103&lesson=104) Both b. and c.

Classification 4) ______ is used to build


the decision tree model in python-   1 point
and Regression
Trees (CART :
Decision tree classifier
I) (unit?
DecisionTreeClassifier
unit=103&lesson=105)

Decision_tree_classifier
Measures of
Decision_tree_model
attribute
selection (unit? No, the answer is incorrect.
Score: 0

unit=103&lesson=106)
Accepted Answers:
Attribute DecisionTreeClassifier
selection
Measures in 5) State True or False:
Gini Index enforces the resulting tree to have multiway 1 point
CART : II (unit? splits
unit=103&lesson=107)

True
Classification
False
and Regression
Trees (CART) - No, the answer is incorrect.
Score: 0

III (unit?
unit=103&lesson=108)
Accepted Answers:
False
Important data
files (unit? 6) In a decision tree,
______ node represents the entire population 1 point
unit=103&lesson=109)

Root
Python code
Internal
files (unit?

Child
unit=103&lesson=110)

None of the above
Quiz: Week12
: Assignment No, the answer is incorrect.
Score: 0

12
(assessment? Accepted Answers:
name=159) Root

7) _______is the measure of uncertainty of a random variable, it characterizes the 1 point


Download
impurity of an arbitrary collection of examples.
Videos ()


Information gain
Weekly

Gini Index
Feedback ()

Entropy

None of the above
N h i i
No, the answer is incorrect.
Text Score: 0
Transcripts () Accepted Answers:
Entropy
Books ()
8) In a decision tree diagram, ________ node holds a class label 1 point

Live sessions
Root
- Solve

Internal
sample
problems
Child
with us ()
none of the above

  No, the answer is incorrect.


Score: 0

Accepted Answers:
Child

9) State True or False: In


the pre-pruning approach, sub-trees are removed 1 point
from a fully-grown decision
tree


True

False

No, the answer is incorrect.


Score: 0

Accepted Answers:
False

10) State True or False: LabelEncoder() is used to normalize and transform non-numerical 1 point
labels to numerical labels


True

False

No, the answer is incorrect.


Score: 0

Accepted Answers:
True

You might also like