Total No. of Questions : 10] SEAT No.
8
23
P3975 [5561]-679
[Total No. of Pages : 2
ic-
B.E.(Computer Engineering)
tat
3s
DATA ANALYTICS
6:0
(2015 Pattern) (Semester - I) (410243)
01 91
3:4
Time : 2½ Hours] [Max. Marks : 70
0
91
1/0 13
Instructions to the candidates:
1) Answer Q.1 or Q.2, Q.3 or Q.4, Q.5 or Q.6, Q.7 or Q.8, Q.9 or Q.10.
0
5/2
2) Neat diagrams must be drawn wherever necessary.
.23 GP
3) Figures to the right side indicate full marks.
4) Assume suitable data if necessary.
E
81
8
C
23
ic-
Q1) a) What is big data? Explain 3V’s of Big Data. [5]
16
tat
b) Draw Data Analytics Lifecycle & give brief description about all phases.
8.2
3s
[5]
.24
6:0
91
OR
49
3:4
30
91
Q2) a) Write a case study on Global Innovation Network & Analysis (GINA).[5]
01
01
b) Explain Null Hypothesis & Alternative Hypothesis. [5]
5/2
GP
1/0
CE
Q3) a) How Wilcoxon Rank-Sum Test works? [5]
81
8
23
.23
b) Explain Type 1 and Type 2 errors. [5]
ic-
16
tat
OR
8.2
3s
Q4) a) Write an Apriori Algorithm. [5]
.24
6:0
91
49
b) Define following terms with example : Confidence and Lift. [5]
3:4
30
91
01
01
Q5) a) Explain following Decision Tree Algorithms : [9]
5/2
GP
i) ID3 Algorithm
1/0
CE
81
ii) C4.5
.23
iii) CART
16
b) How Naive Baye’s classification works? Give its applications. [8]
8.2
OR
.24
P.T.O.
49
Q6) a) Explain following terms : [9]
8
23
i) Bagging
ic-
ii) Boosting
tat
iii) Random forest
3s
6:0
b) What is data visualization? Describe any four data visualization techniques.
01 91
[8]
3:4
0
91
1/0 13
Q7) a) Why is it difficult to visualize Big Data? Also explain analytical techniques
0
5/2
used in Big Data Visualization. [9]
.23 GP
b) Explain various tools to visualize Big Data. (Any four) [8]
E
81
8
OR
C
23
ic-
Q8) a) What is Map-Reduce? Explain working of Map-Reduce with example.[9]
16
tat
b) Explain HDFS with respect to NameNode, DataNodes, Secondary
8.2
3s
NameNode with example. [8]
.24
6:0
91
49
3:4
Q9) a) Explain following terms : [8]
30
91
i) Smoothing
01
01
ii) Confusion matrix
5/2
GP
1/0
b) Explain Data Visualization Tool - Tableau. [8]
CE
81
OR
8
23
.23
Q10)a) Explain following terms : [8]
ic-
16
tat
i) Key-value store
8.2
3s
ii) Document store
.24
6:0
91
iii) Column family store
49
3:4
iv) Graph Databases
30
91
b) Why communication is important in data analytics lifecycle projects?[8]
01
01
5/2
GP
1/0
CE
81
.23
16
8.2
.24
[5561]-679 2
49