Printed Page: 1 of 2
Subject Code: KIT601
0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0
BTECH
(SEM VI) THEORY EXAMINATION 2021-22
DATA ANALYTICS
Time: 3 Hours Total Marks: 100
Note: Attempt all Sections. If you require any missing data, then choose suitably.
SECTION A
1. Attempt all questions in brief. 2*10 = 20
Qno Questions CO
(a) Discuss the need of data analytics. 1
(b) Give the classification of data. 1
(c) Define neural network. 2
(d) What is multivariate analysis? 2
(e) Give the full form of RTAP and discuss its application. 3
(f) What is the role of sampling data in a stream? 3
(g) Discuss the use of limited pass algorithm. 4
(h) What is the principle behind hierarchical clustering technique? 4
(i) List five R functions used in descriptive statistics. 5
90
1
(j) List the names of any 2 visualization tools. 5
13
_2
2.
P1
SECTION B
24
2E
2. Attempt any three of the following: 10*3 = 30
5.
.5
P2
Qno Questions CO
17
(a) Explain the process model and computation model for Big data 1
Q
platform.
|1
(b) Explain the use and advantages of decision trees. 2
5
(c) Explain the architecture of data stream model. 3
3
(d) Illustrate the K-means algorithm in detail with its advantages. 4
8:
(e) Differentiate between NoSQL and RDBMS databases. 5
:2
13
SECTION C
2
02
3. Attempt any one part of the following: 10*1 = 10
-2
Qno Questions CO
06
(a) Explain the various phases of data analytics life cycle. 1
7-
(b) Explain modern data analytics tools in detail. 1
|1
4. Attempt any one part of the following: 10 *1 = 10
Qno Questions CO
(a) Compare various types of support vector and kernel methods of data 2
analysis.
(b) Given data= {2,3,4,5,6,7;1,5,3,6,7,8}. Compute the principal 2
component using PCA algorithm.
QP22EP1_290 | 17-06-2022 13:28:35 | 117.55.242.131
Printed Page: 2 of 2
Subject Code: KIT601
0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0
BTECH
(SEM VI) THEORY EXAMINATION 2021-22
DATA ANALYTICS
5. Attempt any one part of the following: 10*1 = 10
Qno Questions CO
(a) Explain any one algorithm to count number of distinct elements in a 3
data stream.
(b) Discuss the case study of stock market predictions in detail. 3
6. Attempt any one part of the following: 10*1 = 10
Qno Questions CO
(a) Differentiate between CLIQUE and ProCLUS clustering. 4
(b) A database has 5 transactions. Let min_sup=60% and min_conf=80%. 4
TID Items_Bought
T100 {M, O, N, K, E, Y}
T200 {D, O, N, K, E, Y}
T300 {M, A, K, E}
T400 {M, U, C, K, Y}
90
1
T500 {C, O, O, K, I, E}
13
_2
2.
P1
24
2E
i) Find all frequent itemsets using Apriori algorithm.
5.
ii) List all the strong association rules (with support s and confidence
.5
P2
c).
17
Q
|1
7. Attempt any one part of the following: 10*1 = 10
5
Qno Questions CO
3
8:
(a) Explain the HIVE architecture with its features in detail. 5
:2
(b) Write R function to check whether the given number is prime or not. 5
13
2
02
-2
06
7-
|1
QP22EP1_290 | 17-06-2022 13:28:35 | 117.55.242.131