
Sample Paper

Data Mining and Warehousing (20CSF-334)


2 Marks Questions
1) Define Data Mining
2) List out KDD process steps
3) What are the types of data?
4) Compare descriptive and predictive data mining
5) What is classification
6) What is prediction
7) Why do we need to pre-process the data?
8) List out Data Pre-processing steps
9) What is Data cleaning
10) What is Data integration
11) Illustrate Data transformation functions
12) List out the major issues in data mining
13) What is Data selection
14) Define Data warehouse
15) Define Outlier Analysis
16) Define Clustering analysis
17) Define Evolution Analysis
18) What is data redundancy
19) Define Data discretization
20) What is categorical attribute
21) List the key words used in the definition of Data Warehouse.
22) Compare the size of Database in OLTP and OLAP
23) Define metadata
24) List out the OLAP Operations
25) What is meant by association rule?
26) What is meant by Market basket analysis?
27) State and explain the Apriori property.
28) What is meant by Mining Multilevel Association Rules?
29) Define Uniform Minimum Support.
30) What is meant by Reduced Minimum Support?
31) What is meant by multidimensional association rules?
32) What is meant by intradimensional association rule?
33) What is meant by inter dimensional association rules?
34) What is meant by Quantitative association rules?
35) What is meant by Partition Algorithms?
36) State and explain the FP-growth algorithm.
37) What is meant by Frequent itemset.
38) What is meant by Maximal Frequent Item Set?
39) What is meant by Closed Frequent Item Set?
40) Explain the join and prune steps in the Apriori algorithm.
41) Draw and explain the conditional FP-tree.
42) How will you measure support and confidence with an example?
43) How can the efficiency of the Apriori algorithm be improved?
44) What is meant by conditional pattern base?
45) Where are decision trees mainly used?
46) What do you mean by concept hierarchies?
47) How will you solve a classification problem using decision trees?
48) Explain ID3.
49) What is a “decision tree”?
50) Define Data Classification.
51) Define Prediction.
52) What is the difference between “supervised” and “unsupervised” learning
schemes?
53) What are the requirements of clustering?
54) State the categories of clustering methods?
55) What do you mean by Bayesian Classification?
56) State and explain Bayes' theorem.
57) Differentiate between the K-Means and K-Medoids algorithms.
58) What do you mean by Hierarchical Clustering?
59) What do you mean by Agglomerative Clustering?
60) What do you mean by Outlier Detection?
61) What do you mean by Divisive Clustering?
62) What are Bayesian Belief Networks?
63) Why is naïve Bayesian classification called “naïve”? Briefly outline the major
ideas of naïve Bayesian classification.

5 Marks Questions
1) Identify the need for Data Mining
2) Show with a diagrammatic illustration the steps involved in the process of
Knowledge Discovery from Data
3) Classify the different types of data on which Mining can be performed
4) Illustrate the architecture of a typical Data mining system
5) Explain Various Data Mining Functionalities with an example
6) Illustrate with a diagram about Data Mining Task Primitives.
7) Discuss about the Major issues in Data Mining.
8) What is Data Cleaning? Describe various methods of Data Cleaning.
9) List the Issues to be considered during Data Integration
10) Explain about Various kinds of Association rule Mining.
11) Explain in detail about partitional algorithms with an example.
12) Explain the steps involved in Apriori Algorithm.
13) Explain in detail about Multidimensional association rule.
14) Explain the Naive Bayesian Classification algorithm.
15) Write short notes on Bayesian Belief Networks.
16) Discuss about k-nearest neighbor classification algorithm with an example
17) Explain in detail about Hierarchical Clustering.
18) Explain in detail about partitional Clustering method.
19) Discuss about Outlier Detection.
20) Explain in detail about Clustering methods with an example.
21) Given a decision tree, you have the option of (a) converting the decision tree to
rules and then pruning the resulting rules, or (b) pruning the decision tree and
then converting the pruned tree to rules. What advantage does (a) have over
(b)?
22) Why is tree pruning useful in decision tree induction? What is a drawback of
using a separate set of tuples to evaluate pruning?
23) Compare the advantages and disadvantages of eager classification (e.g., decision
tree, Bayesian, neural network) versus lazy classification (e.g., k-nearest
neighbor, case-based reasoning).
24) Briefly describe and give examples of each of the following approaches to
clustering: partitioning methods, hierarchical methods, density-based methods
and grid-based methods.
25) Present conditions under which density-based clustering is more suitable than
partitioning-based clustering and hierarchical clustering. Give application
examples to support your argument.

10 Marks Questions
1) Suppose that the data for analysis includes the attribute age. The age values for
the data tuples are (in increasing order) :
13, 15, 16, 16, 19, 20, 23, 29, 35, 41, 44, 53, 62, 69, 72
(i) Use min-max normalization to transform the value of 45 for age onto the
range [0,1]
(ii) Use Z-Score normalization to transform the value 45 for age where the
standard deviation of age is 20.64 years
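
A minimal Python sketch of both normalizations for question 1 (the mean is computed from the listed age values, and the standard deviation of 20.64 years is the one given in the question):

```python
# Illustrative sketch: min-max and z-score normalization of the age value 45.
ages = [13, 15, 16, 16, 19, 20, 23, 29, 35, 41, 44, 53, 62, 69, 72]
value = 45

# (i) min-max normalization onto the range [0, 1]
min_age, max_age = min(ages), max(ages)
min_max = (value - min_age) / (max_age - min_age)

# (ii) z-score normalization, using the standard deviation given in the question
mean_age = sum(ages) / len(ages)
std_age = 20.64
z_score = (value - mean_age) / std_age

print(f"min-max: {min_max:.3f}, z-score: {z_score:.3f}")
```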
2) Discuss about detecting data redundancy using correlation analysis
3) Explain about Data Transformation method with suitable example
4) Explain about the different Data Reduction techniques.
5) Discuss about the FP-growth algorithm for the example transactions {M,O,N,K,E,Y},
{D,O,N,K,E,Y}, {M,A,K,E}, {M,U,C,K,Y}, {C,O,O,K,I,E}, with Support = 60% and
Confidence = 80%.
6) State and explain the Apriori algorithm with an example. Consider the following
data set to generate association rules: {D,O,N,K,E,Y}, {M,A,K,E},
{M,U,C,K,Y}, {C,O,O,K,I,E}, with Support = 60% and Confidence = 80%.
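
For questions 5 and 6, a small brute-force Python sketch that counts itemset supports over the listed transactions; it is not an Apriori or FP-growth implementation, only a quick way to check hand-computed frequent itemsets against the 60% support threshold:

```python
# Illustrative sketch: brute-force support counting over the transactions from
# questions 5 and 6 (a check for hand-computed results, not an efficient
# Apriori or FP-growth implementation).
from itertools import combinations

transactions = [
    {"M", "O", "N", "K", "E", "Y"},
    {"D", "O", "N", "K", "E", "Y"},
    {"M", "A", "K", "E"},
    {"M", "U", "C", "K", "Y"},
    {"C", "O", "K", "I", "E"},  # duplicate items collapse inside a set
]
min_support = 0.6  # Support = 60% -> support count >= 3 out of 5 transactions

items = sorted(set().union(*transactions))
for size in range(1, len(items) + 1):
    frequent = []
    for candidate in combinations(items, size):
        count = sum(1 for t in transactions if set(candidate) <= t)
        if count / len(transactions) >= min_support:
            frequent.append((candidate, count))
    if not frequent:
        break  # by the Apriori property, no larger itemset can be frequent
    for itemset, count in frequent:
        print(size, itemset, count)
```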
7) Explain in detail about support and Confidence Measures with an example
8) Discuss about Quantitative association mining.
9) Discuss about Decision tree induction algorithm with an example.
10) Explain about Attribute Subset Selection Measures with an example.
11) Explain clustering in detail with types of clustering algorithms.
12) Use single and complete link agglomerative clustering to group the elements of
the following dataset: {8,11,21,29,40}
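
A short sketch for question 12 using SciPy's hierarchical clustering, where the 'single' and 'complete' methods correspond to single-link and complete-link agglomerative clustering:

```python
# Illustrative sketch: single-link and complete-link agglomerative clustering
# of the one-dimensional data set {8, 11, 21, 29, 40} with SciPy.
import numpy as np
from scipy.cluster.hierarchy import linkage

points = np.array([[8], [11], [21], [29], [40]])  # one feature per observation

for method in ("single", "complete"):
    Z = linkage(points, method=method)
    # each row of Z: (cluster i, cluster j, merge distance, size of new cluster)
    print(f"{method}-link merge steps:")
    print(Z)
```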
13) To make a drink, salt and sugar are mixed in a glass of water in some ratio. Based
on these two attributes, the drink is classified into two classes, i.e. good and bad.
The dataset for this scenario is given below:

Drink Id   Salt   Sweet   Result
1          7      7       Bad
2          7      4       Bad
3          3      4       Good
4          1      4       Good

By using a KNN classifier, find the class in which a drink with salt and sweet values
of 3 and 7 respectively will lie. Take the value of K = 3.
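
A minimal sketch for question 13, assuming plain Euclidean distance on the (salt, sweet) values and a majority vote over the K = 3 nearest neighbours:

```python
# Illustrative sketch: 3-nearest-neighbour classification of the query drink
# (salt = 3, sweet = 7) against the table above, using Euclidean distance.
from collections import Counter
from math import dist

# (salt, sweet) -> class label, copied from the table in the question
training = [((7, 7), "Bad"), ((7, 4), "Bad"), ((3, 4), "Good"), ((1, 4), "Good")]
query = (3, 7)
k = 3

# sort the training rows by distance to the query and keep the k closest
neighbours = sorted(training, key=lambda row: dist(row[0], query))[:k]
# majority vote among the k nearest labels
label = Counter(lbl for _, lbl in neighbours).most_common(1)[0][0]

print("k nearest neighbours:", neighbours)
print("predicted class:", label)
```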

14) Consider the following transactional dataset:


Find all frequent 2-itemsets with minimum support count = 2. Generate candidate 3-
itemsets using the C3 = F2 x F1 candidate generation method. Prune the candidates
which cannot be frequent.

15) Consider the following 9 two-dimensional data points:

x1(0,0), x2(1,0), x3(1,1), x4(2,2), x5(3,1), x6(3,0), x7(0,1), x8(3,2), x9(6,3)

Use the Euclidean distance with Eps = 1 and MinPts = 3. Find all core points,
border points and noise points, and show the final clusters using the DBSCAN
algorithm. Show the result step by step.
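
A short sketch for question 15 using scikit-learn's DBSCAN; note that its min_samples parameter counts the point itself, so confirm that this matches the MinPts = 3 convention used in class:

```python
# Illustrative sketch: DBSCAN on the nine points above with scikit-learn.
import numpy as np
from sklearn.cluster import DBSCAN

X = np.array([[0, 0], [1, 0], [1, 1], [2, 2], [3, 1],
              [3, 0], [0, 1], [3, 2], [6, 3]])  # x1 .. x9

db = DBSCAN(eps=1, min_samples=3).fit(X)
core = set(db.core_sample_indices_)
for i, label in enumerate(db.labels_):
    kind = "core" if i in core else ("noise" if label == -1 else "border")
    print(f"x{i + 1} {X[i].tolist()}: cluster {label}, {kind}")
```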

16) Consider the following points:

Apply K-means starting from the centroids: K1=P7 and K2 = P4
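
The point table for question 16 is not reproduced above, so the coordinates in the sketch below are placeholders only; it illustrates how Lloyd's k-means iterations proceed when the initial centroids are fixed to two chosen points (indices standing in for P7 and P4):

```python
# Illustrative sketch of k-means with fixed initial centroids.
# NOTE: the coordinates P1..P8 below are PLACEHOLDERS, not the values from the
# question; substitute the actual point table before using this.
import numpy as np

points = np.array([[1.0, 1.0], [1.5, 2.0], [3.0, 4.0], [5.0, 7.0],
                   [3.5, 5.0], [4.5, 5.0], [3.5, 4.5], [2.0, 2.5]])  # P1..P8 (placeholders)
centroids = points[[6, 3]].copy()  # K1 = P7, K2 = P4 (0-based indices 6 and 3)

for _ in range(100):
    # assign each point to its nearest centroid (Euclidean distance)
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # recompute each centroid as the mean of its assigned points
    new_centroids = np.array([points[labels == k].mean(axis=0)
                              for k in range(len(centroids))])
    if np.allclose(new_centroids, centroids):
        break  # converged
    centroids = new_centroids

print("cluster assignments (1 = K1, 2 = K2):", labels + 1)
print("final centroids:\n", centroids)
```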
