KEMBAR78
DataMining Course Handout PDF | PDF | Data Mining | Cluster Analysis
0% found this document useful (0 votes)
185 views5 pages

DataMining Course Handout PDF

Uploaded by

Raja Karthik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
185 views5 pages

DataMining Course Handout PDF

Uploaded by

Raja Karthik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

WORK INTEGRATED LEARNING PROGRAMMES


COURSE HANDOUT

Part A: Content Design

Course Title Data Mining


Course No(s)
Credit Units 3
Last Revised by Swarna Chaudhary
Version No 2.0
Date 02/04/2020

Course Description
Data Mining is automated extraction of patterns representing knowledge implicitly stored in
information repositories. The course covers how to prepare real-world data for data mining tasks and
perform data mining tasks such as finding association rules, classification, and clustering. Students gain
knowledge of the design and use of data mining algorithms. The course includes database, statistical,
algorithmic and application perspectives of data mining.

Course Objectives
CO1 Understand the importance of data mining and the knowledge discovery that can be made from
information repositories with the help of data mining

CO2 Understand techniques of preparing real-world data for performing data mining

CO3 Understand data mining techniques for discovering interesting patterns from data

CO4 Understand efficiency, effectiveness of applicable techniques for data mining.

Text Book(s)
T1 Tan P. N., Steinbach M & Kumar V. “Introduction to Data Mining” Pearson Education, 2006
T2 Data Mining: Concepts and Techniques, Third Edition by Jiawei Han and Micheline Kamber
Morgan Kaufmann Publishers, 2006

Reference Book(s) & other resources


R1 Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner by Vijay Kotu
and Bala Deshpande Morgan Kaufmann Publishers © 2015
R2 Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications by
Gary Miner et al. Academic Press © 2012
R3 Recommender Systems for Learning by Nikos Manouselis, Hendrik Drachsler, Katrien
Verbert and Erik Duval Springer © 2013
Modular Content Structure

1. Introduction to Data Mining


1.1. Data Mining definitions
1.2. Data Mining activities
1.3. DM process
1.4. DM challenges
2. Data Preprocessing
2.1. Data Quality
2.2. Data preprocessing requirements
2.3. Data preprocessing techniques
3. Data Exploration
3.1. Statistical descriptions of data
3.2. Measuring data similarity & dissimilarity
4. Classification and Prediction
4.1. Concepts of classification and prediction
4.2. Decision trees for classification
4.3. Rule based classification,
4.4. Prediction Techniques
5. Association Analysis
5.1. Association analysis concepts
5.2. Apriori Algorithm for frequent itemsets
5.3. FP-Tree technique for frequent itemsets
5.4. Mining association rules
6. Clustering
6.1. Cluster analysis concepts.
6.2. Partitioning methods
6.3. Hierarchical methods for cluster analysis
6.4. Density based methods for cluster analysis
7. Anomaly Detection
7.1. Concepts of Outliers
7.2. Statistical approaches
7.3. Proximity and Density based outlier detection
8. Data mining on unstructured (Big) data
8.1. Graph Mining methods and applications
8.2. Multimedia Data Mining
8.3. Text Mining, Web and Social Media Mining
9. Data Mining Applications
9.1. Recommendation systems
9.2. Fraud Detection
9.3. Sentiment Analysis

Learning Outcomes:
No Learning Outcomes

LO1 Realize how data mining can enable knowledge discovery.

LO2 Knowledge of techniques of preparing real-world data for performing data mining.

LO3 Knowledge of data mining techniques for discovering interesting patterns from data.

LO4 Knowledge on efficiency, effectiveness of applicable techniques for data mining.


Part B: Contact Session Plan

Academic Term
Course Title Data Mining
Course No
Lead Instructor

Course Contents

Contact List of Topic Title Topic # Text/Ref


Hours(#) (from content structure in Part A) (from Book/external
content resource
structure in
Part A)

1  Introduction to Data Mining 1 T1: Ch-1


o Data Mining definitions
2 o Data Mining activities
o DM process
o DM challenges

3  Data Preprocessing 2 T1: 2.1, 2.2


o Data Quality T2- Ch-3
o Data preprocessing requirements
4 o Data preprocessing techniques

5  Data Exploration 3 T2: Ch-2


o Statistical descriptions of data
o Measuring data similarity & dissimilarity
6

7  Classification and Prediction 4 T2 – 8.1, 8.2, ,


o Concepts of classification and prediction 8.4, 8.5
8 o Decision trees for classification
o Rule based classification,
9 o Evaluation of classification techniques
o Prediction Techniques
10

11

12

13  Association Analysis 5 T2: Ch-6


o Association analysis concepts
14 o Apriori Algorithm for frequent itemsets
o FP-Tree technique for frequent itemsets
15 o Mining association rules

16
17  Clustering 6 T2: 10.1, 10.2,
o Cluster analysis concepts. 10.3, 10.4, 10.6
18 o Partitioning methods
o Hierarchical methods for cluster analysis
19 o Density based methods for cluster
analysis
20
o Evaluation of clustering algorithms
21

22

23  Anomaly Detection 7 T2:


o Concepts of Outliers 12.1,12.2,12.3,
o Statistical approaches 12.4.1,12.4.3
o Proximity and Density based outlier
24 detection

25  Data mining on unstructured (Big) data 8 T2 (Second


o Graph Mining methods and applications Edition) : 9, 10
26 o Multimedia Data Mining
o Text Mining, Web and
27 o Social Media Mining

28

29  Data Mining Applications 9 T2: 13.3


o Recommendation systems http://infolab.stanf
o Fraud Detection ord.edu/~ullman/
30 o Sentiment Analysis mmds/ch9.pdf
https://www.scien
cedirect.com/scien
ce/article/pii/S221
2567115014859

31  Review

32

# The above contact hours and topics can be adapted for non-specific and specific WILP programs
depending on the requirements and class interests.

Select Topics for experiential learning


Topic No. Select Topics in Syllabus for experiential learning

1 Data Preprocessing
2 Classification

3 Regression

4 Clustering

Evaluation Scheme
Legend: EC = Evaluation Component
No Name Type Duration Weight Day, Date, Session, Time
Assignment Implementation based 10% To be announced
EC-1 Quiz-I MCQs 1 hour 5% To be announced
Quiz-II MCQs 1 hour 5% To be announced
EC-2 Mid-Semester Test Closed Book 2 hours 30% To be announced
EC-3 Comprehensive Exam Open Book 3 hours 50% To be announced
Note - Evaluation components can be tailored depending on the proposed model.

Important Information
Syllabus for Mid-Semester Test (Closed Book): Topics in Weeks 1-8
Syllabus for Comprehensive Exam (Open Book): All topics given in plan of study

Evaluation Guidelines:
1. EC-1 consists of one Assignment and two Quizzes. Announcements regarding the same will be made
in a timely manner.
2. For Closed Book tests: No books or reference material of any kind will be permitted.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
3. For Open Book exams: Use of prescribed and reference text books, in original (not photocopies) is
permitted. Class notes/slides as reference material in filed or bound form is permitted. However,
loose sheets of paper will not be allowed. Use of calculators is permitted in all exams.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
4. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the student
should follow the procedure to apply for the Make-Up Test/Exam. The genuineness of the reason for
absence in the Regular Exam shall be assessed prior to giving permission to appear for the Make-up
Exam. Make-Up Test/Exam will be conducted only at selected exam centres on the dates to be
announced later.
It shall be the responsibility of the individual student to be regular in maintaining the self-study schedule as
given in the course handout, attend the lectures, and take all the prescribed evaluation components such as
Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation scheme
provided in the handout.

You might also like