KEMBAR78
Data Mining | PDF | Cluster Analysis | Statistical Classification
0% found this document useful (0 votes)
290 views2 pages

Data Mining

The document outlines a course on data mining for a 4th year B.Tech in computer science. The course objectives are to learn data mining concepts, algorithms for association rule mining, classification, and clustering. The course outcomes are the ability to perform data preprocessing and apply mining techniques, identify patterns in large datasets, solve real-world problems using data mining, and classify web pages. The course covers topics like data preprocessing, association rule mining, classification techniques like decision trees and naive Bayes, clustering algorithms like k-means, and web and text mining.

Uploaded by

vijay kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
290 views2 pages

Data Mining

The document outlines a course on data mining for a 4th year B.Tech in computer science. The course objectives are to learn data mining concepts, algorithms for association rule mining, classification, and clustering. The course outcomes are the ability to perform data preprocessing and apply mining techniques, identify patterns in large datasets, solve real-world problems using data mining, and classify web pages. The course covers topics like data preprocessing, association rule mining, classification techniques like decision trees and naive Bayes, clustering algorithms like k-means, and web and text mining.

Uploaded by

vijay kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

R16 B.TECH CSE.

DATA MINING

B.Tech. IV Year I Sem. L T P C


Course Code: CS701PC 4 0 0 4

Course Objectives:
 Learn data mining concepts understand association rules mining.
 Discuss classification algorithms learn how data is grouped using clustering
techniques.
 To develop the abilities of critical analysis to data mining systems and applications.
 To implement practical and theoretical understanding of the technologies for data
mining
 To understand the strengths and limitations of various data mining models;

Course Outcomes:
 Ability to perform the preprocessing of data and apply mining techniques on it.
 Ability to identify the association rules, classification and clusters in large data sets.
 Ability to solve real world problems in business and scientific information using data
mining
 Ability to classify web pages, extracting knowledge from the web

UNIT - I
Introduction to Data Mining: Introduction, What is Data Mining, Definition, KDD,
Challenges, Data Mining Tasks, Data Preprocessing, Data Cleaning, Missing data,
Dimensionality Reduction, Feature Subset Selection, Discretization and Binaryzation, Data
Transformation; Measures of Similarity and Dissimilarity- Basics.

UNIT - II
Association Rules: Problem Definition, Frequent Item Set Generation, The APRIORI
Principle, Support and Confidence Measures, Association Rule Generation; APRIOIRI
Algorithm, The Partition Algorithms, FP-Growth Algorithms, Compact Representation of
Frequent Item Set- Maximal Frequent Item Set, Closed Frequent Item Set.

UNIT - III
Classification: Problem Definition, General Approaches to solving a classification problem ,
Evaluation of Classifiers , Classification techniques, Decision Trees-Decision tree
Construction , Methods for Expressing attribute test conditions, Measures for Selecting the
Best Split, Algorithm for Decision tree Induction ; Naive-Bayes Classifier, Bayesian Belief
Networks; K- Nearest neighbor classification-Algorithm and Characteristics.

UNIT - IV
Clustering: Problem Definition, Clustering Overview, Evaluation of Clustering Algorithms,
Partitioning Clustering-K-Means Algorithm, K-Means Additional issues, PAM Algorithm;
R16 B.TECH CSE.

Hierarchical Clustering-Agglomerative Methods and divisive methods, Basic Agglomerative


Hierarchical Clustering Algorithm, Specific techniques, Key Issues in Hierarchical
Clustering, Strengths and Weakness; Outlier Detection.

UNIT - V
Web and Text Mining: Introduction, web mining, web content mining, web structure
mining, we usage mining, Text mining –unstructured text, episode rule discovery for texts,
hierarchy of categories, text clustering.

TEXT BOOKS:
1. Data Mining- Concepts and Techniques- Jiawei Han, Micheline Kamber, Morgan
Kaufmann Publishers, Elsevier, 2 Edition, 2006.
2. Introduction to Data Mining, Pang-Ning Tan, Vipin Kumar, Michael Steinbanch,
Pearson Education.
3. Data mining Techniques and Applications, Hongbo Du Cengage India Publishing

REFERENCE BOOKS:
1. Data Mining Techniques, Arun K Pujari, 3rd Edition, Universities Press.
2. Data Mining Principles & Applications – T.V Sveresh Kumar, B.Esware Reddy,
Jagadish S Kalimani, Elsevier.
3. Data Mining, Vikaram Pudi, P Radha Krishna, Oxford University Press

You might also like