KEMBAR78
DMW Syllabus | PDF
0% found this document useful (0 votes)
51 views1 page

DMW Syllabus

The document provides an overview of data mining and warehousing, covering key concepts such as data processing, data cleaning, and data reduction techniques. It discusses classification and prediction methods, including decision trees and neural networks, as well as various clustering methods. Additionally, it addresses mining association rules and statistical measures relevant to large databases.

Uploaded by

wohipa6172
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
51 views1 page

DMW Syllabus

The document provides an overview of data mining and warehousing, covering key concepts such as data processing, data cleaning, and data reduction techniques. It discusses classification and prediction methods, including decision trees and neural networks, as well as various clustering methods. Additionally, it addresses mining association rules and statistical measures relevant to large databases.

Uploaded by

wohipa6172
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

Data Mining and Warehousing

Unit I:

Overview, Motivation (for Data Mining), Data Mining-Definition & Functionalities, Data
Processing, Form of Data Pre-processing, Data Cleaning: Missing Values, Noisy Data, (Binning,
Clustering, Regression, Computer and Human inspection), Inconsistent Data, Data Integration
and Transformation. Data Reduction -Data Cube Aggregation, Dimensionality reduction, Data
Compression, Numerosity Reduction, Clustering, Discretization and Concept hierarchy
generation.

Unit II:

Concept Description: Definition, Data Generalization, Analytical Characterization, Analysis of


attribute relevance, Mining Class comparisons, Statistical measures in large Databases.
Measuring Central Tendency, Measuring Dispersion of Data, Graph Displays of Basic Statistical
class Description, Mining Association Rules in Large Databases, Association rule mining,
mining Single-Dimensional Boolean Association rules from Transactional Databases – Apriori
Algorithm, Mining Multilevel Association rules from Transaction Databases and Mining Multi-
Dimensional Association rules from Relational Databases.

Unit III:

What is Classification & Prediction, Issues regarding Classification and prediction, Decision tree,
Bayesian Classification, Classification by Back propagation, Multilayer feed-forward Neural
Network, Back propagation Algorithm, Classification methods K-nearest neighbour classifiers,
Genetic Algorithm. Cluster Analysis: Data types in cluster analysis, Categories of clustering
methods, Partitioning methods. Hierarchical Clustering- CURE and Chameleon. Density Based
Methods-DBSCAN, OPTICS. Grid Based Methods- STING, CLIQUE. Model Based Method –
Statistical Approach, Neural Network approach, Outlier Analysis

You might also like