Data Mining for Business
Course Type Course Code Name of Course L T P Credit
DC NMSC522 Data Mining for Business 3 1 0 4
Course Objective
This course will provide students the opportunity to learn the importance of data mining and various
techniques for handling different types of data and its importance in business. Also, they will learn how
and when to implement a specific data mining technique for solving a particular business problem.
They will be exposed to various classification, clustering, and outlier detection algorithms & will learn
their workability in detail. They will also be exposed to the Weka software
(https://sourceforge.net/projects/weka/) for practical learning of data mining concepts using the popular
iris dataset.
Learning Outcomes
Students will be able to comprehend various data mining techniques and it will improve their skills in
solving industrial/business-related problems. They can get useful insights and accordingly can enhance
their decision-making capability.
Unit Lecture
Topics to be Covered Learning Outcome
No. Hours
Data Mining Relevance, types of data, 5L + 2T Students will be exposed to a brief
types of patterns and technologies used overview of data mining & different types
1.
to handle different types of data & of data & patterns.
patterns, Major issues in data mining.
Getting to know your data: Data 5L + 2T Students will get to know more about data
Objects, & Attribute Types, Basic & attributes, and its type. They will be
2. Statistical Descriptions of Data, Data exposed to statistical tools and techniques
Visualization, Measuring Similarity and to extract useful information from the
Dissimilarity. complex data.
Data Pre-processing: An overview, data 5L + 2T Students will get to know about data pre-
3. cleaning, data integration, data processing relevance and various
reduction, & data transformation. techniques to get data in a desired format.
Overview of Business Analytics and, Students will get an overview of the
Brief Introduction to Data Warehouse Business Analytics domain with an
and OLAP Technology Concepts, emphasis on Industry practices and they
4. 5L + 1T
supervised, and unsupervised data. will learn concepts, principles, and skills to
practice and engage in scalable pattern
discovery methods on massive data.
Mining Frequent Patterns, 6L + 2T In this module students will be exposed to
Associations, Correlations: Basic mining patterns with a Market Basket
5. Concepts & Methods. Analysis case. They will learn to identify
patterns using frequent item sets and
association rules.
Classification: Basic Concepts, decision 10L + 2T In this module, students will learn
tree, support vector machine, regression classification models and evaluation
technique, logistic classifier, neural techniques to compare the algorithm's
6.
network, and clustering algorithms, performance & learn to enhance accuracy
model evaluation, and selection with examples.
techniques.
Handling Missing values and Outliers; In this section, students will get a hold on
Dimension Reduction: Curse of how to prepare the data for modeling to
Dimensionality; Dimension Reduction meet the desired objectives and this will
7. using Principle Component Analysis; 3L + 1T focus on the in-depth study regarding
Feature Engineering; Imbalanced data dimensionality reduction techniques &
handling techniques their relevance in a business domain.
Data mining trends and Research 2L + 1T Students will learn the trends and research
Frontiers: mining complex data types frontier in data mining. They will be
8. and data mining applications in various exposed to various applications in several
domains (through Harvard Business domains to learn the deployment of models
Cases) in the business environment.
Data Mining Concepts Deployment 1L + 1T Students will get hands-on experience in
9. using Weka Software applying data mining concepts using
software for iris datasets.
Evaluation Components 100 Marks
Mid-Term Exam 30 Marks
End-Term Exam 50 Marks
20 Marks
2 Quizzes (subjective/multiple-choice questions)
Text Books:
1. Data Mining: Concepts and Techniques by Jiawei Han, Micheline Kamber, and Jian Pei
(Morgan Kaufmann, Elsevier publisher)
2. Data Mining: Practical Machine Learning Tools and Techniques (Morgan Kaufmann
Series in Data Management Systems)
Reference Books:
1. Storytelling with Data: A Data Visualization Guide for Business Professionals
2. Exploratory Data Mining and Data Cleaning: 442 (Wiley Series in Probability and
Statistics)
3. Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python,
by Galit Shmueli, Peter C Bruce, Peter Gedeck, Nitin R Patel, (2020), Wiley.