0% found this document useful (0 votes)

6 views7 pages

CSET228 Course Handout

The document outlines the course plan for 'Data Mining and Predictive Modelling' (CSET228) at Bennett University, detailing the faculty involved, course structure, and learning outcomes. It includes a comprehensive syllabus covering data mining techniques, predictive modeling, and practical lab work, along with evaluation policies and recommended resources. The course is designed for B.Tech students in their fourth semester, focusing on data analysis and predictive analytics skills.

Uploaded by

Chirag Sethi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views7 pages

CSET228 Course Handout

Uploaded by

Chirag Sethi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

COURSE PLAN

For
Data Mining and Predictive Modelling

(CSET228)

Faculty Name : Dr. Karnika Dwivedi, Dr. Madhuri Gupta, Dr. Anshika
Arora, Dr Dinesh Kumar, Dr Eht e Sham
Course Type : B.Tech Specialization Core-II (Data Science)

Semester and Year : IV Semester (II Year)

L-T-P : 3-0-2

Credits :4

School : SCSET

Course Level : UG

School of Computer Science

Engineering and Technology

Bennett University
Greater Noida, Uttar Pradesh

Page 1 of 7
COURSE CONTEXT
VERSION NO. OF
CURRICULUM/SYLLABUS
SCHOOL SCSET THAT THIS COURSE IS A V1
PART OF
DATE THIS COURSE
DEPARTMENT WILL BE EFFECTIVE Jan–Jun,2024
FROM
VERSION NUMBER OF
DEGREE B.Tech. THIS COURSE 2

COURSE BRIEF
Data Mining and
COURSE TITLE Predictive PRE-REQUISITES NA
Modelling
COURSE CODE CSET228 TOTAL CREDITS 4
COURSE TYPE Specialized Core – II L-T-P FORMAT 3-0-2

Page 2 of 7
LIST OF FACULTY MEMBERS TEACHING THE COURSE:

Name Designation Email Id

(Professor/Associate
Professor/ Assistant
Professor/PHD
Scholar/
Postdoc/....)
Dr Karnika Dwivedi Assistant Professor Karnika.dwivedi@bennett.edu.in
Dr Anshika Arora Assistant Professor
Dr Madhuri Gupta Assistant Professor
Dr Dinesh Kumar Associate Professor
Dr Eht e Sham Assistant Professor

FACULTY TIME TABLE:

Dr Karnika Dwivedi: Course Coordinator

Dr Anshika Arora

Dr Madhuri Gupta

Page 3 of 7
Dr Dinesh Kumar

Dr Eht e Sham

COURSE SUMMARY
This course exposes multiple techniques of understanding and analyzing the data from a
mathematical point of view. In addition, they will also use multiple predictive models to
analyse the future trend. This will be done statistically.

COURSE-SPECIFIC LEARNING OUTCOMES (CO)

By the end of this program, students should have the following knowledge, skills and values:

CO1: To articulate data preparation for data mining and analyzing based on pre-
processingtechniques.

CO2: To examine predictive analysis in various use cases.

Page 4 of 7
CO3: To make use of exploratory data analysis to gain insights and prepare data for
predictive modelling.

CO – PO /PSO Mapping
COs PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO POS
→ 2 3
POs
CO1 H H H M H M H M H M M
CO2 H M M H M H H M H M
CO3 H H M M H M H H H M
H: High / M: Medium /L: Low

SYLLABUS
Module 1 (11 hours)
Purpose of Data mining, Procedures of Data Mining, Functionality of Data Mining, Knowledge
data discovery process, Data, and attribute type, Properties of data, Discrete and continuous
attributes, Dataset types, Data quality measurement, Noise Analysis and its importance,
Techniques of Data pre-processing, Aggregation, Sampling, Curse of dimensionality,
Dimensionality reduction, Feature selection and generation, Discretization and vectorization,
Binarization, Attribute transformation correlation, Association rule mining, Apriori algorithm,
Rule generation, Pattern Mining in: Multilevel, Multidimensional Space Pattern Mining.

Module 2 (7 hours)
Rule-based reasoning, Memory-based reasoning, measuring data similarity, Similarity Metrics:
Distance-based measure, Information based measures, Set similarity measure, Jaccard Index,
Sorenson Dice Coefficient, Model Selection Problem, Error Analysis, Case study, Startups in
DataAnalysis.

Module 3 (10 hours)

Outlier analysis in classification and clustering, Probabilistic models for clustering, Clustering
high dimensional data: Subspace clustering, Projection Based clustering, Exploratory data
analysis, Data summarization and visualization, Dataset exploration, Data Exploration Tools,
Interactive Data Exploration, Predictive models, Design Principles, Parametric Models, Non-
Parametric Models, ANOVA, Regression Analysis, Frequent Pattern Mining, Mining Closed and
Max Patterns.

Module 4 (14 hours)

Linear discriminant analysis, Fisher discriminant analysis, Time series Model: ARMA, ARIMA,

Page 5 of 7
ARFIMA, Factor Analysis, Uncertainty quantification, Forward uncertainty propagation, Inverse
uncertainty quantification, Non-Negative Matrix Factorization, Sequential Matrix Factorization.
Exact Matrix Factorization, Expert Lecture from Industry, Recommendation System and
Collaborative Filtering, Multidimensional Scaling, Mining Textual Data, Temporal mining,
Spatial mining, Visual and audio data mining, Ubiquitous and invisible data mining- Privacy,
Security, Social Impacts of data mining.

STUDIO WORK / LABORATORY EXPERIMENTS:

Data pre-processing and vectorization. Quality analysis of data. Feature selection and Ranking.
Association rule mining and implementation of the Apriori algorithm. Data Similarity and set
similarity. Error analysis and model selection. Frequent pattern mining and regression.
Discriminant Analysis. Factor Analysis. Matrix Factorization. Recommendation System.

TEXTBOOKS/LEARNING RESOURCES:
1. Bruce Ratner, Statistical and Machine-Learning Data Mining:Techniques for
Better PredictiveModeling and Analysis (3rd ed.), Chapman and Hall/CRC, 2017.
ISBN 978-1498797603.
2. Dursun Delen, Predictive Analytics (1st ed.), Knime, 2020. ISBN 9780136738516.

REFERNCE BOOKS/LEARNING RESOURCES:

1. Mohammed J. Zaki and Wagner Meira, Jr, Data Minimg and MachineLearning (1st ed.),
Cambridge University Press, 2020. ISBN 9781108473989

TEACHING-LEARNING STRATEGIES
The course will be taught using a combination of the best practices of teaching-learning.
Multiple environments will be used to enhance the outcomes such as seminar, self-
learning, MOOCs,group discussions and ICT based tools for class participation along with
the classroom sessions. The teaching pedagogy being followed includes more exposure
to hands-on experiment andpractical implementations done in the lab sessions. To match
with the latest trend in academics, case study, advanced topics and research oriented topics
are covered to lay down the foundation and develop the interest in the students leading to
further exploration of the related topics. To make the students aware of the industry trends,
one session of expert lecture will be organized to provide a platform to the students for
understanding the relevant industry needs.

EVALUATION POLICY

Page 6 of 7
Components of Course Evaluation Percentage Distribution

Mid-Term 20

End-Term 40

Course Certification and Viva 10

Lab Continuous Evaluations 15

End-Term Lab Examination/ Hackathon 15

To be Filled each Semester

Probable Case Studies:
1) Advanced Research Topics: Time Series Analysis, Prediction
2) Startups to be discussed: Uber, Fractal Analytics
3) Assessment Components Details: As given in evaluation policy
4) Software required: Anaconda, google colab, pycharm, IDLE, VScode (anyone)
5) Hardware required: NA

Relevant MOOC Courses being Referred:

Specialization: https://www.coursera.org/specializations/data-mining
Note on specialization course requirements:

The specialization program offers a selection of 6 courses, each varying in duration:

• Some courses are 30 hours or more.

• Some courses are 15–16 hours.

To fulfil the specialization requirements:

1. If a student opts for a 30-hour or longer course, they are required to complete only one
certification for that course.
2. If a student opts for a course of 15–16 hours, they must complete two certifications, each of 15
or 16 hours, to meet the requirement.

Page 7 of 7

Da Handbook
No ratings yet
Da Handbook
18 pages
Dmpa Syllabus
No ratings yet
Dmpa Syllabus
2 pages
Data Analytics Course Handout
No ratings yet
Data Analytics Course Handout
7 pages
Data Warehousing & Mining Course
No ratings yet
Data Warehousing & Mining Course
45 pages
Sp24 DM Teaching Plan 02042024 114322am
No ratings yet
Sp24 DM Teaching Plan 02042024 114322am
7 pages
M S Ramaiah Institute of Technology Department of Information Science & Engg
No ratings yet
M S Ramaiah Institute of Technology Department of Information Science & Engg
11 pages
Bcse 0553
No ratings yet
Bcse 0553
1 page
Gujarat Technological University: Page 1 of 2
No ratings yet
Gujarat Technological University: Page 1 of 2
2 pages
Data Mining and Business Intelligence
No ratings yet
Data Mining and Business Intelligence
4 pages
Data Warehousing & Mining Course
No ratings yet
Data Warehousing & Mining Course
3 pages
Handout
No ratings yet
Handout
4 pages
DM Handbook
No ratings yet
DM Handbook
11 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
3 pages
B.Tech Jntuh DWDM Course Description
No ratings yet
B.Tech Jntuh DWDM Course Description
6 pages
Co-Requisite: Prerequisite: Data Book / Codes/Standards Course Category Course Designed by Approval
No ratings yet
Co-Requisite: Prerequisite: Data Book / Codes/Standards Course Category Course Designed by Approval
2 pages
DM Handbook
No ratings yet
DM Handbook
11 pages
Mod1 Datamining&Warehousing Last
No ratings yet
Mod1 Datamining&Warehousing Last
5 pages
DATA 240 - 23 - Lec1 - FA 2024 - Dist
No ratings yet
DATA 240 - 23 - Lec1 - FA 2024 - Dist
26 pages
Course Outline
No ratings yet
Course Outline
2 pages
Pa - PPT Unit 4
100% (1)
Pa - PPT Unit 4
96 pages
DMDW Lesson Plan
No ratings yet
DMDW Lesson Plan
8 pages
Ad8552 ML Unit Iv
No ratings yet
Ad8552 ML Unit Iv
86 pages
r21 III II Syllabus Hits-1
No ratings yet
r21 III II Syllabus Hits-1
26 pages
1676457507
No ratings yet
1676457507
113 pages
303 - Data Analysis Using Python
No ratings yet
303 - Data Analysis Using Python
6 pages
CourseOutline FDS
No ratings yet
CourseOutline FDS
2 pages
Business Data Mining - Syllabus7675535
No ratings yet
Business Data Mining - Syllabus7675535
1 page
Perform Association Mining and Analyze Clusters Using Different Methods
No ratings yet
Perform Association Mining and Analyze Clusters Using Different Methods
90 pages
Course Objectives DM
No ratings yet
Course Objectives DM
4 pages
Dmsyll
No ratings yet
Dmsyll
2 pages
Aula 1 - Programa Mestrado Data Mining I 201617 v2
No ratings yet
Aula 1 - Programa Mestrado Data Mining I 201617 v2
6 pages
Cse2021 - Data Mining CH
No ratings yet
Cse2021 - Data Mining CH
13 pages
Program Name BCA Title of The Course Data Mining Course Code CA-E1 Credits 03 Total No. of Teaching Hours 48
No ratings yet
Program Name BCA Title of The Course Data Mining Course Code CA-E1 Credits 03 Total No. of Teaching Hours 48
2 pages
Data Science and Machine Learning Syllabus V1.0
No ratings yet
Data Science and Machine Learning Syllabus V1.0
6 pages
Lesson Plan: Unit Topic Books For Reference No. of Hours Required Teaching Methodology
No ratings yet
Lesson Plan: Unit Topic Books For Reference No. of Hours Required Teaching Methodology
6 pages
Mcse615l - Data-Analytics - TH - 1.0 - 71 - Mcse615l - 67 Acp
No ratings yet
Mcse615l - Data-Analytics - TH - 1.0 - 71 - Mcse615l - 67 Acp
2 pages
PDF
No ratings yet
PDF
7 pages
DMB Syllabus
No ratings yet
DMB Syllabus
2 pages
Course 9 Applied Data Analytics Second Version
No ratings yet
Course 9 Applied Data Analytics Second Version
16 pages
DM Course Hand-Out
No ratings yet
DM Course Hand-Out
10 pages
AI & ML Syllabus
No ratings yet
AI & ML Syllabus
10 pages
ISOM3360 20L1L2 20syllabus - 2122
No ratings yet
ISOM3360 20L1L2 20syllabus - 2122
6 pages
Data Mining Course Outline
No ratings yet
Data Mining Course Outline
7 pages
Data Analytics & Big Data Course
No ratings yet
Data Analytics & Big Data Course
10 pages
DMPA
No ratings yet
DMPA
5 pages
IT-416 Data Mining
No ratings yet
IT-416 Data Mining
3 pages
DM-Course File
No ratings yet
DM-Course File
14 pages
MBBA327L - Business Analytics - 2023-24 Odd Semester
No ratings yet
MBBA327L - Business Analytics - 2023-24 Odd Semester
9 pages
Course Specification: (Main, Optional, Free Choice) : Main F, A, P, 1,2,3, M)
No ratings yet
Course Specification: (Main, Optional, Free Choice) : Main F, A, P, 1,2,3, M)
3 pages
Brochure Big Data
No ratings yet
Brochure Big Data
6 pages
CMP 632 Data Science and Analytics
No ratings yet
CMP 632 Data Science and Analytics
4 pages
Course Outline - ML IIFT Delhi MBA (BA) Sep-Dec 24
No ratings yet
Course Outline - ML IIFT Delhi MBA (BA) Sep-Dec 24
5 pages
MR20 Vi-I Syllabus
No ratings yet
MR20 Vi-I Syllabus
22 pages
Ps - ML Coursepack - 19th Feb 24
No ratings yet
Ps - ML Coursepack - 19th Feb 24
8 pages
4th Semester Data Science Syllabus
No ratings yet
4th Semester Data Science Syllabus
10 pages
DMW Ebook TechKnowledge
No ratings yet
DMW Ebook TechKnowledge
216 pages
BIT 454 - Data Warehousing and Data Mining
No ratings yet
BIT 454 - Data Warehousing and Data Mining
2 pages
It5003 - Data Warehousing and Data Mining-1
No ratings yet
It5003 - Data Warehousing and Data Mining-1
5 pages
F 2 PDF
No ratings yet
F 2 PDF
9 pages
Gwendolyn Brooks Study Guide
No ratings yet
Gwendolyn Brooks Study Guide
6 pages
Ls Dyna Ls Prepost Tutorial
0% (1)
Ls Dyna Ls Prepost Tutorial
33 pages
7 The Brain
100% (1)
7 The Brain
19 pages
DLP in Math Ttleg
No ratings yet
DLP in Math Ttleg
3 pages
4shapes in Tide Pools
No ratings yet
4shapes in Tide Pools
7 pages
WO Albeng Alprod Depo 30
No ratings yet
WO Albeng Alprod Depo 30
3 pages
Empowering Girls Through Selfies
No ratings yet
Empowering Girls Through Selfies
3 pages
Asm Brief - POM A2
No ratings yet
Asm Brief - POM A2
5 pages
Advanced Building System: Resistance Thermal Insulation Energy Saving Fast Installation
No ratings yet
Advanced Building System: Resistance Thermal Insulation Energy Saving Fast Installation
23 pages
Diversity Models and Dimensions Guide
No ratings yet
Diversity Models and Dimensions Guide
4 pages
Actividad 6 Reading Comprehension: Deisy Johanna Guayacán Vanegas
No ratings yet
Actividad 6 Reading Comprehension: Deisy Johanna Guayacán Vanegas
4 pages
Manual Allplan BCM Quantities
No ratings yet
Manual Allplan BCM Quantities
193 pages
Dre8 Progress Test 2 A
No ratings yet
Dre8 Progress Test 2 A
3 pages
Atomic Structure
No ratings yet
Atomic Structure
18 pages
Master Chinese Pinyin in 7 Days
No ratings yet
Master Chinese Pinyin in 7 Days
1 page
TSS HD Suspension
No ratings yet
TSS HD Suspension
2 pages
RX200A-3-25-1D-MRZ 200mm Pedestrian + Acoustic Device
No ratings yet
RX200A-3-25-1D-MRZ 200mm Pedestrian + Acoustic Device
4 pages
CEL 2106 - Material 3
No ratings yet
CEL 2106 - Material 3
12 pages
SAP S - 4 HANA in Project Management
No ratings yet
SAP S - 4 HANA in Project Management
3 pages
ServiceManuals LG Fridge GRL257NI GR-L257NI Service Manual
100% (1)
ServiceManuals LG Fridge GRL257NI GR-L257NI Service Manual
128 pages
Contemporary Professional Nursing Final
No ratings yet
Contemporary Professional Nursing Final
17 pages
All in One Science Class 10
No ratings yet
All in One Science Class 10
25 pages
Za HL 368 Big Book Original in This Together Ver 2
No ratings yet
Za HL 368 Big Book Original in This Together Ver 2
26 pages
Klüber Lubricants for Glass Industry
No ratings yet
Klüber Lubricants for Glass Industry
12 pages
Courseera's Foray Into Gen AI
No ratings yet
Courseera's Foray Into Gen AI
23 pages
BW PCA ConfigurationGuide
100% (1)
BW PCA ConfigurationGuide
29 pages
VLSI Design MCQs & Answers
0% (1)
VLSI Design MCQs & Answers
20 pages
Diagramas GDZ-50E
No ratings yet
Diagramas GDZ-50E
4 pages
Mind, Language and Society Philosophy in The Real World
No ratings yet
Mind, Language and Society Philosophy in The Real World
189 pages

CSET228 Course Handout

Uploaded by

CSET228 Course Handout

Uploaded by

COURSE PLAN

Semester and Year : IV Semester (II Year)

School of Computer Science

Name Designation Email Id

FACULTY TIME TABLE:

COURSE-SPECIFIC LEARNING OUTCOMES (CO)

CO2: To examine predictive analysis in various use cases.

Module 3 (10 hours)

Module 4 (14 hours)

STUDIO WORK / LABORATORY EXPERIMENTS:

REFERNCE BOOKS/LEARNING RESOURCES:

Course Certification and Viva 10

Lab Continuous Evaluations 15

End-Term Lab Examination/ Hackathon 15

To be Filled each Semester

Relevant MOOC Courses being Referred:

The specialization program offers a selection of 6 courses, each varying in duration:

• Some courses are 30 hours or more.

To fulfil the specialization requirements:

You might also like