0% found this document useful (0 votes)

28 views5 pages

Data Science

The document outlines the Data Science course offered by Madhav Institute of Technology & Science, Gwalior, detailing course objectives, units of study, recommended books, and course outcomes. It covers fundamental concepts in data science, Python programming, data analysis, machine learning, and model evaluation. Additionally, it includes a list of experiments and skill-based projects for practical application of the learned concepts.

Uploaded by

Kratika Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views5 pages

Data Science

Uploaded by

Kratika Jain

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR

(Deemed to be University)
(Declared Under Distinct Category by Ministry of Education, Government of India)
NAAC Accredited with A++ Grade

Department of Computer Science and Engineering

DATA SCIENCE
150511/290501
DC
COURSE OBJECTIVES
● To provide the fundamental knowledge of Data Sciences, along with essential Python
programming skills..
● Apply data manipulation, statistical analysis, and visualization techniques using Python libraries
like NumPy and pandas.
● Develop, implement, and evaluate machine learning models while using statistical methods to
derive insights and validate results.
—--------------------------------------------------------------------------------------------------------------------

Unit – I:
Introduction to Data Science: Introduction, Definition, applications of Data Science, Impact of
Data Science, Data Analytics Life Cycle, role of Data Scientist.
Basics of Python: Essential Python libraries, Python Introduction- Features, Identifiers, Reserved
words, Indentation, Comments, Built-in Data types and their Methods: Strings, List, Tuples,
Dictionary, Set, Type Conversion- Operators. Decision Making: Looping-Loop Control statement,
Math and Random number functions. User defined functions.
Vectorized Computation: The NumPy ndarray- Creating ndarrays- Data Types for ndarrays-
Arithmetic with NumPy Arrays- Basic Indexing and Slicing.

Unit-II
Data Analysis (with Pandas): Series, DataFrame, Essential Functionality: Dropping Entries,
Indexing, Selection, and Filtering- Function Application and Mapping- Sorting and Ranking.
Summarizing and Computing Descriptive Statistics – Mean, Standard Deviation, Skewness and
Kurtosis. Unique Values, Value Counts, and Membership. Reading and Writing Data in Text
Format.

Unit-III
Exploratory Data Analysis and Visualisation: Handling Missing Data, Data Transformation:
Removing Duplicates, Transforming Data Using a Function or Mapping, Replacing Values,
Detecting and Filtering Outliers, Functions in pandas. Plotting with pandas: Line Plots, Bar Plots,
Histograms and Density Plots, Scatter or Point Plots.

Unit-IV
Introduction to Machine Learning: Types of Learning, Linear Regression- Simple Linear
Regression, Implementation, plotting and fitting regression line, Logistic Regression, K-Nearest
Neighbors (KNN), K-Means Clustering.
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR
(Deemed to be University)
(Declared Under Distinct Category by Ministry of Education, Government of India)
NAAC Accredited with A++ Grade

Unit-V
Model Evaluation Metrics: Accuracy, Precision, Recall, F1-Score
Hypothesis Testing: Mean and Variance Tests, p-value, Errors, Z-Test, t-Test, Paired t-Test, and
F-Test, Analysis of Variance (ANOVA) and Contingency Table Analysis

-------------------------------------------------------------------------------------------------------------------------------

RECOMMENDED BOOKS
1. Cathy O’Neil and Rachel Schutt , “Doing Data Science”, O'Reilly, 2015.
2. David Dietrich, Barry Heller, Beibei Yang, “Data Science and Big data Analytics”, EMC 2013
3. Artificial Intelligence: A Modern Approach by Stuart J. Russell and Peter Norvig, Prentice Hall.
4. Pattern Recognition and Machine Learning, Christopher M. Bishop
5. James, Gareth, et al. An introduction to statistical learning. Vol. 112. New York: springer, 2013.

COURSE OUTCOMES
After completion of this course, the students would be able to:
CO1: Analyze Data Science concepts and apply Python programming for data tasks, including
data manipulation with NumPy.
CO2: Analysis of the data for applying various statistical modeling approaches.
CO3: Develop expertise in managing missing data and assessing the impact of visualizations on
data insight communication.
CO4: Design and implement machine learning algorithms and assess model performance.
CO5: Develop statistical tests and evaluate machine learning models.

CO-PO Mapping (1 - Slightly; 2 - Moderately; 3 – Substantially)

PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO1 PO1 PO1 PSO PSO
0 1 2 1 2
CO1 3 2 1 1 3 2 - - 1 2 1 1 3 2

CO2 3 3 2 2 3 2 - 1 1 2 2 1 3 3

CO3 2 2 2 1 3 2 - 1 1 3 3 2 3 2

CO4 3 3 3 2 3 3 - 1 1 2 3 2 3 3

CO5 3 3 2 2 3 3 1 1 1 2 2 2 3 3
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR
(Deemed to be University)
(Declared Under Distinct Category by Ministry of Education, Government of India)
NAAC Accredited with A++ Grade

DATA SCIENCE
150511/290501
(DC)

List of Experiments

1. Perform Creation, indexing, slicing, concatenation and repetition operations on Python
built-in data types: Strings, List, Tuples, Dictionary, Set
2. Solve problems using decision and looping statements.
3. Apply Python built-in data types: Strings, List, Tuples, Dictionary, Set and their methods to
solve any given problem
4. Handle numerical operations using math and random number functions.
5. Manipulation of NumPy arrays- Indexing, Slicing, Reshaping, Joining and Splitting.
6. Computation on NumPy arrays using Universal Functions and Mathematical methods.
7. Import a CSV file and perform various Statistical and Comparison operations on
rows/columns.
8. Create Pandas Series and DataFrame from various inputs.
9. Import any CSV file to Pandas DataFrame and perform the following:
1. Visualize the first and last 10 records
2. Get the shape, index and column details
3. Select/Delete the records(rows)/columns based on conditions.
4. Perform ranking and sorting operations.
5. Do required statistical operations on the given columns.
6. Find the count and uniqueness of the given categorical values.
7. Rename single/multiple columns.
10.Import any CSV file to Pandas DataFrame and perform the following:
1. Handle missing data by detecting and dropping/ filling missing values.
2. Transform data using different methods.
3. Detect and filter outliers.
4. Perform Vectorized String operations on Pandas Series.
5. Visualize data using Line Plots, Bar Plots, Histograms, Density Plots and Scatter Plots.
11.Use the scikit-learn package in python to implement the regression model and its related
methods.
MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR
(Deemed to be University)
(Declared Under Distinct Category by Ministry of Education, Government of India)
NAAC Accredited with A++ Grade

Course Outcomes (COs) for the Data Science lab:

CO1: Apply fundamental Python programming constructs such as data types, control structures,
and functions to design ethical and efficient solutions for real-life problems.

CO2: Analyze and process structured and unstructured data using Python libraries like NumPy and
Pandas to derive meaningful insights while considering societal relevance and responsible data
handling.

CO3: Develop real world data science applications using Python

CO-PO Mapping (1 - Slightly; 2 - Moderately; 3 – Substantially)

COs PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2
CO1 3 3 2 2 2 2 2 3 2
CO2 3 3 2 2 2 2 2 2 3 3
CO3 3 2 3 2 2 2 3 3 2 2 3 3

MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR
(Deemed to be University)
(Declared Under Distinct Category by Ministry of Education, Government of India)
NAAC Accredited with A++ Grade

DATA SCIENCE
150511/290501
(DC)

list of skill-based project (Sample list)

● Exploratory Data Analysis (EDA): Perform an in-depth analysis of a dataset, including data
cleaning, visualization, and statistical analysis to gain insights and understand the
underlying patterns and relationships.
● Predictive Modeling: Build a machine learning model to predict a specific outcome or
target variable based on a given dataset. This could include classification, regression, or
time series forecasting tasks.
● Natural Language Processing (NLP): Develop a text classification or sentiment analysis
model using techniques such as tokenization, word embeddings, and recurrent neural
networks (RNNs) to analyze and understand text data.
● Image Recognition: Create an image recognition system using convolutional neural
networks (CNNs) to classify or identify objects, faces, or patterns in images.
● Recommendation System: Build a recommendation engine that suggests personalized
recommendations to users based on their preferences and behavior, using collaborative
filtering or content-based filtering techniques.
● Clustering Analysis: Implement clustering algorithms such as k-means, hierarchical
clustering, or DBSCAN to group similar data points together and discover hidden patterns
or segments within a dataset.
● Time Series Analysis: Analyze time-dependent data, such as stock prices or weather data,
using techniques like autoregressive integrated moving average (ARIMA), exponential
smoothing, or recurrent neural networks (RNNs).
● Anomaly Detection: Develop an anomaly detection system that can identify unusual or
suspicious patterns in data, which can be useful for fraud detection, network intrusion
detection, or outlier detection.
● Social Media Sentiment Analysis: Use data from social media platforms to analyze public
sentiment towards specific topics, brands, or events using natural language processing
techniques and sentiment analysis algorithms.
● Data Visualization Dashboard: Create an interactive dashboard using libraries like Plotly or
Dash to visualize and explore data, providing users with an intuitive interface to interact
with and gain insights from the data.

Please Note: Each project has to be submitted by a group of 1 or 2 students, and each group will
be assigned only one project.
***********

Skill Based Projects - Data - Science (See List On Last Page)
No ratings yet
Skill Based Projects - Data - Science (See List On Last Page)
4 pages
OCS353 Syllabus
No ratings yet
OCS353 Syllabus
5 pages
227C4A Data Science
No ratings yet
227C4A Data Science
2 pages
DSP U1
No ratings yet
DSP U1
89 pages
Minor Cse Dsv2
No ratings yet
Minor Cse Dsv2
7 pages
Syllabus AIML
No ratings yet
Syllabus AIML
14 pages
DSP U2
No ratings yet
DSP U2
172 pages
3rd Sem Syllabus
No ratings yet
3rd Sem Syllabus
5 pages
Python Data Science Certificate Course
No ratings yet
Python Data Science Certificate Course
5 pages
Data Science for Engineers Course
No ratings yet
Data Science for Engineers Course
8 pages
Gujarat Technological University: Overview of Python and Data Structures
No ratings yet
Gujarat Technological University: Overview of Python and Data Structures
4 pages
Cab112:Introduction To Data Science: Session 2024-25 Page:1/2
No ratings yet
Cab112:Introduction To Data Science: Session 2024-25 Page:1/2
2 pages
CU MSDS All Semesters Syllabus
No ratings yet
CU MSDS All Semesters Syllabus
10 pages
22am901 Data Science Using Python Unit 2
No ratings yet
22am901 Data Science Using Python Unit 2
116 pages
Ya5uE5 Syllabus Instructors
No ratings yet
Ya5uE5 Syllabus Instructors
2 pages
B.Tech - AIDS R 2021
No ratings yet
B.Tech - AIDS R 2021
31 pages
Ocs353 Data Science Fundamentals
No ratings yet
Ocs353 Data Science Fundamentals
2 pages
Machine Learning Lab Course Overview
No ratings yet
Machine Learning Lab Course Overview
49 pages
PDS Merged New
No ratings yet
PDS Merged New
19 pages
Syllabus OE AIDSML.
No ratings yet
Syllabus OE AIDSML.
7 pages
Edit Ds
No ratings yet
Edit Ds
37 pages
Data Science and Machine Learning Using Python
No ratings yet
Data Science and Machine Learning Using Python
4 pages
B.tech Minor Syllabus-CSE (Data Science) - Final
No ratings yet
B.tech Minor Syllabus-CSE (Data Science) - Final
17 pages
# Syllabus
No ratings yet
# Syllabus
2 pages
Data Science Diploma for Aspiring Pros
No ratings yet
Data Science Diploma for Aspiring Pros
43 pages
BOS CSE-Data Science (10!5!25)
No ratings yet
BOS CSE-Data Science (10!5!25)
39 pages
Minor Python Syllabus
No ratings yet
Minor Python Syllabus
41 pages
Master of Science (Data Science and Analytics)
No ratings yet
Master of Science (Data Science and Analytics)
10 pages
MSc AI & ML Program Structure 2024
No ratings yet
MSc AI & ML Program Structure 2024
9 pages
Data Science & Python Syllabus 2022-24
No ratings yet
Data Science & Python Syllabus 2022-24
9 pages
Cpget2023 M.SC Datascience Eligibility Criteria
No ratings yet
Cpget2023 M.SC Datascience Eligibility Criteria
1 page
Data Science Lab Guide
No ratings yet
Data Science Lab Guide
61 pages
Course Plan Fods
No ratings yet
Course Plan Fods
6 pages
Data Science & Big Data Lab Manual
No ratings yet
Data Science & Big Data Lab Manual
117 pages
Macse502 Programming-For-data-science Eth 1.0 83 Macse502
No ratings yet
Macse502 Programming-For-data-science Eth 1.0 83 Macse502
4 pages
Ocs353 DSF Syllabus
No ratings yet
Ocs353 DSF Syllabus
3 pages
Fundamentals of Machine Learning 4341603
No ratings yet
Fundamentals of Machine Learning 4341603
9 pages
Data Science Lab Guide
No ratings yet
Data Science Lab Guide
98 pages
DSBDAlab Manual
No ratings yet
DSBDAlab Manual
116 pages
Introduction of Machine Learning Course Code: 4350702
No ratings yet
Introduction of Machine Learning Course Code: 4350702
9 pages
Syllabus - PGD - DS - Batch-7 PDF
No ratings yet
Syllabus - PGD - DS - Batch-7 PDF
12 pages
Combined SoCIT OE 7th Sem
No ratings yet
Combined SoCIT OE 7th Sem
7 pages
Sem 6
No ratings yet
Sem 6
12 pages
Cds3005 Foundations-Of-data-science LP 1.0 18 Cds3005 Foundation-Of-data-science LP 1.0 1 Foundations of Data Science
No ratings yet
Cds3005 Foundations-Of-data-science LP 1.0 18 Cds3005 Foundation-Of-data-science LP 1.0 1 Foundations of Data Science
2 pages
Introduction To Data Science Course Outline
No ratings yet
Introduction To Data Science Course Outline
5 pages
DS+Roadmap Compressed
No ratings yet
DS+Roadmap Compressed
12 pages
CS 3352 Foundations of Data Science Syllabus
No ratings yet
CS 3352 Foundations of Data Science Syllabus
2 pages
Data Analysis
No ratings yet
Data Analysis
8 pages
PDF
No ratings yet
PDF
25 pages
20ad41e2 - Data Science
No ratings yet
20ad41e2 - Data Science
2 pages
Minor Data Science
No ratings yet
Minor Data Science
15 pages
BTCS9202 Data Sciences Lab Manual
No ratings yet
BTCS9202 Data Sciences Lab Manual
39 pages
Ocs353dsf Unit Wise Notes
100% (2)
Ocs353dsf Unit Wise Notes
121 pages
303 - Data Analysis Using Python
No ratings yet
303 - Data Analysis Using Python
6 pages
Data Science Syl Lab Us
No ratings yet
Data Science Syl Lab Us
4 pages
Data - Science - Manaul (Te)
No ratings yet
Data - Science - Manaul (Te)
78 pages
PDS Practical
No ratings yet
PDS Practical
94 pages
Ad3411-Dsa Lab Final Record
No ratings yet
Ad3411-Dsa Lab Final Record
33 pages
User Request For Attention
No ratings yet
User Request For Attention
12 pages
Khusboo Agarwal CSE MITS, Gwalior
No ratings yet
Khusboo Agarwal CSE MITS, Gwalior
14 pages
TCP Ip
No ratings yet
TCP Ip
3 pages
Class Less Addressing
No ratings yet
Class Less Addressing
33 pages
IP Address
No ratings yet
IP Address
27 pages
Khusboo Agarwal CSE MITS, Gwalior
No ratings yet
Khusboo Agarwal CSE MITS, Gwalior
15 pages
Questions CCV
No ratings yet
Questions CCV
5 pages
Classfull Address
No ratings yet
Classfull Address
21 pages
Delivery Forwarding
No ratings yet
Delivery Forwarding
17 pages
Disaster Management 13
No ratings yet
Disaster Management 13
26 pages
File Summary Review
No ratings yet
File Summary Review
12 pages
Disaster Management 8
No ratings yet
Disaster Management 8
18 pages
Disaster Management 4
No ratings yet
Disaster Management 4
12 pages
Assignment 2 Answer
No ratings yet
Assignment 2 Answer
17 pages
Assignment 2
No ratings yet
Assignment 2
1 page
Drawing LKG
No ratings yet
Drawing LKG
1 page
TCP Ip Unit 1,2
No ratings yet
TCP Ip Unit 1,2
9 pages
2 Merged
No ratings yet
2 Merged
29 pages
GFQR1027 L01
No ratings yet
GFQR1027 L01
5 pages
Sahu, Dwivedi - 2019 - User Profile As A Bridge in Cross-Domain Recommender Systems For Sparsity Reduction
No ratings yet
Sahu, Dwivedi - 2019 - User Profile As A Bridge in Cross-Domain Recommender Systems For Sparsity Reduction
21 pages
Build a Movie Recommender System
No ratings yet
Build a Movie Recommender System
19 pages
Computer Science Review: Saurabh Kulkarni, Sunil F. Rodd
No ratings yet
Computer Science Review: Saurabh Kulkarni, Sunil F. Rodd
33 pages
Use of Deep Learning in Modern Recommendation System: A Summary of Recent Works
No ratings yet
Use of Deep Learning in Modern Recommendation System: A Summary of Recent Works
6 pages
Recommender Systems Overview
No ratings yet
Recommender Systems Overview
6 pages
AI Foundations Student Guide
0% (1)
AI Foundations Student Guide
69 pages
SRD Document On Revolutionizing Fashion Through AI
No ratings yet
SRD Document On Revolutionizing Fashion Through AI
10 pages
Week3 Assignment
No ratings yet
Week3 Assignment
6 pages
IEEE Paper
No ratings yet
IEEE Paper
8 pages
Application of Dimensionality Reduction in Recommender System - A Case Study
No ratings yet
Application of Dimensionality Reduction in Recommender System - A Case Study
12 pages
A Smart Recommendation System For Carrier Shipper Matching Using Multilabel Classification - A Survey
No ratings yet
A Smart Recommendation System For Carrier Shipper Matching Using Multilabel Classification - A Survey
5 pages
AI's Impact on Higher Education
No ratings yet
AI's Impact on Higher Education
31 pages
Major Project Report NIT 2019002
No ratings yet
Major Project Report NIT 2019002
22 pages
Sodapdf
No ratings yet
Sodapdf
133 pages
Final PPT SIH2023 College
No ratings yet
Final PPT SIH2023 College
4 pages
Preprints202305 1649 v1
No ratings yet
Preprints202305 1649 v1
25 pages
DRUG Recommendation System Based On Sentiment Analysis of DRUG Reviews Using Machine Learning
No ratings yet
DRUG Recommendation System Based On Sentiment Analysis of DRUG Reviews Using Machine Learning
5 pages
Final Report Kmean 3
No ratings yet
Final Report Kmean 3
9 pages
Secure Persona Prediction and Data Leakage Prevention System Using Python
No ratings yet
Secure Persona Prediction and Data Leakage Prevention System Using Python
49 pages
A Personalized Food Recommendation Application Using A Hybrid Collaborative Filtering Approach
No ratings yet
A Personalized Food Recommendation Application Using A Hybrid Collaborative Filtering Approach
8 pages
Ml-Mod 1 Pyq and Imp QN
No ratings yet
Ml-Mod 1 Pyq and Imp QN
12 pages
Unit 4 - MLMM
No ratings yet
Unit 4 - MLMM
36 pages
GitHub - Peggy1502 - Amazing-Resources - List of References and Online Resources Related To Data Science, Machine Learning and Deep Learning
No ratings yet
GitHub - Peggy1502 - Amazing-Resources - List of References and Online Resources Related To Data Science, Machine Learning and Deep Learning
41 pages
Online Web Based Music Player
No ratings yet
Online Web Based Music Player
32 pages
Assignment 5 Ai
No ratings yet
Assignment 5 Ai
3 pages
Impact of Artificial Intelligence
No ratings yet
Impact of Artificial Intelligence
60 pages
Airbnb Data Insights for Stakeholders
No ratings yet
Airbnb Data Insights for Stakeholders
15 pages
AI Associate Dump
No ratings yet
AI Associate Dump
9 pages

Data Science

Uploaded by

Data Science

Uploaded by

MADHAV INSTITUTE OF TECHNOLOGY & SCIENCE, GWALIOR

Department of Computer Science and Engineering

CO-PO Mapping (1 - Slightly; 2 - Moderately; 3 – Substantially)

Course Outcomes (COs) for the Data Science lab:

CO-PO Mapping (1 - Slightly; 2 - Moderately; 3 – Substantially)

list of skill-based project (Sample list)

You might also like