KEMBAR78
Data Science | PDF | Data Analysis | Databases
0% found this document useful (0 votes)
6 views17 pages

Data Science

Dihckyckk

Uploaded by

Manoj B R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views17 pages

Data Science

Dihckyckk

Uploaded by

Manoj B R
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

POSTGRADUATE PROGRAM IN

DATA SCIENCE
About the Program
The Foundation of Data Science is a practical, outcomes-driven program designed for learners who
want to become effective contributors in data-driven teams. It takes you through the full journey of
working with data—starting from accessing and exploring raw datasets, to interpreting trends and
building models that drive business outcomes.

In the first phase, you’ll learn to identify what data is relevant, structure it for analysis, and apply
logic to filter and organize it efficiently. You’ll gain clarity on how to approach real-world business
questions using a structured, data-first mindset.

Next, you’ll dive into analyzing and summarizing data to uncover patterns, outliers, and correlations
that can influence decisions. You’ll work with large datasets to understand distributions, central
tendencies, and variability—key to identifying what’s normal, what’s unusual, and what matters
most in a business context.

As you progress, you’ll learn to:

• Evaluate assumptions and test business hypotheses using statistical methods

• Build forecasting and classification models that predict business outcomes or


customerbehavior

• Break down a business challenge into a step-by-step data analysis plan

• Translate raw insights into clear business recommendations through visual summaries and
storytelling

In the final phase, you’ll work on a capstone project simulating a real business scenario—where
you’ll source and prepare data, analyze it, build a solution, and present your findings in a
professional format.

*edX and the edX logo are registered trademarks of edX LLC. All rights reserved.
Program Highlights
Immersive & Hands-on Learning – Case-based, interactive sessions with real-world
problem-solving, applied analytics workflows, and guided exercises across
business scenarios.

Comprehensive Job-Focused Curriculum – Covers the full data journey: from


querying and analyzing data to applying statistical techniques and building
predictive models — ensuring you're ready for analytical roles.

Capstone & Applied Projects – 22-week program with integrated assessments and
a capstone project to demonstrate your ability to solve real-world data problems
end-to-end.

Live & Expert-Led Training – 70 - 90% of sessions delivered live by experienced


faculty and industry professionals with deep expertise in data-driven
decision-making.

Career Enablement & Readiness – Build a portfolio of work, gain practical exposure,
and receive structured career guidance to confidently pursue roles in analytics and
business intelligence.

Accessible to All Graduates – No prior coding or data experience required—ideal for


individuals from diverse academic backgrounds looking to enter data-centric
roles.

Eligibilty Criteria
• Bachelor's degree in any discipline
• STEM background preferred
Tools &
5+ PORTFOLIO Technologies
projects and case studies

MS Excel Power BI Python

5-7 HOURS
weekly effort

WORKSHOPS*
on industry-trending topics, such as
70%-90%
Advanced Deep Learning & Generative Instructor-led live online
Models, Model Deployment & classes
Productionization etc.

CAREER
COACHING
and mentorship

*Workshops duration - 6 Hours


CURRICULUM *
PROGRAM DURATION : 5-months
Approximately 5-7 Hours of student effort expected per week
Projects integrated through the curriculum*

MODULE SESSION LEARNING OUTCOMES TOOLS USED

Build a strong foundation in managing and querying


structured data using relational databases.
- Understand how databases store, structure, and manage data for
business use.
Deep Dive into SQL (MySQL or
Database & - Write SQL queries to filter, join, and aggregate datasets for insights. equivalent), DBMS
Query Writing platforms
- Use advanced SQL techniques like subqueries, CTEs, and window
functions.

- Apply SQL skills in analytics tasks like reporting, segmentation, and


performance tracking.

Learn to write Python code and automate data workflows


essential for analytics and reporting.
- Write clean, efficient Python code using variables, loops, and functions.
Python for Python, Jupyter
- Work with data structures (lists, dictionaries, sets) for organizing data.
Data Analysis Notebook
- Read and write data files (especially CSVs) using Python I/O functions.

- Use control structures and lambda functions for efficient data


processing.

Gain the ability to analyze data statistically and support


decision-making with evidence.
Statistical - Summarize datasets using central tendency, dispersion, and skewness.
Analysis & Python (NumPy,
- Apply probability concepts and distributions in real-world scenarios.
Decision SciPy), Excel
Making - Conduct hypothesis testing and interpret p-values, t-tests, and ANOVA.

- Translate statistical outputs into clear business recommendations.

*The curriculum is subject to changes


CURRICULUM *
MODULE SESSION LEARNING OUTCOMES TOOLS USED

Learn to transform, clean, and explore data to identify


trends and prepare it for modeling.
- Conduct univariate, bivariate, and multivariate exploratory analysis.
Python (Pandas,
Exploratory Data - Handle missing values, detect outliers, and apply data transformation Seaborn, Matplotlib),
Analysis (EDA) techniques. Excel
- Create group summaries, pivot tables, and correlation visualizations.

- Use EDA techniques to prepare high-quality, analysis-ready datasets.

Discover how to build predictive models that solve real


business problems.
- Understand machine learning concepts and build models using
Machine real-world data. Python (Scikit-learn,
Learning using Pandas, Matplotlib),
Python - Apply regression, classification, and clustering techniques effectively.
Jupyter Notebook
- Perform data preprocessing, feature engineering, and model tuning.

- Evaluate model performance using industry-standard metrics and


validation methods.

The capstone lets you prove your fundamental analytics


chops: take a messy public dataset, clean and shape it
with Python and SQL, explore patterns with statistics and
Capstone visualisations, then craft a concise data-story that drives
a clear business insight. It’s your hands-on showcase of
core scripting, querying, EDA, and storytelling skills.

*The curriculum is subject to changes


PORTFOLIO PROJECTS

Smarter Product Suggestions

Amazon’s huge review file tells who bought what and how they rated it. You’ll turn that
information into a mini “You May Also Like” engine. First, tidy the data: remove duplicates,
fill in missing ratings, and group purchases by customer. Next, build two simple
recommenders: “top sellers” and “people who bought X also bought Y.” Then try a smarter
maths trick called matrix factorisation that spots hidden patterns of taste. Test which
method offers the best five suggestions for each shopper. Finally, wrap the winner in a tiny
web app so anyone can enter a customer ID and see personalised picks.
Python (Pandas, Numpy, Scikit-learn), Excel

Will You Finish the Show?

Netflix wants to know if viewers will stick with a new series or stop halfway. You’ll com-
bine its public rating file with basic genre tags (comedy, drama, etc.). After cleaning the
data, create simple story-telling numbers like “days since release” and “average episodes
watched per week.” Train three common prediction models and see which guesses view-
ing time most accurately. Draw colourful “explain-me” charts that show, for instance, that
comedies released on Fridays keep people watching longer. End by building a friendly
dashboard where Netflix staff can tweak a show’s genre or release date and watch the
predicted viewing hours change.
Python (Pandas, Scikit-learn, SHAP/Matplotlib), Power BI, Jupyter Notebook

Fair-Price Helper for Hosts

New Airbnb hosts often ask, “What should I charge tonight?” Using an open dataset of
listings from major cities, you’ll build a pricing adviser. Clean messy fields like room size
and amenity lists, and group homes into neighbourhood clusters. Add helpful flags such
as “summer weekend” or “has private balcony.” Try out three pricing models—from a
straight-line formula to an advanced CatBoost tree—and compare their errors. The best
model is plugged into a simple form: enter your home’s details and today’s date, get a
suggested nightly rate and a “confidence band” so hosts know the safe range.

Python (Pandas, CatBoost, Scikit-learn), Excel


PORTFOLIO PROJECTS

How Much Coffee to Brew?

Starbucks stores don’t want to brew too little (lost sales) or too much (waste). You’ll use a
public sales file that includes daily cups sold, special offers, and local weather. Explore for
patterns like “rainy Mondays boost latte sales.” Then compare three forecasting methods,
from classic seasonal charts to a popular Facebook tool called Prophet. Check which
predicts next month’s demand most closely. Build a one-page results dashboard: store
managers can type the store ID and see how many cups of each drink to prep for the
coming weeks.
Python (Pandas, Prophet, Seaborn), Power BI, Excel

Will You Finish the Show?

Replacing a car battery pack is expensive, so Tesla wants an early warning. You’ll analyse
an open set of battery test readings: charge cycles, temperatures, and voltage curves.
Clean and normalise the numbers, then pull out tell-tale signs like “capacity dropping faster
than normal.” Train both a traditional rule-based model and a simple neural network.
Measure which one flags failing cells sooner without too many false alarms. Use an
explainer chart to show which parts of the voltage curve trigger the alert. Finish with a
short brief estimating how many warranty claims could be avoided each year.

Python (Numpy, Pandas, Keras/TensorFlow or Scikit-learn), Matplotlib/Seaborn, Excel


CASE STUDIES

Analyzing Supermarket Sales


Analyze the three-month sales of a supermarket chain and depict the strategies that can be
implemented soon to improve the product sales along with the customer ratings.

Analyzing the COVID-19 World Vaccination Progress


Analyze the COVID-19 World Vaccination progress data to observe and draw inferences on
vaccination statuses around the world.

Analyzing Global Superstore Sales


Analyze the sales data of the superstore to discern the trend of sales and profit of the
items sold.

Analyzing Zomato Restaurants


Analyze the restaurant ratings to differentiate the high-rated restaurants from the low-rated
ones by location.

Analyzing Employee Retention and Satisfaction


Analyze the HR data and visualize the insights to decide what steps should be taken for
employee retention and satisfaction.

House-price Prediction
Predict the final price of each home based on 79 explanatory variables describing (almost)
every aspect of residential homes in Ames, Iowa.
KEY FACULTY PROFILES

Kartik Mudaliar
• MS, Computer Science | KTH Royal Institute of Technology
• B.Tech, IT | Dharmsinh Desai University

8+ Years of Experience

Faculty Faculty
L&T Technology Infosys

Upendra Kumar
• M.Tech | Mahamaya Technical University

17+ Years of Experience

7+ Years of Experience

Data Science & Machine Learning Trainer


Synergistic Compusoft Pvt. Ltd.

Shakul Malik
• Master’s in Computer Science | MDU, Rohtak
• B.Sc. in Computer Science | MDU, Rohtak

14+ Years of Experience

Sr. Data Architect Data Engineering Trainer Data Analyst Trainer


Atharva AI TCS Michelin Tyres
KEY FACULTY PROFILES

Vigneshwar V
•Master's degree, Manufacturing Systems and Management |
College of Engineering, Guindy

09+ Years of Experience

Senior Al Consultant & Corporate Trainer


NTT DATA

Soumita Mukherjee
• MBA, Marketing | GIM, Goa
• Bachelor in Design | NIFT

15+ Years of Experience

Account Manager Marketing Manager Category Manager


Amazon Pidilite Industries HUL

Dr. Nitin Sachdeva


• Leading AI Solution at TVS
• Phd from Delhi University

19+ Years of Experience

Principal Data Scientist Senior Manager


TVS Protiviti India
KEY FACULTY PROFILES

Vasudev Gupta
• Master's degree, Artificial Intelligence & Machine Learning |
Indian Institute of Technology, Kanpur

10+ Years of Experience

Head of Data Science & AI


DecisionTree Analytics & Services

Dr. Avinash Kumar Singh


• Doctor of Philosophy (Ph.D.), Information Technology |
Indian Institute Of Information Technology Allahabad

14+ Years of Experience

AI Consultant, Mentor and Coach


Robaita

Jayantilal Bhanushali
• Bachelor of Technology - BTech, Computer Science |
University of Mumbai

11+ Years of Experience

Deputy Vice President - AI in Cybersecurity


Banking Sector
KEY FACULTY PROFILES

Rajan Chettri
• Masters in Computer Application |
Sikkim Manipal Institute of Technology - SMU

15+ Years of Experience

Senior Subject Matter Expert ( SRE / DevOps)


mthree
LEARNING HOURS
Assignments
TOTAL and Live Online Career
LIVE LEARNING
COURSE DURATION Self-Paced Projects (Faculty-Led) Services
SESSIONS

5
Months
10
Hours
50+
Hours
100+
Hours
10
Hours

TOTAL
TOTAL EFFORT
(Learning Hours + Self Work) 170+ Hours WEEKLY EFFORT 7-10 Hours
per week
CERTIFICATION
On successful completion of the program, you will be eligible for the following certificate*

*
Certificates are indicative and subject to change
THE HERO GROUP
IN EDUCATION
The Hero Group has made significant contributions in the field
of K12, medical education and higher education.

IN PRIMARY AND HIGHER SECONDARY EDUCATION

Raman Munjal BCM Chain Green Meadows


Vidya Mandir of Schools School

IN HIGHER EDUCATION

ISB Founding BML Munjal Dayanand Medical


Members University College & Hospital

THE HERO STORY


The Hero Group is one of the leading business conglomerates in the world.
The company saw its humble beginning in 1956 when the four Munjal brothers
migrated to Ludhiana from Kamalia (now in Pakistan). As first-generation
entrepreneurs, they started out by manufacturing bicycle components and then
rapidly expanding the business. From there, they continued their growth story
by diversifying and deepening their expertise across domains.

Today, the US $5 billion diversified Hero Group is a conglomerate of Indian


companies with primary interests and operations in automotive
manufacturing, financing, renewable energy, electronics and education.
Wait Nahi, Great Kar!

Want more information on this program?


Reach us at 1800 309 3939 | Visit us at www.herovired.com

You might also like