POSTGRADUATE PROGRAM IN
DATA SCIENCE
About the Program
The Foundation of Data Science is a practical, outcomes-driven program designed for learners who
want to become effective contributors in data-driven teams. It takes you through the full journey of
working with data—starting from accessing and exploring raw datasets to interpreting trends and
building models that drive business outcomes.
In the first phase, you’ll learn to identify relevant data, structure it for analysis, and apply logic to
filter and organize it efficiently. You’ll gain clarity on how to approach real-world business questions
using a structured, data-first mindset.
Next, you’ll dive into analyzing and summarizing data to uncover patterns, outliers, and correlations
that can influence decisions. You’ll work with large datasets to understand distributions, central
tendencies, and variability—key to identifying what’s normal, what’s unusual, and what matters most
in a business context.
As you progress, you’ll learn to:
• Evaluate assumptions and test business hypotheses using statistical methods
• Build forecasting and classification models that predict business outcomes or customer
behavior
• Break down a business challenge into a step-by-step data analysis plan
• Translate raw insights into clear business recommendations through visual summaries and
storytelling
In the final phase, you’ll work on a capstone project simulating a real business scenario—where
you’ll source and prepare data, analyze it, build a solution, and present your findings in a
professional format.
Program Highlights
Immersive & Hands-on Learning – Case-based, interactive sessions with
real-world problem-solving, applied analytics workflows, and guided
exercises across business scenarios.
Comprehensive Job-Focused Curriculum – Covers the full data journey: from
querying and analyzing data to applying statistical techniques and building
predictive models — ensuring you're ready for analytical roles.
Capstone & Applied Projects – 22-week program with integrated
assessments and a capstone project to demonstrate your ability to solve
real-world data problems end-to-end.
Live & Expert-Led Training – 70 - 90% of sessions delivered live by
experienced faculty and industry professionals with deep expertise in
data-driven decision-making.
Career Enablement & Readiness – Build a portfolio of work, gain practical
exposure, and receive structured career guidance to confidently pursue
roles in analytics and business intelligence.
Accessible to All Graduates – No prior coding or data experience
required—ideal for individuals from diverse academic backgrounds looking
to enter data-centric roles.
Tools &
5+ PORTFOLIO Technologies
projects and case studies
MS Excel Power BI Python
70%-90%
Instructor-led live online
classes
7-10 HOURS
weekly effort CAREER
COACHING
and mentorship
CURRICULUM*
PROGRAM DURATION : 5-months
Approximately 7-10 Hours of student effort expected per week
Projects integrated through the curriculum*
MODULE SESSION LEARNING OUTCOMES TOOLS USED
Build a strong foundation in managing and querying
structured data using relational databases.
- Understand how databases store, structure, and manage data for
business use.
Deep Dive into SQL (MySQL or
Database & - Write SQL queries to filter, join, and aggregate datasets for insights. equivalent), DBMS
Query Writing platforms
- Use advanced SQL techniques like subqueries, CTEs, and window
functions.
- Apply SQL skills in analytics tasks like reporting, segmentation, and
performance tracking.
Learn to write Python code and automate data workflows
essential for analytics and reporting.
- Write clean, efficient Python code using variables, loops, and functions.
Python for Python, Jupyter
- Work with data structures (lists, dictionaries, sets) for organizing data.
Data Analysis Notebook
- Read and write data files (especially CSVs) using Python I/O functions.
- Use control structures and lambda functions for efficient data
processing.
Gain the ability to analyze data statistically and support
decision-making with evidence.
Statistical - Summarize datasets using central tendency, dispersion, and skewness.
Analysis & Python (NumPy,
- Apply probability concepts and distributions in real-world scenarios.
Decision SciPy), Excel
Making - Conduct hypothesis testing and interpret p-values, t-tests, and ANOVA.
- Translate statistical outputs into clear business recommendations.
*The curriculum is subject to changes
CURRICULUM*
MODULE SESSION LEARNING OUTCOMES TOOLS USED
Learn to transform, clean, and explore data to identify
trends and prepare it for modeling.
- Conduct univariate, bivariate, and multivariate exploratory analysis.
Python (Pandas,
Exploratory Data - Handle missing values, detect outliers, and apply data transformation Seaborn, Matplotlib),
Analysis (EDA) techniques. Excel
- Create group summaries, pivot tables, and correlation visualizations.
- Use EDA techniques to prepare high-quality, analysis-ready datasets.
Discover how to build predictive models that solve real
business problems.
- Understand machine learning concepts and build models using
Machine real-world data. Python (Scikit-learn,
Learning using Pandas, Matplotlib),
Python - Apply regression, classification, and clustering techniques effectively.
Jupyter Notebook
- Perform data preprocessing, feature engineering, and model tuning.
- Evaluate model performance using industry-standard metrics and
validation methods.
The capstone lets you prove your fundamental analytics
chops: take a messy public dataset, clean and shape it
with Python and SQL, explore patterns with statistics and
Capstone visualisations, then craft a concise data-story that drives
a clear business insight. It’s your hands-on showcase of
core scripting, querying, EDA, and storytelling skills.
*The curriculum is subject to changes
PORTFOLIO PROJECTS
Smarter Product Suggestions
Amazon’s huge review file tells who bought what and how they rated it. You’ll turn that
information into a mini “You May Also Like” engine. First, tidy the data: remove duplicates,
fill in missing ratings, and group purchases by customer. Next, build two simple
recommenders: “top sellers” and “people who bought X also bought Y.” Then try a smarter
maths trick called matrix factorisation that spots hidden patterns of taste. Test which
method offers the best five suggestions for each shopper. Finally, wrap the winner in a tiny
web app so anyone can enter a customer ID and see personalised picks.
Python (Pandas, Numpy, Scikit-learn), Excel
Will You Finish the Show?
Netflix wants to know if viewers will stick with a new series or stop halfway. You’ll com-
bine its public rating file with basic genre tags (comedy, drama, etc.). After cleaning the
data, create simple story-telling numbers like “days since release” and “average episodes
watched per week.” Train three common prediction models and see which guesses view-
ing time most accurately. Draw colourful “explain-me” charts that show, for instance, that
comedies released on Fridays keep people watching longer. End by building a friendly
dashboard where Netflix staff can tweak a show’s genre or release date and watch the
predicted viewing hours change.
Python (Pandas, Scikit-learn, SHAP/Matplotlib), Power BI, Jupyter Notebook
Fair-Price Helper for Hosts
New Airbnb hosts often ask, “What should I charge tonight?” Using an open dataset of
listings from major cities, you’ll build a pricing adviser. Clean messy fields like room size
and amenity lists, and group homes into neighbourhood clusters. Add helpful flags such
as “summer weekend” or “has private balcony.” Try out three pricing models—from a
straight-line formula to an advanced CatBoost tree—and compare their errors. The best
model is plugged into a simple form: enter your home’s details and today’s date, get a
suggested nightly rate and a “confidence band” so hosts know the safe range.
Python (Pandas, CatBoost, Scikit-learn), Excel
PORTFOLIO PROJECTS
How Much Coffee to Brew?
Starbucks stores don’t want to brew too little (lost sales) or too much (waste). You’ll use a
public sales file that includes daily cups sold, special offers, and local weather. Explore for
patterns like “rainy Mondays boost latte sales.” Then compare three forecasting methods,
from classic seasonal charts to a popular Facebook tool called Prophet. Check which
predicts next month’s demand most closely. Build a one-page results dashboard: store
managers can type the store ID and see how many cups of each drink to prep for the
coming weeks.
Python (Pandas, Prophet, Seaborn), Power BI, Excel
Will You Finish the Show?
Replacing a car battery pack is expensive, so Tesla wants an early warning. You’ll analyse
an open set of battery test readings: charge cycles, temperatures, and voltage curves.
Clean and normalise the numbers, then pull out tell-tale signs like “capacity dropping faster
than normal.” Train both a traditional rule-based model and a simple neural network.
Measure which one flags failing cells sooner without too many false alarms. Use an
explainer chart to show which parts of the voltage curve trigger the alert. Finish with a
short brief estimating how many warranty claims could be avoided each year.
Python (Numpy, Pandas, Keras/TensorFlow or Scikit-learn), Matplotlib/Seaborn, Excel
CASE STUDIES
Analyzing Analyzing the COVID-19
Supermarket Sales World Vaccination Progress
Analyze the three-month sales of a supermarket Analyze the COVID-19 World Vaccination
chain and depict the strategies that can be progress data to observe & draw inferences on
implemented soon to improve the product sales vaccination statuses around the world.
along with the customer ratings.
Analyzing Global Analyzing Zomato
Superstore Sales Zomato
Restaurants
Analyze the sales data of the superstore to Analyze the restaurant ratings to differentiate
discern the trend of sales and profit of the the high-rated restaurants from the low-rated
items sold. ones by location.
Analyzing Employee House-price
Retention & Satisfaction Prediction
Analyze the HR data and visualize the Predict the final price of each home based on
insights to decide what steps should be 79 explanatory variables describing (almost)
taken for employee retention and satisfaction. every aspect of residential homes in Ames,
Iowa.
Who Should Enroll
The Postgraduate Program in Data Science is ideal for professionals and aspiring analysts who
want to build strong foundations in data science and bridge the gap between raw data handling
and impactful business decision-making. Whether you are a fresher looking to start your career
in analytics or a working professional seeking to upskill, this program will prepare you with
hands-on projects, advanced tools, and industry-driven outcomes to accelerate your career in
the data domain.
Professionals from these industries will benefit
• Finance & Banking – Loan Default Prediction: Classify customers into risk groups using
decision trees or logistic regression
• Marketing & Customer Analytics – Customer Churn Prediction: Use logistic regression to
identify at-risk customers and suggest retention strategies
• HR & People Analytics – Attrition Prediction Model: Predict employee attrition with
minimal-feature classification approaches
• Logistics & SCM – Delivery Time Prediction: Forecast delivery duration using historical data
for logistics optimization
• IT & Technology (Infrastructure Management) – Failure Prediction: Predict failures before
they happen using machine learning on server logs. Enable businesses to prevent costly
downtimes and maintain robust, reliable IT infrastructure with data-driven maintenance
strategies.
*These are representative industry use cases.
Eligibility
• Bachelor's degree in any discipline
• STEM background preferred
By enrolling, you’ll gain hands-on training across Python, SQL, and ML techniques, complete
portfolio-ready projects, and graduate ready to take on impactful roles in analytics and data
science
KEY FACULTY PROFILES
Kartik Mudaliar
• MS, Computer Science | KTH Royal Institute of Technology
• B.Tech, IT | Dharmsinh Desai University
8+ Years of Experience
Faculty Faculty
L&T Technology Infosys
Upendra Kumar
• M.Tech | Mahamaya Technical University
17+ Years of Experience
7+ Years of Experience
Data Science & Machine Learning Trainer
Synergistic Compusoft Pvt. Ltd.
Shakul Malik
• Master’s in Computer Science | MDU, Rohtak
• B.Sc. in Computer Science | MDU, Rohtak
14+ Years of Experience
Sr. Data Architect Data Engineering Trainer Data Analyst Trainer
Atharva AI TCS Michelin Tyres
KEY FACULTY PROFILES
Vigneshwar V
•Master's degree, Manufacturing Systems and Management |
College of Engineering, Guindy
09+ Years of Experience
Senior Al Consultant & Corporate Trainer
NTT DATA
Soumita Mukherjee
• MBA, Marketing | GIM, Goa
• Bachelor in Design | NIFT
15+ Years of Experience
Account Manager Marketing Manager Category Manager
Amazon Pidilite Industries HUL
Dr. Nitin Sachdeva
• Leading AI Solution at TVS
• Phd from Delhi University
19+ Years of Experience
Principal Data Scientist Senior Manager
TVS Protiviti India
KEY FACULTY PROFILES
Vasudev Gupta
• Master's degree, Artificial Intelligence & Machine Learning |
Indian Institute of Technology, Kanpur
10+ Years of Experience
Head of Data Science & AI
DecisionTree Analytics & Services
Dr. Avinash Kumar Singh
• Doctor of Philosophy (Ph.D.), Information Technology |
Indian Institute Of Information Technology Allahabad
14+ Years of Experience
AI Consultant, Mentor and Coach
Robaita
Jayantilal Bhanushali
• Bachelor of Technology - BTech, Computer Science |
University of Mumbai
11+ Years of Experience
Deputy Vice President - AI in Cybersecurity
Banking Sector
KEY FACULTY PROFILES
Rajan Chettri
• Masters in Computer Application |
Sikkim Manipal Institute of Technology - SMU
15+ Years of Experience
Senior Subject Matter Expert ( SRE / DevOps)
mthree
LEARNING HOURS
Assignments
TOTAL and Live Online Career
LIVE LEARNING
COURSE DURATION Self-Paced Projects (Faculty-Led) Services
SESSIONS
5
Months
10
Hours
50+
Hours
100+
Hours
10
Hours
TOTAL
TOTAL EFFORT
(Learning Hours + Self Work) 170+ Hours WEEKLY EFFORT 7-10 Hours
per week
CERTIFICATION
On successful completion of the program, you will be eligible for the following certificate*
*
Certificates are indicative and subject to change
THE HERO GROUP
IN EDUCATION
The Hero Group has made significant contributions in the field
of K12, medical education and higher education.
IN PRIMARY AND HIGHER SECONDARY EDUCATION
Raman Munjal BCM Chain Green Meadows
Vidya Mandir of Schools School
IN HIGHER EDUCATION
ISB Founding BML Munjal Dayanand Medical
Members University College & Hospital
THE HERO STORY
The Hero Group is one of the leading business conglomerates in the world.
The company saw its humble beginning in 1956 when the four Munjal brothers
migrated to Ludhiana from Kamalia (now in Pakistan). As first-generation
entrepreneurs, they started out by manufacturing bicycle components and then
rapidly expanding the business. From there, they continued their growth story
by diversifying and deepening their expertise across domains.
Today, the US $5 billion diversified Hero Group is a conglomerate of Indian
companies with primary interests and operations in automotive
manufacturing, financing, renewable energy, electronics and education.
Wait Nahi, Great Kar!
Want more information on this program?
Reach us at 1800 309 3939 | Visit us at www.herovired.com