Machine Learning Deep Learning AI and Data Science
The document provides a comprehensive overview of machine learning, deep learning, artificial intelligence, and data science, detailing definitions, applications, and interconnectedness of these fields. It discusses the significance of algorithms, data types, and various applications such as credit risk assessment and image recognition. Additionally, it outlines necessary skills and training paths for aspiring data scientists and dispels common myths associated with the profession.
Introduction to key concepts: Machine Learning (ML), Deep Learning (DL), Artificial Intelligence (AI), Data Science (DS). Discusses the common thoughts and foundational terms.
Defines Machine Learning; leverages historical data for predictions and explores various data types (numerical, image, video, sound, text).
Examples including credit risk models, marketing analytics, fraud detection, demonstrating ML’s practical value on numerical datasets.
Outlines ML applications in computer vision—face recognition, object recognition, self-driving cars, emphasizing image processing.
Focuses on NLP applications like sentiment analysis, topic extraction, and document classification, showing ML’s use in text.
Introduction to Deep Learning, specifically Artificial Neural Networks (ANNs), and their structure, including deep vs shallow networks.
Comparison of deep and shallow networks, discussing their flexibility, efficiency in learning features, and applications in AI.
Explains how Machine Learning is a subset of AI and how it integrates into Data Science, highlighting various applications and roles.
Describes the learning journey for aspiring data scientists, detailing skill levels and suggested tools to practice through various stages. Debunks common myths surrounding data science and ML, such as the need for extensive math skills or handling large datasets.
Introduction of e-learning modules, trainer's experience, and overall training statistics illustrating the effectiveness of the presented course.
Activity
Close you reyes an d th in k ab ou t two terms –
Mac h in e Learn in g an d A rtific ial Intelligen c e.
• What is the first thing that comes to
your mind when you hear these
terms
• Machine Learning
• Artificial Intelligence
3
Machine learning (ML)is the scientific study of
algorithms and statistical models that computer
systems use to effectively perform a specific task
without using explicit instructions, relying on models
and inference instead.
7
OK GOOGLE …WHAT IS
MACHINE LEARNING?
• Using historicaldata to make future
predictions
• Building models on historical data to
predictions
• Taking training data, building models on
the training data using the models to
make the future predictions
• Making the machine learn the patterns
in the data
9
IN SIMPLETERMS ..
10.
10
DATA IS INDIFFERENT FORMS
Numerical data
Image data (pixel intensities)
Video data (frames per second)
Sound data (waves)
Text data (tweets, comments, feedback)
11.
APPLICATIONS OF NUMERICALDATA
11
Identifying risky customers
before offering a loan
CREDIT RISK MODELS
Do you receive any marketing
calls? Have you ever received
any marketing call for Audi car?
MARKETING ANALYTICS
Have you ever wondered, why
only you are getting
promotional offers on cloths
and accessories where as I am
getting offers on apartments?
RETAIL SALES ANALYTICS
How does a bank decide the
potential fraud transactions
from millions of credit card
swipes?
FRAUD ANALYTICS
12.
• Face recognition– Using image as input data
• Object recognition – Pixels is the input data
• Digit recognition – Using text as image
• Self Driving Cars – Using video data as input
12
APPLICATIONS OF
MACHINE LEARNING –
IMAGES ANDVIDEO DATA
16
• ANN- ArtificialNeural Network
• ANN is one of the technique in Machine Learning
• ANN has input layer , hidden layer and output layer
• For a really complex and non liner datasets we need several hidden
layers
• ANN with multiple hidden layers is known as deep neural network
ANN
17.
17
• ANN witha single layer is known as shallow network
• ANN with multiple hidden layers is known as deep neural network
• Not just multiple hidden layers sometimes the type of hidden layer is
also different.
• This concept of solving problems with multiple hidden layers is known
as deep learning
DEEP LEARNING
18.
DEEPVS SHALLOW NETWORKS
18
Aneural network with single hidden layer
is called a shallow network
A neural network with more than one
hidden layer is called deep neural
network
shallow network Deep network
19.
DEEPVS SHALLOW NETWORKS
19
Asingle layer might not have the
flexibility to capture all the non linear
patterns in the data
A deep network first learns the primitive
features followed by high level features.
This helps in building efficient models
shallow network Deep network
20.
20
• Lot ofexperiments have shown that a deep network with less
parameters performs better than a shallow network
• For example deep network with hidden nodes [10,10,10,10] might
perform better than shallow network with [80] hidden nodes
• Deep neural networks are amazingly powerful.
• With sufficient number of hidden layers and nodes, we can fit a model
to any type of data
• They have the power to capture any amount of non linearity
DEEP NEURAL NETWORKS
21.
DEEP LEARNING ISA SUBSET OF MACHINE LEARNING
21
Machine Learning
Deep Learning
MACHINE LEARNING MODELS
25
Newdata Apply Model
Class2
Get Prediction
One way models
This prediction can be
right or wrong
26.
AI = MACHINELEARNING MODELS + FEEDBACK LOOP
26
Training data Model
27.
AI = MACHINELEARNING MODELS + FEEDBACK LOOP
27
New data Apply Model
Class2
Get Prediction
Feedback Loop
28.
AI = MACHINELEARNING MODELS + FEEDBACK LOOP
28
Update Training
data based on
feedback
Update the Model
based on data Prediction
Feedback Loop
Class2
29.
• Manual entryafter going through test
cases – Google maps
• Indirect feedback collection based on
user actions for - User click vs not
click on your YouTube ad
• Indirect feedback collection based on
actions – In case of self driving car,
hitting a wall is an action.
29
HOW IS FEEDBACK
COLLECTED
30.
• Self drivingcars
• SIRI / Ok-google
• Alexa /Google home
• Recommendation systems
• Image recognition
• Speech recognition
• Spam filtering
30
APPLICATIONS OF AI
31.
MACHINE LEARNING ISA SUBSET OF ARTIFICIAL
INTELLIGENCE
31
Artificial Intelligence
Machine Learning
Deep Learning
• Data DrivenDecision making
• Making sense out of data
• Finding hidden patterns in the data
• Analysis using not just machine
learning models but also using data
visualizations, intelligent reports
• Most of the techniques and tools
seen in data analysis in early days are
now falling under data science
33
WHAT IS DATA SCIENCE?
34.
• Mathematics
• Statistics
•Coding
• Database management
• Data Analytics
• Predictive modelling
• Machine Learning
• Deep Learning
34
DATA SCIENCE IS A
FUSION OF MANY FIELDS
35.
DATA SCIENCE– FOURMAJORTYPE OF SKILLS
35
Database
Analytics & ML
Bigdata
Presentation
36.
THETECHNIQUESYOU NEEDTO KNOW
Database
Knowledge
•Database Management
•Data blending
•Querying
•Data manipulations
•ETL
Predictive Analytics
& ML
•Basic descriptive
statistics
•Advanced analytics
•Predictive modeling
•Machine Learning
Big Data knowledge
•Distributed Computing
•Big Data analytics
•Unstructured data
analysis
Presentation Skill
•Data visualizations
•Report design
•Insights presentation
36
DATA SCIENCE -DESIGNATIONS
DatabaseDeveloper
ETL Developer
MIS & DB Developer
Data Architect
Data Engineer
Data Analyst
Statisticians
Business Analyst
Data Scientist
Bigdata Developer
Hadoop Developer
Software Engineer
MIS Analyst
Reporting Analyst
Business Analyst
38
39.
Data Science
MACHINE LEARNINGIS A PART OF DATA SCIENCE
Statinfer.com
39
Statinfer.com
Artificial Intelligence
Machine Learning
* These are individual interpretations
Deep Learning
FAQ BY DATASCIENCE
ASPIRANTS
• I want to be data scientist what
training should I take?
• I already have knowledge on few
tools, what are my next steps?
• What skill should I add to my profile
to make it to next level?
• I am new to data science, where can I
start ?
41
42.
You need trainingbased on your skill level.
Based on skill set we can divide the whole data
science aspirants into four categories
1. Beginner - Completely new to Data Science
and ML
2. Intermediate - MIS and Reporting Analyst
3. Advanced – Data Analyst and Predictive
Modeler
4. Complete Data Scientist – ML, Hadoop, R,
Python, DL, AI
42
CATEGORIES OF PROFILES
43.
THE LEARNING PATH
Statinfer.com
43
Tools& Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
44.
THE LEARNING PATH– OUR SUGGESTIONS
Statinfer.com
44
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
1. Do not try to learn all the steps in one sitting.
2. You need to learn, absorb and then practise before you reach the next step
Stage-1 Stage-2 Stage-3
45.
THE LEARNING PATH– OUR SUGGESTIONS
Statinfer.com
45
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
1. R or Python. Both are really good. Pick any one of them
2. It also depends on your business problem
3. If you are planning to learn deep learning then go for python
Stage-1 Stage-2 Stage-3
46.
THE LEARNING PATH– OUR SUGGESTIONS
Statinfer.com
46
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
1. Do not start with stage-2 or stage-3 directly.
2. Strong fundamentals will make the learning easy in later stages.
Stage-1 Stage-2 Stage-3
47.
THE LEARNING PATH– OUR SUGGESTIONS
Statinfer.com
47
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
1. While learning these concepts, try to avoid academic style courses.
2. Look for the courses with lot of hands-on exercises and case studies
Stage-1 Stage-2 Stage-3
48.
THE LEARNING PATH– OUR SUGGESTIONS
Statinfer.com
48
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
1. Do not focus on the tool, focus on the technique and algorithm
2. Learning python or R tool, will not make you a data scientist
Stage-1 Stage-2 Stage-3
FOCUS IS ONFIRSTTWO STAGES
Statinfer.com
50
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
Stage-1 Stage-2 Stage-3
51.
DURATION – 10DAYS
Statinfer.com
51
Tools & Coding
R/SAS/Python/
Hadoop/Weka
Basic Statistics and
Mathematics
Basic Algorithms -
Regression,
Classification and
Segmentation
Advanced ML
Algorithms -Neural
Networks, SVMs,
Random Forest and
Boosting
Deep Learning
Models
CNN, RNN and LSTM
AI Models
Deep Q Learning
Reinforced Learning
Markov Decision
process
Stage-1 Stage-2 Stage-3
52.
TWO PHASES
Python fordata science
Data manipulations in python
Basic Statistics
Data validation and Cleaning
Regression
Logistic Regression
Decision Trees
Cluster Analysis
Model Selection and Cross
validation
52
PHASE-1 (5DAYS)
ANN – Artificial Neural networks
SVM – Support Vector Machines
Random Forest
Boosting
NLP & Text mining
TensorFlow & keras
Deep Learning Models
Convolution Neural Network
Recurrent Neural Networks
PHASE-2 (5DAYS)
53.
• 100% Hands-onTraining
• 30 case studies laced in the course
• Created for Non- Statisticians
• Datasets from multiple domains,
codes files and in class exercises
• Team assignments and mentoring
• Final Assessment
• E-learning material support
53
COURSE FEATURES
MYTH-1 : MATHEMATICS
Myth-1: To be a good data scientist, you need to be exceptional at
statistics, mathematics, calculous, algorithms etc.,
Not necessarily.
Statinfer.com
55
56.
MYTH-2 : PROGRAMMING
Myth-2: To be a good data scientist, you need to have exceptional coding
skills like Python, Java, C++ etc.,
Not necessarily.
Statinfer.com
56
57.
MYTH-3 : COMPLICATEDMODELS
Myth-3 :Data science is all about building complex predictive and
machine learning models to solving business problems
Not necessarily.
Statinfer.com
57
58.
MYTH-4 : MODELBUILDING
Myth-4 :After collection of the data, most of the time is spent on model
building process.
Not necessarily.
Statinfer.com
58
59.
MYTH-5 : LARGEDATASETS
Myth-5:While solving machine learning problems we need to handle
really large datasets or most of the datasets are really large
Not necessarily.
Statinfer.com
59
60.
MYTH-6 : MACHINELEARNING IN BUSINESS
Myth-6:Companies use really advanced deep learning and AI models for
while building all their business strategies
Not necessarily.
Statinfer.com
60
61.
MYTH-7 : DIVERSEALGORITHMS
Myth-7: A data scientist will be using all the models in their day to day
life
Not necessarily.
Statinfer.com
61
62.
Have you triedour app?
We have data science and machine learning quiz app
to make the learning easy and fun
Click here
62
ABOUTVENKATA REDDY KONASANI
•11 Years into Data Analytics
• 5+ Years into Training
• Author of the book “Practical Business Analytics
using SAS”
• Statinfer.com – Key member in the core team
• Work Experience
• HP – Data Scientist
• Trend wise Analytics -Data Scientist
• HSBC – Data Analyst
• Citi – Risk Analyst
• Masters in Applied Statistics and Informatics from
IIT Bombay
66
5,000+
Training Hours
1,500+
Participants
75+
Corporate Batches
25+
Companies