How To Make The Best Use Of Live Sessions
• Please log in 10 minutes before the class starts and check your internet connection to avoid any network issues during the LIVE session
• All participants will be on mute by default to avoid any background noise. However, the instructor will unmute you if required. Please use the “Questions” tab on your webinar tool to interact with the instructor at any point during the class
• Feel free to ask and answer questions to make your learning interactive. The instructor will address your queries at the end of the ongoing topic
• Raise a ticket through your LMS in case of any queries. Our dedicated support team is available 24 x 7 for your assistance
• Your feedback is very much appreciated. Please share feedback after each class, which will help us enhance your learning experience
Course Outline
▪ Introduction to Python
▪ Sequences and File Operations
▪ Deep Dive - Functions, OOPS, Modules, Errors and Exceptions
▪ Introduction to Numpy, Pandas and Matplotlib
▪ Data Manipulation
▪ Introduction to Machine Learning with Python
▪ Supervised Learning - I
▪ Dimensionality Reduction
▪ Supervised Learning - II
▪ Unsupervised Learning
▪ Association Rules Mining and Recommendation Systems
▪ Reinforcement Learning
▪ Time Series Analysis
▪ Model Selection and Boosting
Supervised Learning - II
Topics
The topics covered in this module are:
▪ Naïve Bayes Classifier
▪ Support Vector Machine (SVM)
▪ Hyperparameter Optimization
▪ Grid Search vs Random Search
Objectives
After completing this module, you should be able to:
▪ Understand what the Naïve Bayes classifier is
▪ Follow the Naïve Bayes classifier steps
▪ Build likelihood tables
▪ Predict the output
▪ Implement Naïve Bayes with GaussianNB() in Python
▪ Understand the Support Vector Machine (SVM) classifier
▪ Analyze how SVM works
▪ Perform hyperparameter optimization
▪ Compare Grid Search vs Random Search
Naïve Bayes Classifier
Let’s understand the Naïve Bayes classifier using the same use case:
‘Game Decision Forecast using Weather Data’
Naïve Bayes Classifier
It is a classification technique based on Bayes' Theorem with an assumption of independence among predictors.
In simple terms, a Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated
to the presence of any other feature.
Let’s find what the values in Bayes’ Theorem are:

P(c|x) = P(x|c) * P(c) / P(x)

where:
▪ P(c|x) is the posterior probability of class c given predictor x
▪ P(x|c) is the likelihood: the probability of predictor x given class c
▪ P(c) is the class prior probability
▪ P(x) is the predictor prior probability
Naïve Bayes Classifier Steps
First we will create a frequency table for each attribute of the dataset:

Frequency Table – Outlook
             Play: Yes   Play: No
Sunny            2           3
Overcast         4           0
Rainy            3           2

Frequency Table – Humidity
             Play: Yes   Play: No
High             3           4
Normal           6           1

Frequency Table – Wind
             Play: Yes   Play: No
Weak             6           2
Strong           3           3
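A minimal sketch of building these frequency tables with pandas; the DataFrame df, the file name 'weather.csv', and the column names ('Outlook', 'Humidity', 'Wind', 'Play') are assumptions standing in for the 14-day weather dataset:

import pandas as pd

# Hypothetical 14-day weather dataset; 'weather.csv' is an assumed file name
df = pd.read_csv('weather.csv')

# One frequency table per attribute: rows are attribute values,
# columns are the Play classes ('Yes' / 'No')
for attribute in ['Outlook', 'Humidity', 'Wind']:
    print(pd.crosstab(df[attribute], df['Play']), '\n')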
Building Likelihood Tables
For each frequency table we will generate a likelihood table:

Likelihood Table – Outlook
             Play: Yes   Play: No
Sunny           2/9         3/5        5/14
Overcast        4/9         0/5        4/14
Rainy           3/9         2/5        5/14
                9/14        5/14

P(x|c) = P(Sunny|Yes) = 2/9  = 0.22
P(x)   = P(Sunny)     = 5/14 = 0.36
P(c)   = P(Yes)       = 9/14 = 0.64

The likelihood of ‘Yes’ given Sunny is
P(c|x) = P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny) = (0.22 x 0.64) / 0.36 = 0.3911

Similarly, the likelihood of ‘No’ given Sunny is
P(c|x) = P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny) = (0.60 x 0.36) / 0.36 = 0.60
Building Likelihood Tables
Similarly, the likelihood tables of the other attributes are:

Likelihood Table – Humidity
             Play: Yes   Play: No
High            3/9         4/5        7/14
Normal          6/9         1/5        7/14
                9/14        5/14

P(Yes|High) = 0.33 x 0.64 / 0.5 = 0.42
P(No|High)  = 0.80 x 0.36 / 0.5 = 0.58

Likelihood Table – Wind
             Play: Yes   Play: No
Weak            6/9         2/5        8/14
Strong          3/9         3/5        6/14
                9/14        5/14

P(Yes|Weak) = 0.67 x 0.64 / 0.57 = 0.75
P(No|Weak)  = 0.40 x 0.36 / 0.57 = 0.25
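Dividing each count by its class total gives a likelihood table directly; a short sketch continuing the hypothetical df from the earlier frequency-table sketch:

# P(attribute value | class): normalize each 'Play' column to sum to 1
likelihood = pd.crosstab(df['Outlook'], df['Play'], normalize='columns')
print(likelihood)  # e.g. P(Sunny|Yes) = 2/9 = 0.22, P(Sunny|No) = 3/5 = 0.60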
Predicting the Output
Suppose we have a day with the following values:

Outlook  = Rain
Humidity = High
Wind     = Weak
Play     = ?

Likelihood of ‘Yes’ on that day = P(Outlook=Rain|Yes) * P(Humidity=High|Yes) * P(Wind=Weak|Yes) * P(Yes)
= 3/9 * 3/9 * 6/9 * 9/14 = 0.0476

Likelihood of ‘No’ on that day = P(Outlook=Rain|No) * P(Humidity=High|No) * P(Wind=Weak|No) * P(No)
= 2/5 * 4/5 * 2/5 * 5/14 = 0.0457
Predicting the Output
Now we normalize the values:

P(Yes) = 0.0476 / (0.0476 + 0.0457) = 0.51
P(No)  = 0.0457 / (0.0476 + 0.0457) = 0.49

Our model predicts that there is a 51% chance there will be a game tomorrow.
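A minimal sketch of this whole calculation in Python, with the conditional probabilities hard-coded from the likelihood tables above:

from fractions import Fraction as F

# P(Rain|Yes) * P(High|Yes) * P(Weak|Yes) * P(Yes)
p_yes = F(3, 9) * F(3, 9) * F(6, 9) * F(9, 14)
# P(Rain|No) * P(High|No) * P(Weak|No) * P(No)
p_no = F(2, 5) * F(4, 5) * F(2, 5) * F(5, 14)

total = p_yes + p_no
print(f"P(Yes) = {float(p_yes / total):.2f}")  # 0.51
print(f"P(No)  = {float(p_no / total):.2f}")   # 0.49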
GaussianNB() in Python
To implement the Naïve Bayes algorithm in Python, we will use the following library and function:

# Library
from sklearn.naive_bayes import GaussianNB

# Function
gnb = GaussianNB()
y_pred_gnb = gnb.fit(X_train, y_train).predict(X_test)
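As a quick, self-contained sanity check, here is a minimal end-to-end sketch on scikit-learn’s built-in iris dataset (the dataset choice is ours, purely for illustration):

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Train/test split on a small toy dataset
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Fit Gaussian Naive Bayes and predict on the held-out split
gnb = GaussianNB()
y_pred_gnb = gnb.fit(X_train, y_train).predict(X_test)
print("accuracy:", (y_pred_gnb == y_test).mean())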
Use-Case 1
Use-Case 1
As discussed earlier in Module 4, we have data about hurricanes and typhoons from 1851-2014.
The data comprises the location, wind, and pressure of tropical cyclones in the Pacific Ocean.
Based on this data, we have to classify the storms into hurricanes, typhoons, and their subcategories as per the predefined classes mentioned ahead.
In this module we will implement Naïve Bayes and SVM.
Predefined Class Description
1. TD – Tropical cyclone of tropical depression intensity (< 34 knots)
2. TS – Tropical cyclone of tropical storm intensity (34-63 knots)
3. HU – Tropical cyclone of hurricane intensity (> 64 knots)
4. EX – Extratropical cyclone (of any intensity)
5. SD – Subtropical cyclone of subtropical depression intensity (< 34 knots)
6. SS – Subtropical cyclone of subtropical storm intensity (> 34 knots)
7. LO – A low that is neither a tropical cyclone, a subtropical cyclone, nor an extratropical cyclone (of any intensity)
8. DB – Disturbance (of any intensity)
Use-Case 1 Solution
The problem here has eight predefined classes. As logistic regression is best suited for binary classification, we will solve this problem with other classifiers and compare their outputs:
1. Naïve Bayes
2. SVM
Loading Necessary Libraries
We will load the necessary libraries as done earlier, using the code as shown below:
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn import metrics
from sklearn import tree
Data Import
You can download the dataset from the LMS, then use the following code to load the data:
data = pd.read_csv('pacific.csv')
print(data.head(6))
Output
Data Manipulation
For categorical classification, we need the class labels to be numeric, so we will convert the ‘Status’ column as shown below:

# Convert the 'Status' column to a categorical type, then to numeric codes
data['Status'] = pd.Categorical(data['Status'])
data['Status'] = data['Status'].cat.codes

‘cat’ stands for categorical; each category has been assigned a number. Let’s see how the data looks now.
Plotting Typhoon Class Frequency
To see the frequency of the various categories, let’s create a frequency bar plot using the code shown below:

# Let's count the frequency of the different typhoon classes
sns.countplot(x='Status', data=data)
plt.show()
Output
Data Wrangling
We don’t need columns such as ID, Name, Event, Latitude, and Longitude to classify the data, and ‘Status’ is the target rather than a predictor.
Hence we need to drop these columns from the prediction variables as shown below:

# Work on a copy so the original DataFrame stays intact
pred_columns = data.copy()
pred_columns.drop(['ID', 'Name', 'Event', 'Status', 'Latitude', 'Longitude'],
                  axis=1, inplace=True)
prediction_var = pred_columns.columns
print(list(prediction_var))
Output:
Train-Test Split
Training and testing partitions are used to provide:
▪ An honest assessment of the performance of our predictive models on data they have never seen
▪ A simple evaluation procedure with little extra mathematical reasoning or manipulation of results

Scikit-learn provides a function called train_test_split to split the data:

# Split the main data into train and test partitions
train, test = train_test_split(data, test_size=0.3)

# We can check their dimensions
print(train.shape)
print(test.shape)
Output
Creating Predictor and Target Variables
To create the predictor (X) and target (y) variables, we will use the following code:
#taking the training data input
train_X = train[prediction_var]
train_y = train['Status']
print(list(train.columns))
#same for test
test_X = test[prediction_var]#taking test data inputs
test_y = test['Status'] #output value of test data
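Before evaluating, we fit Gaussian Naïve Bayes on these splits; a short sketch reusing the GaussianNB() call shown earlier with the variables created above:

from sklearn.naive_bayes import GaussianNB

# Fit on the training predictors/target and predict on the test inputs
gnb = GaussianNB()
y_pred_gnb = gnb.fit(train_X, train_y).predict(test_X)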
Confusion Matrix
Let’s create a confusion matrix using the function below:

from sklearn.metrics import confusion_matrix

cnf_matrix_gnb = confusion_matrix(test_y, y_pred_gnb)
print(cnf_matrix_gnb)

Output
That’s hard to interpret; let’s calculate the accuracy to evaluate it.
Accuracy Prediction
To check model performance, we will see how many inputs have been incorrectly classified using the code below:

print("Number of mislabeled points out of a total %d points: %d"
      % (data.shape[0], (test_y != y_pred_gnb).sum()))

Out of 26137 points, 7411 have been misclassified.
Hence accuracy = (26137 - 7411) / 26137 = 0.7164
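Equivalently, a one-line sketch with scikit-learn’s accuracy_score, which reports the fraction of correctly classified test points directly (note it divides by the test-set size rather than the full dataset size used above):

from sklearn.metrics import accuracy_score

print("Naive Bayes accuracy:", accuracy_score(test_y, y_pred_gnb))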
Support Vector Machine (SVM)
Support Vector Machine
• Support Vector Machine (SVM) is a supervised machine learning algorithm which can be used for both classification and regression challenges.
• It tries to define a hyperplane which can split the data in the most optimal way, such that there is a wide margin between the hyperplane and the observations.
• It is one of the most effective algorithms in Machine Learning.
[Figure: a classifier line separating two groups of data points]
What is a Hyperplane?
A hyperplane is a generalization of a plane:
➢ in one dimension, a hyperplane is a point
➢ in two dimensions, it is a line
➢ in three dimensions, it is a plane
➢ in more dimensions, you can call it a hyperplane
Example: a single point is the separating hyperplane in one dimension.
Support Vector Machine
An SVM model is a representation of the examples as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible.
New examples are then mapped into that same space and predicted to belong to a category based on which side of the gap they fall.
Support vectors are simply the coordinates of the individual observations that lie closest to the separating hyperplane.
[Figure: separating hyperplane with its margin; the support vectors sit on the margin boundaries]
How it works
Suppose we have two classes plotted as shown below. Just by looking at the plot, we can see that it is possible to separate the data using a straight line.
[Figure: two classes of points plotted on X-Y axes]
How it works
We can draw a separating line as shown below; in fact, multiple separating lines can be drawn here.
[Figure: several candidate separating lines between the two classes]
How it works
The purpose of SVM is to find the optimal hyperplane: we need to choose the one hyperplane which will separate this data in an optimal way.
[Figure: candidate hyperplanes between the two classes]
How it works
If we choose the red hyperplane, we can see that some of the observations will get misclassified. Intuitively, we can see that if we select a hyperplane which is close to the data points of one class, then it might not generalize well. So we will try to select a hyperplane as far as possible from the data points of each category.
[Figure: a poorly chosen (red) hyperplane lying close to one class]
How it works
Such a hyperplane will classify real-life data well. Now let’s see how we arrive at the optimal hyperplane.
[Figure: the optimal hyperplane, far from the points of both classes]
Choosing Optimal Hyperplane
Given a particular hyperplane, we can compute the distance between the hyperplane and the closest data point. Once we have this value, doubling it gives us what is called the margin.
[Figure: distance between the hyperplane and the closest point]
Choosing Optimal Hyperplane
Basically, the margin is a no man's land: there will never be any data point inside the margin. Similarly, we will find the margin for every other hyperplane.
[Figure: the margin around a candidate hyperplane, bounded by the closest points]
Choosing Optimal Hyperplane
After we find the margins of all the hyperplanes, we select the hyperplane having the largest margin as our separating hyperplane. Here Margin 2 is greater, so we will select the purple line as our optimal hyperplane.
[Figure: Margin 1 vs Margin 2; the purple line with the larger margin is the optimal hyperplane]
svm() in Python
svm() in Python
For implementing it we need to load the library using the code below:

from sklearn import svm

The syntax for the support vector machine function is:

model = svm.SVC(kernel='linear', C=1, gamma=1)

Let’s understand what the kernel, C, and gamma values are.
Kernels in SVM
There are three commonly used kernels in svm:
▪ Linear: when data is linearly separable, we use the linear kernel.
▪ Polynomial: when data is not linearly separable but can be classified using a curve, we use the polynomial kernel.
▪ RBF: when data is not linearly separable and cannot be classified using a simple curve, we use the RBF kernel.
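As a brief illustration, each kernel is selected through the kernel parameter of svm.SVC; the degree and gamma values below are illustrative defaults, not tuned choices:

from sklearn import svm

# One classifier per kernel type discussed above
linear_clf = svm.SVC(kernel='linear')
poly_clf = svm.SVC(kernel='poly', degree=3)     # degree controls the curve
rbf_clf = svm.SVC(kernel='rbf', gamma='scale')  # gamma shapes the RBF peaks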
The ‘C’-Value
The ‘C’ value determines the width of the margin: the larger the C value, the smaller the margin.
The ‘C’ value directly affects the misclassification error, since a large C penalizes misclassified points heavily while a small C tolerates more of them in exchange for a wider margin.
The recommended range for ‘C’ is 2^-10 to 2^10.
The ‘Gamma’-Value
Gamma is the parameter of the Gaussian (RBF) kernel, used to handle non-linear classification.
If the data is not linearly separable in 2D, we want to transform it to a higher dimension where it will be linearly separable.
Imagine “raising” the green points; then you can separate them from the red points with a plane (hyperplane).
To “raise” the points we use the RBF kernel; gamma controls the shape of the “peaks” where the points are raised.
The ‘Gamma’-Value
A large gamma gives you a pointed bump in the higher dimensions, while a small gamma gives you a softer, broader bump.
So a large gamma will give you low bias and high variance, while a small gamma will give you higher bias and low variance.
Hyperparameter Search
We now know that we can configure our model using hyperparameters. Choosing their best values manually is a tedious task, hence we use hyperparameter search to do it for us.
Hyperparameter search can be done in two ways (a short code sketch follows each description below):
1. Grid Search
2. Random Search
Grid Search
In grid search the parameter values are equally spaced within the parameter range specified by the user. Each value and its resulting output are checked before deciding on the final value.
For the range -10 to 10, grid search might try values such as
-9, -7, -5, -3, -1, 1, 3, 5, 7, 9 and so on
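A minimal sketch of grid search using scikit-learn’s GridSearchCV, reusing the train_X/train_y variables from the use case; the grid values are illustrative assumptions:

from sklearn.model_selection import GridSearchCV
from sklearn import svm

# Every combination of these C and gamma values is evaluated with 3-fold CV
param_grid = {'C': [0.1, 1, 10, 100], 'gamma': [0.001, 0.01, 0.1, 1]}

grid = GridSearchCV(svm.SVC(kernel='rbf'), param_grid, cv=3)
grid.fit(train_X, train_y)
print(grid.best_params_, grid.best_score_)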
Random Search
In the random search method, the hyperparameter values are chosen at random within the range specified by the user.
For the range -10 to 10, random search might try values such as
-9, +9, 0, 3, 7, -8, 5, -3, 9 and so on
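The analogous sketch with RandomizedSearchCV, which samples a fixed number of settings instead of trying every combination; the distributions and n_iter are illustrative assumptions (scipy is required for loguniform):

from sklearn.model_selection import RandomizedSearchCV
from scipy.stats import loguniform
from sklearn import svm

# Sample 10 random (C, gamma) settings from log-uniform ranges
param_dist = {'C': loguniform(2**-10, 2**10), 'gamma': loguniform(1e-4, 1e1)}

search = RandomizedSearchCV(svm.SVC(kernel='rbf'), param_dist,
                            n_iter=10, cv=3, random_state=0)
search.fit(train_X, train_y)
print(search.best_params_, search.best_score_)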
SVM Model Building
Let’s build the SVM model for our use case in Python. Hyperparameter search, as sketched earlier, can be used to pick the kernel, C, and gamma values:

from sklearn.metrics import accuracy_score
from sklearn import svm

# Import and fit the SVM classifier
model = svm.SVC(kernel='linear')
model.fit(train_X, train_y)

# Predict the output
predicted = model.predict(test_X)
Model Accuracy
To check the model accuracy, use the following code:

print("SVM accuracy:", accuracy_score(test_y, predicted))
Eager vs Lazy Learner
Eager Learner:
▪ A generalized model is constructed from the training dataset
▪ Using the model, the class of the test dataset is predicted
▪ Example: Decision Tree

Lazy Learner:
▪ The training dataset is stored in the system to build the model
▪ On querying, the similarity between the test data and the training set records is calculated to predict the class of the test data
▪ Example: K-nearest neighbour
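A short sketch contrasting the two learner styles on the use-case splits; the model choices mirror the examples named above, and the unscaled inputs are used purely for illustration:

from sklearn.tree import DecisionTreeClassifier     # eager: builds a model up front
from sklearn.neighbors import KNeighborsClassifier  # lazy: defers work to query time

eager = DecisionTreeClassifier().fit(train_X, train_y)  # generalizes during fit
lazy = KNeighborsClassifier().fit(train_X, train_y)     # essentially stores the data

# Both predict the same way; the work just happens at different times
print(eager.predict(test_X[:5]))
print(lazy.predict(test_X[:5]))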
Summary
In this module, you should have learned:
▪ What the Naïve Bayes classifier is
▪ How the Naïve Bayes classifier works
▪ The Support Vector Machine (SVM) classifier
▪ How SVM works
▪ Hyperparameter optimization
▪ Grid Search vs Random Search
Copyright © 2018, edureka and/or its affiliates. All rights reserved.