0% found this document useful (0 votes)

828 views19 pages

Naive Bayes Classifier in Machine Learning - Javatpoint

The document discusses the Naive Bayes classifier algorithm. It begins by explaining that Naive Bayes is a supervised learning algorithm based on Bayes' theorem used for classification problems. It then provides examples of applications like spam filtering. It discusses the assumptions of conditional independence between features that give Naive Bayes its name. The document proceeds to give a mathematical example of how Naive Bayes works step-by-step. It concludes by discussing the advantages, disadvantages, and types of Naive Bayes models.

Uploaded by

mangotwin22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

828 views19 pages

Naive Bayes Classifier in Machine Learning - Javatpoint

Uploaded by

mangotwin22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Home AI Machine Learning DBMS Java Blockchain Control System Selenium

⇧ SCROLL TO TOP
Naïve Bayes Classifier Algorithm
Naïve Bayes algorithm is a supervised learning algorithm, which is based on Bayes
theorem and used for solving classification problems.

It is mainly used in text classification that includes a high-dimensional training dataset.

Naïve Bayes Classifier is one of the simple and most effective Classification algorithms
which helps in building the fast machine learning models that can make quick predictions.

It is a probabilistic classifier, which means it predicts on the basis of the probability of an

object.

Some popular examples of Naïve Bayes Algorithm are spam filtration, Sentimental
analysis, and classifying articles.

Why is it called Naïve Bayes?

The Naïve Bayes algorithm is comprised of two words Naïve and Bayes, Which can be described
as:

Naïve: It is called Naïve because it assumes that the occurrence of a certain feature is
independent of the occurrence of other features. Such as if the fruit is identified on the
bases of color, shape, and taste, then red, spherical, and sweet fruit is recognized as an
apple. Hence each feature individually contributes to identify that it is an apple without
depending on each other.

Bayes: It is called Bayes because it depends on the principle of Bayes' Theorem.

Bayes' Theorem:

⇧ SCROLL TO TOP
Bayes' theorem is also known as Bayes' Rule or Bayes' law, which is used to determine the
probability of a hypothesis with prior knowledge. It depends on the conditional probability.

The formula for Bayes' theorem is given as:

Where,

P(A|B) is Posterior probability: Probability of hypothesis A on the observed event B.

P(B|A) is Likelihood probability: Probability of the evidence given that the probability of a
hypothesis is true.

⇧ SCROLL TO TOP
P(A) is Prior Probability: Probability of hypothesis before observing the evidence.

P(B) is Marginal Probability: Probability of Evidence.

Working of Naïve Bayes' Classifier:

Working of Naïve Bayes' Classifier can be understood with the help of the below example:

Suppose we have a dataset of weather conditions and corresponding target variable "Play". So
using this dataset we need to decide that whether we should play or not on a particular day
according to the weather conditions. So to solve this problem, we need to follow the below steps:

1. Convert the given dataset into frequency tables.

2. Generate Likelihood table by finding the probabilities of given features.

3. Now, use Bayes theorem to calculate the posterior probability.

⇧Problem:
SCROLL If
TOthe
TOP
weather is sunny, then the Player should play or not?
Solution: To solve this, first consider the below dataset:

Outlook Play

0 Rainy Yes

1 Sunny Yes

2 Overcast Yes

3 Overcast Yes

4 Sunny No

5 Rainy Yes

6 Sunny Yes

7 Overcast Yes

8 Rainy No

9 Sunny No

10 Sunny Yes

11 Rainy No

12 Overcast Yes

13 Overcast Yes

Frequency table for the Weather Conditions:

⇧ SCROLL TO TOP
Weather Yes No

Overcast 5 0

Rainy 2 2

Sunny 3 2

Total 10 5

Likelihood table weather condition:

Weather No Yes

Overcast 0 5 5/14= 0.35

Rainy 2 2 4/14=0.29

Sunny 2 3 5/14=0.35

All 4/14=0.29 10/14=0.71

Applying Bayes'theorem:

P(Yes|Sunny)= P(Sunny|Yes)*P(Yes)/P(Sunny)

P(Sunny|Yes)= 3/10= 0.3

⇧ SCROLL TO TOP
P(Sunny)= 0.35
P(Yes)=0.71

So P(Yes|Sunny) = 0.3*0.71/0.35= 0.60

P(No|Sunny)= P(Sunny|No)*P(No)/P(Sunny)

P(Sunny|NO)= 2/4=0.5

P(No)= 0.29

P(Sunny)= 0.35

So P(No|Sunny)= 0.5*0.29/0.35 = 0.41

So as we can see from the above calculation that P(Yes|Sunny)>P(No|Sunny)

Hence on a Sunny day, Player can play the game.

Advantages of Naïve Bayes Classifier:

Naïve Bayes is one of the fast and easy ML algorithms to predict a class of datasets.

It can be used for Binary as well as Multi-class Classifications.

⇧ SCROLL TO TOP
It performs well in Multi-class predictions as compared to the other Algorithms.
It is the most popular choice for text classification problems.

Disadvantages of Naïve Bayes Classifier:

Naive Bayes assumes that all features are independent or unrelated, so it cannot learn the
relationship between features.

Applications of Naïve Bayes Classifier:

It is used for Credit Scoring.

It is used in medical data classification.

It can be used in real-time predictions because Naïve Bayes Classifier is an eager learner.

It is used in Text classification such as Spam filtering and Sentiment analysis.

Types of Naïve Bayes Model:

There are three types of Naive Bayes Model, which are given below:

Gaussian: The Gaussian model assumes that features follow a normal distribution. This
means if predictors take continuous values instead of discrete, then the model assumes
that these values are sampled from the Gaussian distribution.

Multinomial:
⇧ SCROLL TO TOP The Multinomial Naïve Bayes classifier is used when the data is multinomial
distributed. It is primarily used for document classification problems, it means a particular
document belongs to which category such as Sports, Politics, education, etc.

The classifier uses the frequency of words for the predictors.

Bernoulli: The Bernoulli classifier works similar to the Multinomial classifier, but the
predictor variables are the independent Booleans variables. Such as if a particular word is
present or not in a document. This model is also famous for document classification tasks.

Python Implementation of the Naïve Bayes algorithm:

Now we will implement a Naive Bayes Algorithm using Python. So for this, we will use the
"user_data" dataset, which we have used in our other classification model. Therefore we can
easily compare the Naive Bayes model with the other models.

Steps to implement:

Data Pre-processing step

Fitting Naive Bayes to the Training set

Predicting the test result

Test accuracy of the result(Creation of Confusion matrix)

Visualizing the test set result.

1) Data Pre-processing step:

In this step, we will pre-process/prepare the data so that we can use it efficiently in our code. It is
similar as we did in data-pre-processing. The code for this is given below:

Importing the libraries
import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd

# Importing the dataset
dataset = pd.read_csv('user_data.csv')
x = dataset.iloc[:, [2, 3]].values
y = dataset.iloc[:, 4].values
⇧ SCROLL TO TOP

# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size = 0.25, random_state = 0)

# Feature Scaling
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
x_train = sc.fit_transform(x_train)
x_test = sc.transform(x_test)

In the above code, we have loaded the dataset into our program using "dataset =
pd.read_csv('user_data.csv'). The loaded dataset is divided into training and test set, and then we
have scaled the feature variable.

The output for the dataset is given as:

⇧ SCROLL TO TOP
2) Fitting Naive Bayes to the Training Set:

After the pre-processing step, now we will fit the Naive Bayes model to the Training set. Below is
the code for it:

# Fitting Naive Bayes to the Training set
from sklearn.naive_bayes import GaussianNB
classifier = GaussianNB()
⇧ SCROLL TO TOP
classifier.fit(x_train, y_train)
In the above code, we have used the GaussianNB classifier to fit it to the training dataset. We can
also use other classifiers as per our requirement.

Output:

Out[6]: GaussianNB(priors=None, var_smoothing=1e-09)

3) Prediction of the test set result:

Now we will predict the test set result. For this, we will create a new predictor variable y_pred, and
will use the predict function to make the predictions.

# Predicting the Test set results
y_pred = classifier.predict(x_test)

Output:

⇧ SCROLL TO TOP
The above output shows the result for prediction vector y_pred and real vector y_test. We can see
that some predications are different from the real values, which are the incorrect predictions.

4) Creating Confusion Matrix:

Now we will check the accuracy of the Naive Bayes classifier using the Confusion matrix. Below is
the code for it:

# Making the Confusion Matrix
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_pred)
⇧ SCROLL TO TOP
Output:

As we can see in the above confusion matrix output, there are 7+3= 10 incorrect predictions, and
65+25=90 correct predictions.

5) Visualizing the training set result:

Next we will visualize the training set result using Naïve Bayes Classifier. Below is the code for it:

# Visualising the Training set results
from matplotlib.colors import ListedColormap
x_set, y_set = x_train, y_train
X1, X2 = nm.meshgrid(nm.arange(start = x_set[:, 0].min() - 1, stop = x_set[:, 0].max() + 1, step = 0.0
                     nm.arange(start = x_set[:, 1].min() - 1, stop = x_set[:, 1].max() + 1, step = 0.01))
mtp.contourf(X1, X2, classifier.predict(nm.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha = 0.75, cmap = ListedColormap(('purple', 'green')))
mtp.xlim(X1.min(), X1.max())
mtp.ylim(X2.min(), X2.max())
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                c = ListedColormap(('purple', 'green'))(i), label = j)
⇧ SCROLL TO TOP
mtp.title('Naive Bayes (Training set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()

Output:

In the above output we can see that the Naïve Bayes classifier has segregated the data points
with the fine boundary. It is Gaussian curve as we have used GaussianNB classifier in our code.

6) Visualizing the Test set result:

# Visualising the Test set results
from matplotlib.colors import ListedColormap
x_set, y_set = x_test, y_test
X1, X2 = nm.meshgrid(nm.arange(start = x_set[:, 0].min() - 1, stop = x_set[:, 0].max() + 1, step = 0.0
                     nm.arange(start = x_set[:, 1].min() - 1, stop = x_set[:, 1].max() + 1, step = 0.01))
mtp.contourf(X1, X2, classifier.predict(nm.array([X1.ravel(), X2.ravel()]).T).reshape(X1.shape),
             alpha = 0.75, cmap = ListedColormap(('purple', 'green')))
mtp.xlim(X1.min(), X1.max())
mtp.ylim(X2.min(), X2.max())
for i, j in enumerate(nm.unique(y_set)):
    mtp.scatter(x_set[y_set == j, 0], x_set[y_set == j, 1],
                c = ListedColormap(('purple', 'green'))(i), label = j)
mtp.title('Naive Bayes (test set)')
mtp.xlabel('Age')
mtp.ylabel('Estimated Salary')
mtp.legend()
mtp.show()

Output:
⇧ SCROLL TO TOP
The above output is final output for test set data. As we can see the classifier has created a
Gaussian curve to divide the "purchased" and "not purchased" variables. There are some wrong
predictions which we have calculated in Confusion matrix. But still it is pretty good classifier.

← Prev
Next →

Youtube
For Videos Join Our Youtube Channel: Join Now

Feedback

Send your Feedback to feedback@javatpoint.com

Help Others, Please Share

Learn Latest Tutorials

⇧ SCROLL TO TOP
Splunk SPSS tutorial Swagger T-SQL
tutorial SPSS
tutorial tutorial
Splunk Swagger Transact-SQL

Tumblr React tutorial Regex

tutorial tutorial Reinforcement
ReactJS
learning
Tumblr Regex
tutorial
Reinforcement
Learning

R RxJS tutorial React Native Python

Programming RxJS
tutorial Design Patterns
tutorial React Native Python Design
R Programming Patterns

Python Python Keras

Pillow tutorial Turtle tutorial tutorial
Python Pillow Python Turtle Keras

Preparation

Aptitude Logical Verbal Interview

Aptitude
Reasoning Ability Questions
Reasoning Verbal Ability Interview
Questions

Company
Interview
Questions
Company
Questions

⇧ SCROLL TO TOP
Trending Technologies

Artificial AWS Tutorial Selenium Cloud

Intelligence AWS
tutorial Computing
Artificial Selenium Cloud Computing
Intelligence

Hadoop ReactJS Data Science Angular 7

tutorial Tutorial Tutorial Tutorial
Hadoop ReactJS Data Science Angular 7

Blockchain Git Tutorial Machine DevOps

Tutorial Git
Learning Tutorial
Blockchain
Tutorial DevOps
Machine
Learning

B.Tech / MCA

DBMS Data DAA tutorial Operating

tutorial Structures DAA
System
DBMS
tutorial Operating
Data Structures System

Computer Compiler Computer Discrete

Network Design tutorial Organization Mathematics
tutorial Compiler Design
and Tutorial
Computer
Architecture Discrete
Network Computer Mathematics
Organization
Ethical Computer Software html tutorial
Hacking Graphics Engineering Web Technology
Ethical Hacking
Tutorial Software
Computer Engineering
⇧ SCROLL TO TOP Graphics
Cyber Automata C Language C++ tutorial
Security Tutorial tutorial C++
tutorial Automata C Programming
Cyber Security

Java tutorial .Net Python List of

Java
Framework tutorial Programs
tutorial Python Programs
.Net

Control Data Mining Data

Systems Tutorial Warehouse
tutorial Tutorial
Data Mining
Control System Data Warehouse

⇧ SCROLL TO TOP

AI Knowledge Representation Guide
No ratings yet
AI Knowledge Representation Guide
39 pages
Artificial Intelligence Module 5
No ratings yet
Artificial Intelligence Module 5
23 pages
Unsupervised Learning Notes
No ratings yet
Unsupervised Learning Notes
21 pages
ISOMAP in ML
No ratings yet
ISOMAP in ML
12 pages
Unit - 1 MACHINE LEARNING BASICS, LINEAR ALGEBRA
No ratings yet
Unit - 1 MACHINE LEARNING BASICS, LINEAR ALGEBRA
41 pages
STM Question Paper R18
No ratings yet
STM Question Paper R18
2 pages
AI Probabilistic Reasoning Guide
No ratings yet
AI Probabilistic Reasoning Guide
14 pages
Artificial Intelligence: Chapter 6: Representing Knowledge Using Rules
No ratings yet
Artificial Intelligence: Chapter 6: Representing Knowledge Using Rules
54 pages
Ai Unit-V Expert Systems
No ratings yet
Ai Unit-V Expert Systems
20 pages
Module 3 Informed Search Techniques and Knowledge Representation
No ratings yet
Module 3 Informed Search Techniques and Knowledge Representation
26 pages
NITHYA S - 412520403004 - Project Report
No ratings yet
NITHYA S - 412520403004 - Project Report
39 pages
FIND-S Algorithm: Machine Learning 15CSL76
No ratings yet
FIND-S Algorithm: Machine Learning 15CSL76
3 pages
Spam Email. Classifier
No ratings yet
Spam Email. Classifier
16 pages
Question Bank: T.E. (Computer Engineering) Data Science and Big Data Analytics (2019 Pattern)
No ratings yet
Question Bank: T.E. (Computer Engineering) Data Science and Big Data Analytics (2019 Pattern)
4 pages
AI & ML Unit 4 Notes
No ratings yet
AI & ML Unit 4 Notes
16 pages
Machine Learning
No ratings yet
Machine Learning
7 pages
Greedy-Layerwise in Deep Learning
No ratings yet
Greedy-Layerwise in Deep Learning
15 pages
ML Unit-1
No ratings yet
ML Unit-1
32 pages
Machine Learning Basics for Students
100% (1)
Machine Learning Basics for Students
21 pages
Lab Program
100% (1)
Lab Program
15 pages
Unit 5 1
No ratings yet
Unit 5 1
18 pages
Iv Semester: Data Mining Question Bank: Unit 2 2 Mark Questions)
No ratings yet
Iv Semester: Data Mining Question Bank: Unit 2 2 Mark Questions)
5 pages
AI Fundamentals for Beginners
No ratings yet
AI Fundamentals for Beginners
45 pages
DevOps (UNIT - I)
No ratings yet
DevOps (UNIT - I)
21 pages
AI Planning & Learning Basics
No ratings yet
AI Planning & Learning Basics
23 pages
Unit-4 Part-1 ML Ai&Ml r23
No ratings yet
Unit-4 Part-1 ML Ai&Ml r23
20 pages
Unit 4
No ratings yet
Unit 4
17 pages
Unit 4
No ratings yet
Unit 4
26 pages
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
No ratings yet
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
7 pages
18.4 Evaluating and Choosing The Best Hypothesis: Model Selection: Complexity vs. Goodness of Fit
No ratings yet
18.4 Evaluating and Choosing The Best Hypothesis: Model Selection: Complexity vs. Goodness of Fit
8 pages
Chpater 1 - Unit 2
No ratings yet
Chpater 1 - Unit 2
31 pages
ML Unit4
No ratings yet
ML Unit4
41 pages
01 - The Role of Algorithms in Computing
0% (1)
01 - The Role of Algorithms in Computing
30 pages
ML - CSA 301 - ML Perspective and Issues
No ratings yet
ML - CSA 301 - ML Perspective and Issues
34 pages
Mining Graphs
No ratings yet
Mining Graphs
23 pages
Unification and Lifting
No ratings yet
Unification and Lifting
8 pages
Hill Climbing Algorithm
100% (1)
Hill Climbing Algorithm
49 pages
Designing A Learning System
No ratings yet
Designing A Learning System
21 pages
BTCS9202 Data Sciences Lab Manual
No ratings yet
BTCS9202 Data Sciences Lab Manual
39 pages
Unit 1 - Machine Learning
No ratings yet
Unit 1 - Machine Learning
21 pages
Module-02 AIML NOTES
No ratings yet
Module-02 AIML NOTES
29 pages
1) Aim: Demonstration of Preprocessing of Dataset Student - Arff
No ratings yet
1) Aim: Demonstration of Preprocessing of Dataset Student - Arff
26 pages
Smooth N-Gram
No ratings yet
Smooth N-Gram
2 pages
Ai Unit 6 Techknow
No ratings yet
Ai Unit 6 Techknow
31 pages
Unit 4 - Domain Testing
100% (1)
Unit 4 - Domain Testing
76 pages
Gujarat Technological University: Computer Engineering Machine Learning SUBJECT CODE: 3710216
No ratings yet
Gujarat Technological University: Computer Engineering Machine Learning SUBJECT CODE: 3710216
2 pages
Machine Learning Quantum
No ratings yet
Machine Learning Quantum
64 pages
DATA ANALYTICS Syllabus 3 Units
No ratings yet
DATA ANALYTICS Syllabus 3 Units
37 pages
AD8402 - Artificial Intelligence (Unit III)
No ratings yet
AD8402 - Artificial Intelligence (Unit III)
24 pages
R22 ML Question Bank For It and CSM
No ratings yet
R22 ML Question Bank For It and CSM
4 pages
KNN (K Nearest Neighbor)
No ratings yet
KNN (K Nearest Neighbor)
21 pages
Unit III Knowledge, Reasoning and Planning
No ratings yet
Unit III Knowledge, Reasoning and Planning
99 pages
ML Unit 4
No ratings yet
ML Unit 4
34 pages
(New) (New) ML KNN Introduction Handwritten Notes
No ratings yet
(New) (New) ML KNN Introduction Handwritten Notes
6 pages
Concept Learning
No ratings yet
Concept Learning
85 pages
3.4 Lda
No ratings yet
3.4 Lda
12 pages
Naive Bayes Classifier in Machine Learning
No ratings yet
Naive Bayes Classifier in Machine Learning
16 pages
Ame: Waqar Ali
No ratings yet
Ame: Waqar Ali
22 pages
Naïve Bayes Classifier Algorithm
No ratings yet
Naïve Bayes Classifier Algorithm
11 pages
Artificial Neural Network Tutorial - Javatpoint
No ratings yet
Artificial Neural Network Tutorial - Javatpoint
13 pages
Density-Based Clustering in Data Minin - Javatpoint
No ratings yet
Density-Based Clustering in Data Minin - Javatpoint
11 pages
What Are The Components of Ict 60f50f112b1c112d6c121f88
No ratings yet
What Are The Components of Ict 60f50f112b1c112d6c121f88
3 pages
Transmission Media - Javatpoint
No ratings yet
Transmission Media - Javatpoint
9 pages
Essay Contest for College Students
No ratings yet
Essay Contest for College Students
1 page
Types of Network PDF
No ratings yet
Types of Network PDF
2 pages
Marengo Privacy and AI Sample 1693399544
No ratings yet
Marengo Privacy and AI Sample 1693399544
13 pages
Lecture Dimensionality Reduction
No ratings yet
Lecture Dimensionality Reduction
34 pages
Salesforce AI Associate Exam Guide
No ratings yet
Salesforce AI Associate Exam Guide
7 pages
How ChatGPT Works The Model Behind The Bot by Molly Ruby Towards Data Science
No ratings yet
How ChatGPT Works The Model Behind The Bot by Molly Ruby Towards Data Science
15 pages
Principal Component Analysis (PCA) For Image Compression and Eigenvectors - Week9
No ratings yet
Principal Component Analysis (PCA) For Image Compression and Eigenvectors - Week9
7 pages
Sentiment Analysis with AI-Deep Learning
No ratings yet
Sentiment Analysis with AI-Deep Learning
74 pages
Python Unit 2
No ratings yet
Python Unit 2
81 pages
ML - Chapter 5 - Neural Network
No ratings yet
ML - Chapter 5 - Neural Network
64 pages
Design and Analysis of Algorithms - AD3351 - Important Questions With Answer - Unit 3 - Dynamic Programming and Greedy Technique
No ratings yet
Design and Analysis of Algorithms - AD3351 - Important Questions With Answer - Unit 3 - Dynamic Programming and Greedy Technique
8 pages
ML Unit - V
No ratings yet
ML Unit - V
46 pages
DLunit 3
No ratings yet
DLunit 3
13 pages
Data Science Course Content
No ratings yet
Data Science Course Content
4 pages
Sign Language To Text Conversion
50% (2)
Sign Language To Text Conversion
27 pages
AI Strategies for Customer Engagement
No ratings yet
AI Strategies for Customer Engagement
12 pages
PyTorch Guide
No ratings yet
PyTorch Guide
17 pages
AI & Neural Networks Exam
No ratings yet
AI & Neural Networks Exam
8 pages
Article 26
No ratings yet
Article 26
37 pages
Global Toolkit On AI and The Rule of Law For The Judiciary
No ratings yet
Global Toolkit On AI and The Rule of Law For The Judiciary
206 pages
Artificial Neural Network
No ratings yet
Artificial Neural Network
21 pages
KNN VS Kmeans
No ratings yet
KNN VS Kmeans
3 pages
Deep Representation Learning Techniques For Audio Signal Processing
No ratings yet
Deep Representation Learning Techniques For Audio Signal Processing
152 pages
Ai Health Care Chat Bot
No ratings yet
Ai Health Care Chat Bot
8 pages
Comparison Kubeflow TFX
No ratings yet
Comparison Kubeflow TFX
12 pages
AIA 6600 - Module 1
No ratings yet
AIA 6600 - Module 1
5 pages
Harnessing Large Language Models For Training-Free Video Anomaly Detection
No ratings yet
Harnessing Large Language Models For Training-Free Video Anomaly Detection
13 pages
A Systematic Review: B-Cell Conformational Epitope Prediction From Epitope Characteristics View
No ratings yet
A Systematic Review: B-Cell Conformational Epitope Prediction From Epitope Characteristics View
7 pages
A Canvas of Air and Signs: Integrating Voice Activated Hand Sign Recognition and Air Canvas For Hearing Impaired and Non-Verbal People
No ratings yet
A Canvas of Air and Signs: Integrating Voice Activated Hand Sign Recognition and Air Canvas For Hearing Impaired and Non-Verbal People
4 pages
Model Question Paper - AIML
No ratings yet
Model Question Paper - AIML
4 pages
(2018) Estimation of The Generation Rate of Different Types of Plastic Wastes and Possible Revenue Recovery From Informal Recycling - AGUNG
No ratings yet
(2018) Estimation of The Generation Rate of Different Types of Plastic Wastes and Possible Revenue Recovery From Informal Recycling - AGUNG
10 pages
Machine Learning With Spark Nick Pentreath - Download The Ebook Now For Full and Detailed Access
100% (6)
Machine Learning With Spark Nick Pentreath - Download The Ebook Now For Full and Detailed Access
67 pages

Naive Bayes Classifier in Machine Learning - Javatpoint

Uploaded by

Naive Bayes Classifier in Machine Learning - Javatpoint

Uploaded by

Home AI Machine Learning DBMS Java Blockchain Control System Selenium

It is mainly used in text classification that includes a high-dimensional training dataset.

It is a probabilistic classifier, which means it predicts on the basis of the probability of an

Why is it called Naïve Bayes?

Bayes: It is called Bayes because it depends on the principle of Bayes' Theorem.

The formula for Bayes' theorem is given as:

P(A|B) is Posterior probability: Probability of hypothesis A on the observed event B.

P(B) is Marginal Probability: Probability of Evidence.

Working of Naïve Bayes' Classifier:

1. Convert the given dataset into frequency tables.

2. Generate Likelihood table by finding the probabilities of given features.

3. Now, use Bayes theorem to calculate the posterior probability.

Frequency table for the Weather Conditions:

Likelihood table weather condition:

Overcast 0 5 5/14= 0.35

All 4/14=0.29 10/14=0.71

P(Sunny|Yes)= 3/10= 0.3

So P(Yes|Sunny) = 0.3*0.71/0.35= 0.60

So P(No|Sunny)= 0.5*0.29/0.35 = 0.41

So as we can see from the above calculation that P(Yes|Sunny)>P(No|Sunny)

Hence on a Sunny day, Player can play the game.

Advantages of Naïve Bayes Classifier:

It can be used for Binary as well as Multi-class Classifications.

Disadvantages of Naïve Bayes Classifier:

Applications of Naïve Bayes Classifier:

It is used for Credit Scoring.

It is used in medical data classification.

It is used in Text classification such as Spam filtering and Sentiment analysis.

Types of Naïve Bayes Model:

The classifier uses the frequency of words for the predictors.

Python Implementation of the Naïve Bayes algorithm:

Data Pre-processing step

Fitting Naive Bayes to the Training set

Predicting the test result

Test accuracy of the result(Creation of Confusion matrix)

Visualizing the test set result.

1) Data Pre-processing step:

The output for the dataset is given as:

Out[6]: GaussianNB(priors=None, var_smoothing=1e-09)

3) Prediction of the test set result:

4) Creating Confusion Matrix:

5) Visualizing the training set result:

6) Visualizing the Test set result:

Send your Feedback to feedback@javatpoint.com

Help Others, Please Share

Learn Latest Tutorials

Tumblr React tutorial Regex

R RxJS tutorial React Native Python

Python Python Keras

Aptitude Logical Verbal Interview

Artificial AWS Tutorial Selenium Cloud

Hadoop ReactJS Data Science Angular 7

Blockchain Git Tutorial Machine DevOps

DBMS Data DAA tutorial Operating

Computer Compiler Computer Discrete

Java tutorial .Net Python List of

Control Data Mining Data

You might also like