Bayes Classifiers
We are about to see some of the mathematical formalisms and examples, but keep in mind the basic idea: find out the probability of the previously unseen instance belonging to each class, then simply pick the most probable class.
• Bayesian classifiers use Bayes' theorem, which says

p(cj | d) = p(d | cj) p(cj) / p(d)
• p(cj | d) = probability of instance d being in class cj. This is what we are trying to compute.
• p(d | cj) = probability of generating instance d given class cj. We can imagine that being in class cj causes you to have feature d with some probability.
• p(cj) = probability of occurrence of class cj. This is just how frequent class cj is in our database.
• p(d) = probability of instance d occurring. This can actually be ignored, since it is the same for all classes.
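Concretely, picking the most probable class only needs the numerators. Here is a minimal Python sketch; the function name, variable names, and the numbers are hypothetical, chosen just to show the idea:

# Compare p(d | cj) * p(cj) across classes; p(d) is a shared
# denominator, so it can be ignored when picking the winner.
def most_probable_class(likelihoods, priors):
    # likelihoods: {class: p(d | class)}, priors: {class: p(class)}
    scores = {c: likelihoods[c] * priors[c] for c in priors}
    return max(scores, key=scores.get)

# Hypothetical numbers, just to show the call:
print(most_probable_class({"c1": 0.30, "c2": 0.10},
                          {"c1": 0.40, "c2": 0.60}))  # -> c1 (0.12 vs 0.06)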
Assume that we have two classes: c1 = male and c2 = female. (Note: "Drew" can be a male or a female name; think of Drew Carey and Drew Barrymore.)

We have a person whose gender we do not know, say "drew" or d. Classifying drew as male or female is equivalent to asking which is more probable, i.e. which is greater: p(male | drew) or p(female | drew).

p(male | drew) = p(drew | male) p(male) / p(drew)

Here p(drew | male) is the probability of being called "drew" given that you are a male, and p(male) is the probability of being a male. p(drew), the probability of being named "drew", is actually irrelevant, since it is the same for all classes.
This is Officer Drew. Is Officer Drew a male or a female? Luckily, we have a small database with names and gender. We can use it to apply Bayes' rule:

p(cj | d) = p(d | cj) p(cj) / p(d)

Name     Gender
Drew     Male
Claudia  Female
Drew     Female
Drew     Female
Alberto  Male
Karin    Female
Nina     Female
Sergio   Male
Among the three males, one is named Drew, so p(drew | male) = 1/3 and p(male) = 3/8; among the five females, two are named Drew, so p(drew | female) = 2/5 and p(female) = 5/8. Since p(drew) = 3/8 is the same for both classes, we only need to compare the numerators:

p(male | drew):   p(drew | male) p(male) = 1/3 * 3/8 = 0.125
p(female | drew): p(drew | female) p(female) = 2/5 * 5/8 = 0.250

Officer Drew is more likely to be a female. Officer Drew IS a female!
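Here is a small Python sketch of the same computation, estimating each probability by counting in the eight-row table above (the variable names are my own):

# The name/gender database from the table above.
data = [("Drew", "Male"), ("Claudia", "Female"), ("Drew", "Female"),
        ("Drew", "Female"), ("Alberto", "Male"), ("Karin", "Female"),
        ("Nina", "Female"), ("Sergio", "Male")]

def posterior_numerator(name, gender):
    # p(name | gender) * p(gender); p(name) is the same for both
    # classes, so it is left out of the comparison.
    in_class = [n for n, g in data if g == gender]
    p_name_given_class = sum(n == name for n in in_class) / len(in_class)
    p_class = len(in_class) / len(data)
    return p_name_given_class * p_class

print(posterior_numerator("Drew", "Male"))    # 1/3 * 3/8 = 0.125
print(posterior_numerator("Drew", "Female"))  # 2/5 * 5/8 = 0.250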
So far we have only considered Bayes classification when we have one attribute (the "name"). But we may have many features. How do we use all the features?

p(cj | d) = p(d | cj) p(cj) / p(d)
Name     Over 170 cm  Eye    Hair length  Gender
Drew     No           Blue   Short        Male
Claudia  Yes          Brown  Long         Female
Drew     No           Blue   Long         Female
Drew     No           Blue   Long         Female
Alberto  Yes          Brown  Short        Male
Karin    No           Blue   Long         Female
Nina     Yes          Brown  Short        Female
Sergio   Yes          Blue   Long         Male
• To simplify the task, naïve Bayesian classifiers assume attributes have independent distributions, and thereby estimate

p(d | cj) = p(d1 | cj) * p(d2 | cj) * … * p(dn | cj)

That is, the probability of class cj generating instance d equals the probability of class cj generating the observed value for feature 1, multiplied by the probability of class cj generating the observed value for feature 2, and so on.
Officer Drew is blue-eyed, over 170 cm tall, and has long hair.

p(officer drew | cj) = p(over_170cm = yes | cj) * p(eye = blue | cj) * …

p(officer drew | Female) = 2/5 * 3/5 * …
p(officer drew | Male) = 2/3 * 2/3 * …
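A sketch of this product in Python, with each conditional probability estimated by counting in the feature table above. The names are mine, and the third factor (hair length) simply continues the "…" in the slide, computed from the same table:

# Rows: (name, over_170cm, eye, hair, gender), from the table above.
rows = [
    ("Drew",    "No",  "Blue",  "Short", "Male"),
    ("Claudia", "Yes", "Brown", "Long",  "Female"),
    ("Drew",    "No",  "Blue",  "Long",  "Female"),
    ("Drew",    "No",  "Blue",  "Long",  "Female"),
    ("Alberto", "Yes", "Brown", "Short", "Male"),
    ("Karin",   "No",  "Blue",  "Long",  "Female"),
    ("Nina",    "Yes", "Brown", "Short", "Female"),
    ("Sergio",  "Yes", "Blue",  "Long",  "Male"),
]
COLS = {"over_170cm": 1, "eye": 2, "hair": 3}  # column positions

def naive_likelihood(observed, gender):
    # p(d | cj) estimated as the product of p(di | cj) over features,
    # under the independence assumption.
    in_class = [r for r in rows if r[4] == gender]
    p = 1.0
    for feature, value in observed.items():
        i = COLS[feature]
        p *= sum(r[i] == value for r in in_class) / len(in_class)
    return p

officer_drew = {"over_170cm": "Yes", "eye": "Blue", "hair": "Long"}
print(naive_likelihood(officer_drew, "Female"))  # 2/5 * 3/5 * 4/5 = 0.192
print(naive_likelihood(officer_drew, "Male"))    # 2/3 * 2/3 * 1/3 ~ 0.148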
The naïve Bayes classifier is often represented as this type of graph…

[Figure: a class node cj with an arrow from it to each feature node p(d1 | cj), p(d2 | cj), …, p(dn | cj)]

Note the direction of the arrows, which state that each class causes certain features, with a certain probability.
Naïve Bayes is fast and space efficient. We can look up all the probabilities with a single scan of the database and store them in a (small) table…

Gender   Over 190 cm
Male     Yes  0.15
         No   0.85
Female   Yes  0.01
         No   0.99

Gender   Long hair
Male     Yes  0.05
         No   0.95
Female   Yes  0.70
         No   0.30

…
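A sketch of that single scan in Python: one pass over the database accumulates counts, which are then normalised into the p(di | cj) lookup tables. The dict layout is my own choice:

from collections import defaultdict

def train(rows, class_key):
    # rows: list of dicts, e.g. {"gender": "Male", "long_hair": "No", ...}
    class_counts = defaultdict(int)
    feature_counts = defaultdict(int)   # keyed by (feature, value, class)
    for row in rows:                    # the single scan
        cj = row[class_key]
        class_counts[cj] += 1
        for feature, value in row.items():
            if feature != class_key:
                feature_counts[(feature, value, cj)] += 1
    # Normalise counts into p(feature = value | class) and p(class).
    cond = {k: n / class_counts[k[2]] for k, n in feature_counts.items()}
    priors = {cj: n / len(rows) for cj, n in class_counts.items()}
    return cond, priors

cond, priors = train(
    [{"gender": "Male", "long_hair": "No"},
     {"gender": "Female", "long_hair": "Yes"}],
    class_key="gender")
print(cond[("long_hair", "Yes", "Female")])  # 1.0 on this toy input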
Naïve Bayes is NOT sensitive to irrelevant features…

Suppose we are trying to classify a person's gender based on several features, including eye color. (Of course, eye color is completely irrelevant to a person's gender.)

p(Jessica | cj) = p(eye = brown | cj) * p(wears_dress = yes | cj) * …

p(Jessica | Female) = 9,000/10,000 * 9,975/10,000 * …
p(Jessica | Male) = 9,001/10,000 * 2/10,000 * …

The eye-color factors are almost the same for both classes, so they barely affect the result! However, this assumes that we have good enough estimates of the probabilities, so the more data the better.
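To make the "almost the same" point concrete, here is the slide's arithmetic in Python: the eye-colour factors nearly cancel in the ratio of the two products, while the wears_dress factor dominates.

# Numbers from the Jessica example above.
p_jessica_female = (9_000 / 10_000) * (9_975 / 10_000)   # eye * dress
p_jessica_male   = (9_001 / 10_000) * (2 / 10_000)
print(p_jessica_female)                      # ~0.8978
print(p_jessica_male)                        # ~0.00018
# The irrelevant eye factor contributes almost exactly 1 to the ratio:
print((9_000 / 10_000) / (9_001 / 10_000))   # ~0.99989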
An obvious point: I have used a simple two-class problem, with two possible values for each feature, in my previous examples. However, we can have an arbitrary number of classes, or feature values.

Animal   Mass > 10 kg
Cat      Yes  0.15
         No   0.85
Dog      Yes  0.91
         No   0.09
Pig      Yes  0.99
         No   0.01

Animal   Color
Cat      Black  0.33
         White  0.23
         Brown  0.44
Dog      Black  0.97
         White  0.03
         Brown  0.90
Pig      Black  0.04
         White  0.01

…
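A sketch of the corresponding multi-class computation in Python, reading the conditional probabilities straight from the tables above. The uniform class priors and the handling of the Pig/Brown entry (lost from the slide) are my assumptions:

# p(di | cj) read from the two tables above.
p_mass = {"Cat": {"Yes": 0.15, "No": 0.85},
          "Dog": {"Yes": 0.91, "No": 0.09},
          "Pig": {"Yes": 0.99, "No": 0.01}}
p_color = {"Cat": {"Black": 0.33, "White": 0.23, "Brown": 0.44},
           "Dog": {"Black": 0.97, "White": 0.03, "Brown": 0.90},
           "Pig": {"Black": 0.04, "White": 0.01}}  # Brown entry not given

def classify(mass_over_10kg, color):
    # Priors assumed uniform, so comparing likelihoods is enough;
    # .get(..., 0.0) covers the missing Pig/Brown entry.
    scores = {c: p_mass[c][mass_over_10kg] * p_color[c].get(color, 0.0)
              for c in p_mass}
    return max(scores, key=scores.get)

print(classify("Yes", "Black"))  # -> "Dog" (0.91 * 0.97 dominates)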
Problem! Naïve Bayes assumes independence of features…

[Figure: the naïve Bayes graph again, with p(d | cj) factored into p(d1 | cj), p(d2 | cj), …, p(dn | cj)]

Gender   Over 6 foot
Male     Yes  0.15
         No   0.85
Female   Yes  0.01
         No   0.99

Gender   Over 200 pounds
Male     Yes  0.11
         No   0.80
Female   Yes  0.05
         No   0.95

But being over 6 foot and being over 200 pounds are clearly related, so these two features are not independent.
Solution: consider the relationships between attributes…

[Figure: the same graph, now with an arc connecting the height feature to the weight feature, so the weight probabilities are conditioned on height]

Gender   Over 6 foot
Male     Yes  0.15
         No   0.85
Female   Yes  0.01
         No   0.99

Gender   Over 200 pounds
Male     Yes and Over 6 foot       0.11
         No and Over 6 foot        0.59
         Yes and NOT Over 6 foot   0.05
         No and NOT Over 6 foot    0.35
But how do we find the set of connecting arcs??
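One common fix, sketched below, keeps the chain-rule factorisation p(d | cj) = p(height | cj) * p(weight | height, cj) for the related pair of attributes. Reading the slide's "Yes and Over 6 foot" rows as the conditional p(weight | height, Male) is my assumption about its notation:

# Relaxing independence for one pair of attributes (Male class only,
# since the slide gives only the Male side of the conditioned table).
p_height_male = {"Yes": 0.15, "No": 0.85}   # p(over 6 foot | Male)
p_weight_given_height_male = {               # p(over 200 lb | height, Male),
    ("Yes", "Yes"): 0.11,                    # assumed reading of the table
    ("No",  "Yes"): 0.59,
    ("Yes", "No"):  0.05,
    ("No",  "No"):  0.35,
}

def likelihood_male(height_over_6ft, weight_over_200lb):
    # p(d | Male) = p(height | Male) * p(weight | height, Male)
    return (p_height_male[height_over_6ft] *
            p_weight_given_height_male[(weight_over_200lb, height_over_6ft)])

print(likelihood_male("Yes", "Yes"))  # 0.15 * 0.11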
Advantages/Disadvantages of Naïve Bayes
• Advantages:
– Fast to train (single scan). Fast to classify
– Not sensitive to irrelevant features
– Handles real and discrete data
– Handles streaming data well
• Disadvantages:
– Assumes independence of features