Machine Learning
Dr. Sunil Saumya
IIIT Dharwad
ML Algorithm: Decision Tree
● A decision tree builds classification or regression models in the form of a tree structure.
● It breaks a dataset down into smaller and smaller subsets while, at the same time, an associated decision tree is incrementally developed.
● The final result is a tree with decision nodes and leaf nodes.
○ A decision node has two or more branches.
○ A leaf node represents a classification or decision.
○ The topmost decision node in a tree, which corresponds to the best predictor, is called the root node.
● Decision trees can handle both categorical and numerical data.
Decision Tree understanding: example 1
Consider a dataset:

  Age | Lower berth (LB) | Ticket fare concession
  ----+------------------+-----------------------
   45 | Yes              | No concession
   60 | Yes              | Concession with LB
   62 | No               | Concession without LB
   58 | No               | No concession

How do we find who will get a concession?
Decision Tree understanding: example 1
Consider the same dataset. We can write a simple if-else statement to take the decision:

  Age | Lower berth (LB) | Ticket fare concession
  ----+------------------+-----------------------
   45 | Yes              | No concession
   60 | Yes              | Concession with LB
   62 | No               | Concession without LB
   58 | No               | No concession

  if Age < 60:
      print("No Concession")
  else:
      if Lower_berth == "Yes":
          print("Concession with LB")
      else:
          print("Concession without LB")

How do we find who will get a concession? Where is the tree?
Decision Tree understanding: example 1
● The nested if-else statements are our tree.

Will the passenger get a ticket fare concession?

  Age < 60?
  ├── Yes → No concession
  └── No  → Lower berth available?
            ├── Yes → Concession given with lower berth
            └── No  → Concession given without lower berth
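
A minimal sketch (not from the slides) that writes this tree as a Python function, to make the "nested if-else" reading concrete; the argument names are illustrative:

  def fare_concession(age, lower_berth_available):
      # The example-1 decision tree as nested if-else
      if age < 60:
          return "No concession"
      if lower_berth_available:
          return "Concession with lower berth"
      return "Concession without lower berth"

  # Check against the four rows of the example dataset
  for age, lb in [(45, True), (60, True), (62, False), (58, False)]:
      print(age, lb, "->", fare_concession(age, lb))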
Decision Tree understanding: example 2
● Consider the following dataset:
  [dataset shown as a figure on the slides; not reproduced here]
Decision Tree understanding: example 3
● [dataset and tree walk-through shown as figures on the slides; not reproduced here]
● The tree assigns Cheat to "No".
Decision tree geometric intuition
Decision tree: important points
● Programmatically, a decision tree is nothing but a giant structure of nested if-else conditions.
● Mathematically, a decision tree uses hyperplanes that run parallel to one of the axes, cutting the coordinate system into hyper-cuboids.
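
As a small illustration (not from the slides): a single split such as x1 <= 5.5 acts on one coordinate only, so it carves the plane into two axis-aligned regions. The points below mirror the review-length/readability example used later in the lecture.

  # A single axis-parallel split x1 <= t partitions 2-D points into two
  # axis-aligned regions (hyper-cuboids); repeated splits refine these boxes.
  points = [(1, 3), (2, 8), (9, 3.5), (10, 4), (12, 5)]
  t = 5.5
  left  = [p for p in points if p[0] <= t]
  right = [p for p in points if p[0] > t]
  print("region x1 <= 5.5:", left)
  print("region x1  > 5.5:", right)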
Decision Tree: core idea
● The core idea of building a decision tree is to identify the attribute at which we split the tree (or the dataset) such that the split gives the least impurity.
● There are several methods that can be used to find the best splitting attribute (a small sketch of both follows the list):
○ Gini index
○ Information gain
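
A minimal sketch of the two impurity measures named above (standard formulas, not code from the lecture), written for a list of class labels:

  from collections import Counter
  from math import log2

  def gini(labels):
      # Gini index: 1 - sum of squared class proportions
      n = len(labels)
      return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

  def entropy(labels):
      # Entropy: -sum of p * log2(p) over class proportions
      n = len(labels)
      return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

  print(gini([0, 0, 0, 1, 1]))     # 0.48
  print(entropy([0, 0, 0, 1, 1]))  # ~0.971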
Decision Tree: Example
● Consider the following dataset and draw a decision tree:

  Review Length | Readability | Class
  --------------+-------------+------
       10       |     4       |   0
        9       |    3.5      |   1
        2       |     8       |   0
        1       |     3       |   0
       12       |     5       |   1

● Step 1 is to find the split using information gain, which is computed from entropy.
○ Entropy is the uncertainty/randomness in the data; the more the randomness, the higher the entropy.
○ Information gain uses entropy to make decisions.
○ If the entropy after a split is low, the information gain is high, and there is a higher chance of getting pure subsets on the split.
Decision Tree: Example
● Consider the same dataset.
● Information gain = Entropy before split - Entropy after split
● Information gain (Class, ReviewLength) = Entropy (Class) - Entropy (Class, ReviewLength)
● Information gain (Class, Readability) = Entropy (Class) - Entropy (Class, Readability)
● Whichever attribute gives the higher information gain will be our splitting attribute.
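
Continuing the sketch (not from the slides): information gain as the parent entropy minus the size-weighted entropy of the child subsets produced by a split.

  from collections import Counter
  from math import log2

  def entropy(labels):
      n = len(labels)
      return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

  def information_gain(parent, children):
      # children: list of label lists obtained after the split
      n = len(parent)
      after = sum(len(ch) / n * entropy(ch) for ch in children)
      return entropy(parent) - after

  # e.g. a split that separates two 0s from the remaining {0, 1, 1}:
  print(information_gain([0, 0, 0, 1, 1], [[0, 0], [0, 1, 1]]))  # ~0.42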
Decision Tree: Example
● Consider the dataset sorted by Review Length; the candidate thresholds 1.5, 5.5, 9.5 and 11 lie midway between consecutive values:

  Review Length | Readability | Class
  --------------+-------------+------
        1       |     3       |   0
        2       |     8       |   0
        9       |    3.5      |   1
       10       |     4       |   0
       12       |     5       |   1

● Entropy (Class) = E(3, 2) = E(0.6, 0.4) = -(0.6 log2 0.6) - (0.4 log2 0.4) ≈ 0.97
● Information gain (Class, ReviewLength) = E(Class) - E(Class, ReviewLength). Consider the threshold 1.5:

  ReviewLength <= 1.5?
  ├── Yes → class counts: {1: 0, 0: 1}
  └── No  → class counts: {1: 2, 0: 2}
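
Worked numbers for this split (filled in here; the slides show only the counts):
  ● Left child {1: 0, 0: 1}: entropy 0 (pure). Right child {1: 2, 0: 2}: entropy 1.0.
  ● Entropy after split = (1/5)(0) + (4/5)(1.0) = 0.8
  ● Information gain at threshold 1.5 ≈ 0.97 - 0.8 ≈ 0.17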
Decision Tree: Example
● Entropy (Class) = E(3, 2) = E(0.6, 0.4) ≈ 0.97, as before.
● Information gain (Class, ReviewLength) = E(Class) - E(Class, ReviewLength). Now consider the threshold 5.5:

  ReviewLength <= 5.5?
  ├── Yes → class counts: {1: 0, 0: 2}
  └── No  → class counts: {1: 2, 0: 1}
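
Worked numbers for this split (filled in here for comparison):
  ● Left child {1: 0, 0: 2}: entropy 0 (pure). Right child {1: 2, 0: 1}: entropy ≈ 0.92.
  ● Entropy after split = (2/5)(0) + (3/5)(0.92) ≈ 0.55
  ● Information gain at threshold 5.5 ≈ 0.97 - 0.55 ≈ 0.42, higher than the ≈ 0.17 obtained at threshold 1.5.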
Decision Tree: Example
● Similarly, we try the other thresholds and find the information gain in each case.
● Then we pick the threshold that gives the maximum gain.
● For ReviewLength <= 5.5 we get the maximum gain, so the corresponding tree is:

  ReviewLength <= 5.5?
  ├── Yes → class counts: {1: 0, 0: 2}   Pure set; the class is 0.
  └── No  → class counts: {1: 2, 0: 1}   Impure set; a further split is required.
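
A small self-contained sketch (not from the slides) that evaluates all four candidate ReviewLength thresholds; it reproduces the choice of 5.5:

  from collections import Counter
  from math import log2

  def entropy(labels):
      n = len(labels)
      return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

  # (review_length, class) pairs from the example dataset
  data = [(1, 0), (2, 0), (9, 1), (10, 0), (12, 1)]
  labels = [c for _, c in data]

  for t in (1.5, 5.5, 9.5, 11):
      left = [c for x, c in data if x <= t]
      right = [c for x, c in data if x > t]
      after = (len(left) * entropy(left) + len(right) * entropy(right)) / len(data)
      print(f"threshold {t}: gain = {entropy(labels) - after:.3f}")
  # threshold 5.5 gives the largest gain (~0.42)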
Decision Tree: Example
● The pure left branch becomes a leaf with class 0; the right branch is impure, so a further split is required:

  ReviewLength <= 5.5?
  ├── Yes → Class 0
  └── No  → class counts: {1: 2, 0: 1}   Impure set; a further split is required.
Decision Tree: Example
● For the impure right branch (the rows with ReviewLength > 5.5), we repeat the procedure on the Readability attribute (candidate thresholds 3.8 and 4.5) and split at Readability <= 4.5:

  ReviewLength <= 5.5?
  ├── Yes → Class 0
  └── No  → Readability <= 4.5?
            ├── Yes → class counts: {1: 1, 0: 1}   Impure set; a further split is required.
            └── No  → class counts: {1: 1, 0: 0}   Pure set; the class is 1.
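
Worked numbers for the right-hand subset (filled in here; not shown on the slides):
  ● Parent counts {1: 2, 0: 1}: entropy ≈ 0.92.
  ● Readability <= 4.5: children {1: 1, 0: 1} and {1: 1}; entropy after split = (2/3)(1.0) + (1/3)(0) ≈ 0.67, gain ≈ 0.25.
  ● Readability <= 3.8: children {1: 1} and {1: 1, 0: 1}; the gain is again ≈ 0.25.
  ● On this small subset both candidate thresholds tie, and the slides proceed with Readability <= 4.5.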
Decision Tree: Example
● The "No" branch of Readability <= 4.5 is pure, so it becomes a leaf with class 1; the "Yes" branch is still impure and needs a further split:

  ReviewLength <= 5.5?
  ├── Yes → Class 0
  └── No  → Readability <= 4.5?
            ├── Yes → class counts: {1: 1, 0: 1}   Impure set; a further split is required.
            └── No  → Class 1
Decision Tree: Example
● The remaining impure node is split again on Readability, this time at Readability <= 3.8:

  ReviewLength <= 5.5?
  ├── Yes → Class 0
  └── No  → Readability <= 4.5?
            ├── Yes → Readability <= 3.8?
            │         ├── Yes → class counts: {1: 1, 0: 0}   (pure)
            │         └── No  → class counts: {1: 0, 0: 1}   (pure)
            └── No  → Class 1
Decision Tree: Example
● Both children are now pure, so they become leaves. The final classification tree:

  ReviewLength <= 5.5?
  ├── Yes → Class 0
  └── No  → Readability <= 4.5?
            ├── Yes → Readability <= 3.8?
            │         ├── Yes → Class 1
            │         └── No  → Class 0
            └── No  → Class 1

Final classification tree.
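
A small check (not part of the slides) that writes the final tree as nested if-else, echoing the "giant nested if-else" view from earlier, and verifies it on the five training rows:

  def predict(review_length, readability):
      # The final classification tree as nested if-else
      if review_length <= 5.5:
          return 0
      if readability <= 4.5:
          return 1 if readability <= 3.8 else 0
      return 1

  rows = [(10, 4, 0), (9, 3.5, 1), (2, 8, 0), (1, 3, 0), (12, 5, 1)]
  print(all(predict(rl, rd) == c for rl, rd, c in rows))  # True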