Decision Tree Classification
A decision tree builds classification or regression models in the form of a tree structure. It breaks
down a dataset into smaller and smaller subsets while, at the same time, an associated decision
tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes.
A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast and Rainy).
A leaf node (e.g., Play) represents a classification or decision.
The topmost decision node in a tree, which corresponds to the best predictor, is called the root node.
Decision trees can handle both categorical and numerical data.
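As a rough sketch of this structure (Python; the attribute and branch names follow the Outlook example above, and the leaf labels here are placeholder classifications, since the actual tree for this data is only derived later in these notes), a decision node can be represented as a nested dict and a leaf as a plain class label:

# A decision node is a dict keyed by its attribute; each branch maps an
# attribute value to another decision node or to a leaf (a classification).
tree = {"Outlook": {"Sunny": "No", "Overcast": "Yes", "Rainy": "Yes"}}

def predict(node, record):
    """Follow the record's attribute values from the root down to a leaf."""
    if not isinstance(node, dict):        # leaf node: return the classification
        return node
    attribute, branches = next(iter(node.items()))
    return predict(branches[record[attribute]], record)

print(predict(tree, {"Outlook": "Overcast"}))   # prints: Yes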
Information theory
Introduced by Claude Shannon, information theory quantifies entropy, a key measure of information
usually expressed as the average number of bits needed to store or communicate one symbol in a
message.
Information theory measures information in bits.
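A minimal sketch of this idea in Python: the self-information -log2(p) of a single symbol is the quantity whose average gives the bit counts above (the function name is just for illustration).

from math import log2

def information_bits(p):
    """Information content, in bits, of observing a symbol of probability p."""
    return -log2(p)

print(information_bits(0.5))    # 1.0 bit, e.g. the outcome of a fair coin toss
print(information_bits(0.125))  # 3.0 bits: rarer symbols carry more information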
ID3 Algorithm:
Based on a greedy search through the space of all possible branches, with no backtracking
Includes all predictors
Entropy
A decision tree is built top-down from a root node and involves partitioning the data into
subsets that contain instances with similar values (homogeneous). The ID3 algorithm uses entropy to
calculate the homogeneity of a sample. If the sample is completely homogeneous, the entropy is
zero; if the sample is equally divided, the entropy is one.
Entropy(p1, p2, …, pn) = -p1 log2(p1) - p2 log2(p2) - … - pn log2(pn)
To build a decision tree, we need to calculate two types of entropy using frequency tables as
follows:
a) Entropy using the frequency table of one attribute: E(S) = Σ -p(i) log2 p(i), where p(i) is the
proportion of records in class i.
b) Entropy using the frequency table of two attributes: E(T, X) = Σ P(c) × E(c), i.e., the entropy of
each branch of attribute X weighted by the proportion of records that fall into that branch.
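A minimal Python sketch of these two calculations, assuming the frequency tables are supplied as plain lists of class counts (the helper names are illustrative):

from math import log2

def entropy(counts):
    """a) Entropy from the class counts of a single frequency table."""
    total = sum(counts)
    return sum(-(c / total) * log2(c / total) for c in counts if c > 0)

def expected_entropy(branch_counts):
    """b) Entropy after a split: each branch's entropy weighted by its share of records."""
    total = sum(sum(branch) for branch in branch_counts)
    return sum((sum(branch) / total) * entropy(branch) for branch in branch_counts)

print(entropy([4, 0]))   # 0.0 -- a completely homogeneous sample
print(entropy([7, 7]))   # 1.0 -- an equally divided sample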
Information Gain
The information gain is based on the decrease in entropy after a dataset is split on an attribute.
Constructing a decision tree is all about finding the attribute that returns the highest information
gain (i.e., the most homogeneous branches).
Step 1: Calculate entropy of the target.
Step 2: The dataset is then split on the different attributes. The entropy for each branch is
calculated and then added proportionally to get the total entropy for the split. The resulting
entropy is subtracted from the entropy before the split; the result is the Information Gain, or
decrease in entropy.
Step 3: Choose attribute with the largest information gain as the decision node, divide the
dataset by its branches and repeat the same process on every branch.
Step 4a: A branch with entropy of 0 is a leaf node.
Step 4b: A branch with entropy more than 0 needs further splitting.
Step 5: The ID3 algorithm is run recursively on the non-leaf branches, until all data is classified.
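A compact sketch of Steps 1-5 in Python, assuming the records are a list of dicts keyed by attribute name with the class stored under a "Play" column as in the example below; the entropy helper from the earlier sketch is repeated so the code stands alone:

from collections import Counter
from math import log2

def entropy(counts):
    total = sum(counts)
    return sum(-(c / total) * log2(c / total) for c in counts if c > 0)

def information_gain(records, attribute, target):
    """Step 2: entropy of the target minus the expected entropy after the split."""
    before = entropy(Counter(r[target] for r in records).values())
    after = 0.0
    for value in set(r[attribute] for r in records):
        subset = [r for r in records if r[attribute] == value]
        after += len(subset) / len(records) * entropy(Counter(r[target] for r in subset).values())
    return before - after

def id3(records, attributes, target="Play"):
    labels = [r[target] for r in records]
    if len(set(labels)) == 1:                         # Step 4a: entropy 0, make a leaf
        return labels[0]
    if not attributes:                                # no predictors left: majority class
        return Counter(labels).most_common(1)[0][0]
    best = max(attributes, key=lambda a: information_gain(records, a, target))  # Step 3
    return {best: {value: id3([r for r in records if r[best] == value],
                              [a for a in attributes if a != best], target)
                   for value in set(r[best] for r in records)}}   # Steps 4b/5: recurse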
Decision Tree to Decision Rules
A decision tree can easily be transformed into a set of rules by mapping each path from the root
node to a leaf node, one rule per leaf.
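Assuming the nested-dict tree representation used in the sketches above, this mapping can be written as a short recursive walk over every root-to-leaf path (the helper name is illustrative):

def tree_to_rules(node, conditions=()):
    """Enumerate each root-to-leaf path of a nested-dict tree as an IF ... THEN rule."""
    if not isinstance(node, dict):                    # reached a leaf: emit one rule
        lhs = " AND ".join(f"{a} = {v}" for a, v in conditions)
        return [f"IF {lhs} THEN {node}"]
    attribute, branches = next(iter(node.items()))
    rules = []
    for value, child in branches.items():
        rules += tree_to_rules(child, conditions + ((attribute, value),))
    return rules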
Detailed Explanation of the Calculations
Overall Entropy
The target ("Play") has 9 "Yes" and 5 "No" out of 14 records:
E(9,5) = E(9/14, 5/14) = -[(9/14) log2 (9/14) + (5/14) log2 (5/14)] = 0.940
Expected new entropy for each attribute
Outlook
Outlook attribute contains 3 distinct values: Overcast, Rainy, Sunny
Overcast: 4 records; 4 are “Yes”
E(4,0) = -[(4/4) log2 (4/4)] = 0
Rainy: 5 Records; 3 are “Yes”, 2 are “No”
E(3,2) = E(0.6, 0.4)
= -[(0.6) log2 (0.6) + (0.4) log2 (0.4)]
= -[(0.6) (-0.7369) + (0.4) (-1.3219)]
= 0.971
Sunny: 5 records; 2 are "Yes", 3 are "No"
E(2,3) = E(0.4, 0.6)
= -[(0.4) log2 (0.4) + (0.6) log2 (0.6) ]
= 0.971
Thus, Expected New Entropy =
(4/14) * 0 + (5/14) * 0.971 + (5/14) * 0.971
= 0.693
Gain (Outlook): 0.940 – 0.693 = 0.247
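The same numbers can be checked directly from the entropy formula; a small self-contained sketch, where E(yes, no) is simply the entropy of a (Yes, No) count pair:

from math import log2

def E(yes, no):
    """Entropy of a (Yes, No) count pair, as in the hand calculations above."""
    total = yes + no
    return sum(-(c / total) * log2(c / total) for c in (yes, no) if c > 0)

overall = E(9, 5)                                             # 0.940
new = (4/14) * E(4, 0) + (5/14) * E(3, 2) + (5/14) * E(2, 3)  # expected new entropy
print(round(overall - new, 3))                                # 0.247 = Gain(Outlook)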
Temp
Temp attribute contains 3 distinct values: Hot, Mild, Cool
Hot: 4 records; 2 are “Yes”, 2 are “No”
E(2,2) = E(0.5, 0.5) = -[(0.5) log2 (0.5) + (0.5) log2 (0.5)]
= 1.0 (since log2 (0.5) = -1)
Mild: 6 records; 4 are “Yes”, 2 are “No”
E(4,2) = E(0.666, 0.333) = -[(0.666) log2 (0.666) + (0.333) log2 (0.333)]
= -[(0.666) (-0.585) + (0.333) (-1.585)] = 0.918
Cool: 4 records; 3 are "Yes", 1 is "No"
E(3,1) = E(0.75, 0.25)= -[(0.75) log2 (0.75) + (0.25) log2 (0.25)]
= -[(0.75) (-0.4150) + (0.25) (-2)]
= 0.81125
Thus, Expected New Entropy =
(4/14) * (1) + (6/14) * 0.918 + (4/14) * 0.81125
= 0.9109
Gain (Temp): 0.940 – 0.9109 = 0.029
Similarly, you can compute
Gain (Humidity) = .152
Gain (Windy) = .048
The gain from splitting at Outlook is the largest, so we take Outlook as the decision node and divide
the dataset by its branches.
The Overcast branch is already homogeneous (all "Yes"), so it becomes a leaf node (Step 4a); repeat
the steps for the Rainy and Sunny branches.
(Calculation sheets circulated)
Rainy
Overall Entropy
E(Rainy): Yes - 3; No - 2
E(3,2) = E(0.6, 0.4) = -[(0.6) log2 (0.6) + (0.4) log2 (0.4)] = -[(0.6) (-0.737) + (0.4) (-1.322)] = 0.971
Now compute for temp, humidity and windy
Temp
2 distinct values: mild, cool
Mild: yes: 2; no: 1
E(2,1) = E(0.67, 0.33) = -[(0.67) log2 (0.67) + (0.33) log2 (0.33)] = -[(0.67) (-0.577) + (0.33) (-1.599)] = 0.914
Cool: yes: 1; no: 1
E(1,1) = E(0.5, 0.5) = -[(0.5) log2 (0.5) + (0.5) log2 (0.5)] = -[(0.5) (-1) + (0.5) (-1)] = 1.0
Expected New Entropy for Temp= (3/5) (0.914) + (2/5)(1.0)= 0.9484
Gain (Temp) = 0.971 – 0.9484 = .0226
Humidity
2 distinct values: high, normal
high: yes: 1; no: 1
E(1,1) = E(0.5, 0.5) = 1.0
normal: yes: 2; no: 1
E(2,1) = E(0.67, 0.33) = 0.914
Expected New Entropy for humidity= (2/5) (1.0) + (3/5) (0.914)= 0.9484
Gain (humidity) = 0.971 – 0.9484 = .0226
Windy
2 distinct values: false, true
false: yes: 3; no: 0
E(3,0) = -[(3/3) log2 (3/3)] = 0
true: yes: 0; no: 2
E(0,2) = -[(2/2) log2 (2/2)] = 0
Expected New Entropy for Windy = (3/5) (0) + (2/5) (0) = 0
Gain (windy) = 0.971 – 0 = .971 (Maximum)
Thus, split at Windy
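A quick numerical check of this branch from the (Yes, No) counts above; the exact gains for Temp and Humidity come out near 0.020 (the 0.0226 figures above reflect rounding the proportions to 0.67 and 0.33):

from math import log2

def E(yes, no):
    """Entropy of a (Yes, No) count pair."""
    total = yes + no
    return sum(-(c / total) * log2(c / total) for c in (yes, no) if c > 0)

before = E(3, 2)                                        # 0.971
print(before - ((3/5) * E(2, 1) + (2/5) * E(1, 1)))     # Temp:     ~0.020
print(before - ((2/5) * E(1, 1) + (3/5) * E(2, 1)))     # Humidity: ~0.020
print(before - ((3/5) * E(3, 0) + (2/5) * E(0, 2)))     # Windy:     0.971 (maximum)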
Sunny
Overall Entropy
E(Sunny): Yes - 2; No - 3
E(2,3) = E(0.4, 0.6) = 0.971
Now compute expected new entropy for temp, humidity and windy
Temp
3 distinct values: hot, mild, cool
Hot: yes: 0; no: 2
E(0,2) = 0
Mild: yes: 1; no: 1
E(1,1) = 1
Cool: yes: 1; no: 0
E(1,0) = 0
Expected New Entropy for Temp= (2/5) (0) + (2/5)(1.0) + (1/5)(0)= 0.40
Gain (Temp) = 0.971 – 0.400 = .571
Humidity
2 distinct values: high, normal
high: yes: 0; no: 3
E(0,3) = 0
normal: yes: 2; no: 0
E(2,0) = 0
Expected New Entropy for humidity= (3/5) (0) + (2/5) (0)= 0
Gain (humidity) = 0.971 – 0 = .971 (Maximum)
Windy
2 distinct values: false, true
false: yes: 1; no: 2
E(1,2) = 0.914
true: yes: 1; no: 1
E(1,1) = 1
Expected New Entropy for Windy = (3/5) (0.914) + (2/5) (1) = 0.9484
Gain (windy) = 0.971 – 0.9484= .0226
Thus, split at Humidity.
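Putting the three splits together, the finished tree can be written out directly as decision rules; a short sketch in Python, assuming value spellings such as "False" and "Normal" for the record fields:

def play(record):
    """The finished tree from the walkthrough above, written as decision rules."""
    if record["Outlook"] == "Overcast":                        # pure branch: always Yes
        return "Yes"
    if record["Outlook"] == "Rainy":                           # Rainy splits on Windy
        return "Yes" if record["Windy"] == "False" else "No"
    return "Yes" if record["Humidity"] == "Normal" else "No"   # Sunny splits on Humidity

print(play({"Outlook": "Sunny", "Humidity": "Normal", "Windy": "True"}))  # Yes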