Ho Chi Minh University of Banking
Department of Economic Mathematics
Machine Learning
Decision Tree
Vuong Trong Nhân (nhanvt@hub.edu.vn)
Outline
Decision tree representation
ID3 learning algorithm
Which attribute is best?
C4.5: real valued attributes
Which hypothesis is best?
Noise
From Trees to Rules
Miscellaneous
2
Decision Tree Representation
Day Outlook Temperature Humidity Wind PlayTennis
D1 Sunny Hot High Weak No
D2 Sunny Hot High Strong No
D3 Overcast Hot High Weak Yes
D4 Rain Mild High Weak Yes
D5 Rain Cool Normal Weak Yes
D6 Rain Cool Normal Strong No
D7 Overcast Cool Normal Strong Yes
D8 Sunny Mild High Weak No
D9 Sunny Cool Normal Weak Yes
D10 Rain Mild Normal Weak Yes
D11 Sunny Mild Normal Strong Yes
D12 Overcast Mild High Strong Yes
D13 Overcast Hot Normal Weak Yes
D14 Rain Mild High Strong No
Outlook, Temperature, etc.: attributes
PlayTennis: class
Shall I play tennis today?
3
Decision Tree for PlayTennis
Outlook?
  Sunny    → Humidity?  (High → No,   Normal → Yes)
  Overcast → Yes
  Rain     → Wind?      (Strong → No, Weak → Yes)
4
Alternative Decision Tree for PlayTennis
Splitting first on Temperature:

Temperature?
  hot  → {D1, D2, D3, D13} → Humidity?
           Normal → {D13} → YES
           High   → {D1, D2, D3} → Wind?
                      Strong → {D2} → NO
                      Weak   → {D1, D3} → Outlook?
                                 Sunny    → {D1} → NO
                                 Overcast → {D3} → YES
  mild → {D4, D8, D10, D11, D12, D14} → ...
  cool → {D5, D6, D7, D9} → ...

What is different? The sequence of attributes influences the size and shape of the tree.
5
Occam’s Principle
Occam’s Principle:
“If two theories explain the
facts equally well, then the
simpler theory is preferred”
Prefer the smallest tree that correctly classifies all
training examples.
6
Decision Trees
Decision tree representation:
• Each internal node tests an attribute
• Each branch corresponds to attribute value
• Each leaf node assigns a classification
How would we represent:
• ∧, ∨, XOR

Example XOR(A, B):

A?
  yes → B?  (yes → NO,  no → YES)
  no  → B?  (yes → YES, no → NO)
7
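As a small illustration (not from the slides), the XOR tree above can be written as nested attribute tests; the function name xor_tree is purely illustrative:

```python
# The XOR decision tree above, written as nested tests (illustrative sketch).
def xor_tree(a: bool, b: bool) -> bool:
    if a:                 # branch A = yes: test B
        return not b      # B = yes -> NO, B = no -> YES
    else:                 # branch A = no: test B
        return b          # B = yes -> YES, B = no -> NO

# check against the XOR truth table
assert [xor_tree(a, b) for a in (True, False) for b in (True, False)] == [False, True, True, False]
```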
When to Consider Decision Trees
Instances describable by attribute–value pairs
Target function is discrete valued
Disjunctive hypothesis may be required
Possibly noisy training data
Interpretable result of learning is required
Examples:
Medical diagnosis
Text classification
Credit risk analysis
8
Top-Down Induction of Decision Trees, ID3
ID3 (Quinlan, 1986) operates on the whole training set S
Algorithm (a Python sketch follows after this slide):
1. create a new node
2. If current training set is sufficiently pure:
• Label node with respective class
• We’re done
3. Else:
• x → the “best” decision attribute for current training set
• Assign x as decision attribute for node
• For each value of x, create new descendant of node
• Sort training examples to leaf nodes
• Iterate over new leaf nodes and apply algorithm recursively
9
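The steps above can be sketched in Python roughly as follows. This is a simplified, illustrative implementation (function and variable names are not from the slides): it uses "all examples share one class" as the purity test and information gain (defined later in these slides) as the "best attribute" criterion.

```python
# Simplified ID3 sketch: examples are dicts of attribute -> value, labels are class values.
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(examples, labels, attribute):
    remainder = 0.0
    for value in set(ex[attribute] for ex in examples):
        subset = [lab for ex, lab in zip(examples, labels) if ex[attribute] == value]
        remainder += (len(subset) / len(labels)) * entropy(subset)
    return entropy(labels) - remainder

def id3(examples, labels, attributes):
    # 2. If the current training set is sufficiently pure, label the node with that class.
    if len(set(labels)) == 1:
        return labels[0]
    if not attributes:                                  # no attributes left: majority class
        return Counter(labels).most_common(1)[0][0]
    # 3. Pick the "best" attribute, create one descendant per value, and recurse.
    best = max(attributes, key=lambda a: information_gain(examples, labels, a))
    node = {best: {}}
    for value in set(ex[best] for ex in examples):
        sub_ex  = [ex for ex in examples if ex[best] == value]
        sub_lab = [lab for ex, lab in zip(examples, labels) if ex[best] == value]
        node[best][value] = id3(sub_ex, sub_lab, [a for a in attributes if a != best])
    return node
```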
Example ID3
• Look at current training set S
• Determine best attribute
• Split training set according to different values
10
Example ID3
• Tree
• Apply algorithm recursively
11
Example – Resulting Tree
Outlook?
  Sunny    → Humidity?  (High → No,   Normal → Yes)
  Overcast → Yes
  Rain     → Wind?      (Strong → No, Weak → Yes)
12
ID3 – Intermediate Summary
• Recursive splitting of the training set
• Stop, if current training set is sufficiently pure
• ... What does "pure" mean? Can we allow for errors?
• What is the best attribute?
• How can we tell that the tree is really good?
• How shall we deal with continuous values?
13
Which attribute is best?
• Assume a training set {+, +, −, −, +, −, +, +, −, −}
(only classes)
• Assume binary attributes x1, x2, and x3
• Produced splits:

         Value 1              Value 2
  x1     {+, +, −, −, +}      {−, +, +, −, −}
  x2     {+}                  {+, −, −, +, −, +, +, −, −}
  x3     {+, +, +, +, −}      {−, −, −, −, +}
• No attribute is perfect
• Which one to choose?
14
Entropy
• p⊕ is the proportion of positive examples
• p⊖ is the proportion of negative examples
• Entropy measures the impurity of S:

  Entropy(S) ≡ −p⊕ log2 p⊕ − p⊖ log2 p⊖

[Figure: Entropy(S) plotted against p⊕; it is 0 at p⊕ = 0 and p⊕ = 1 and reaches its maximum of 1 at p⊕ = 0.5.]

• Information can be seen as the negative of entropy
15
Entropy
S = {+ + + + + + + + +, − − − − −} = {9+, 5−}. Entropy(S) = ?
Entropy(S) = −9/14 · log2(9/14) − 5/14 · log2(5/14) ≈ 0.94
S = {+ + + + + + + +, − − − − − −} = {8+, 6−}. Entropy(S) = ?
Entropy(S) = −8/14 · log2(8/14) − 6/14 · log2(6/14) ≈ 0.985
S = {+ + + + + + + + + + + + + +} = {14+}. Entropy(S) = ?
Entropy(S) = 0
S = {+ + + + + + + − − − − − − −} = {7+, 7−}. Entropy(S) = ?
Entropy(S) = 1
16
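A quick Python check of these numbers (an illustrative helper, not part of the slides; it uses log base 2, as above):

```python
# Entropy of a set with 'pos' positive and 'neg' negative examples (sketch).
import math

def entropy(pos, neg):
    total = pos + neg
    result = 0.0
    for p in (pos / total, neg / total):
        if p > 0:                       # 0 * log2(0) is treated as 0
            result -= p * math.log2(p)
    return result

print(entropy(9, 5))    # ≈ 0.940
print(entropy(8, 6))    # ≈ 0.985
print(entropy(14, 0))   # 0.0
print(entropy(7, 7))    # 1.0
```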
Entropy
• If all members of S belong to the same class: Entropy(S) = 0 (the purest set)
• If the numbers of positive and negative examples are equal (p⊕ = p⊖ = 0.5): Entropy(S) = 1 (maximum impurity)
• If the numbers of positive and negative examples are unequal: entropy is between 0 and 1

[Figure: the entropy curve Entropy(S) as a function of p⊕.]
17
Information Gain
• Measuring attribute x creates subsets S1 and S2 with
different entropies
• Taking the weighted mean of Entropy(S1) and Entropy(S2)
gives the conditional entropy Entropy(S|x), i.e. in general:

  Entropy(S|x) = Σ_{v ∈ Values(x)} (|S_v| / |S|) · Entropy(S_v)

• → Choose the attribute that maximizes the difference:

  Gain(S, x) = Entropy(S) − Entropy(S|x)

• Gain(S, x) = expected reduction in entropy due to partitioning
on x:

  Gain(S, x) = Entropy(S) − Σ_{v ∈ Values(x)} (|S_v| / |S|) · Entropy(S_v)
18
Information Gain: Definition
Gain(S, x) = Entropy(S) − Σ_{v ∈ Values(x)} (|S_v| / |S|) · Entropy(S_v)

Values(x): the set of all possible values for the attribute x
S_v: the subset of S for which x has value v
Information Gain is a measure of the effectiveness of an
attribute in classifying data.
It is the expected reduction in entropy caused by
partitioning the objects according to this attribute.
19
Example - Training Set
Day Outlook Temperature Humidity Wind PlayTennis
D1 Sunny Hot High Weak No
D2 Sunny Hot High Strong No
D3 Overcast Hot High Weak Yes
D4 Rain Mild High Weak Yes
D5 Rain Cool Normal Weak Yes
D6 Rain Cool Normal Strong No
D7 Overcast Cool Normal Strong Yes
D8 Sunny Mild High Weak No
D9 Sunny Cool Normal Weak Yes
D10 Rain Mild Normal Weak Yes
D11 Sunny Mild Normal Strong Yes
D12 Overcast Mild High Strong Yes
D13 Overcast Hot Normal Weak Yes
D14 Rain Mild High Strong No
20
Example
Gain(S, x) = Entropy(S) − Σ_{v ∈ Values(x)} (|S_v| / |S|) · Entropy(S_v)

For the top node: S = {9+, 5−}, Entropy(S) = 0.94

Attribute Wind:
  S_weak   = {6+, 2−}, |S_weak| = 8    (Weak:   D3, D4, D5, D9, D10, D13 → Yes;  D1, D8 → No)
  S_strong = {3+, 3−}, |S_strong| = 6  (Strong: D7, D11, D12 → Yes;  D2, D6, D14 → No)

  Entropy(S_weak)   = −6/8 · log2(6/8) − 2/8 · log2(2/8) ≈ 0.81
  Entropy(S_strong) = 1

Expected entropy when splitting on attribute 'Wind':
  Entropy(S | Wind) = 8/14 · Entropy(S_weak) + 6/14 · Entropy(S_strong) ≈ 0.89

Gain(S, Wind) = 0.94 − 0.89 ≈ 0.05
21
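The same computation can be reproduced with a short Python sketch (illustrative names; only the Wind and PlayTennis columns of the table are encoded):

```python
# Reproducing Gain(S, Wind) for the 14 PlayTennis examples (sketch).
import math
from collections import Counter

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def gain(values, labels):
    base = entropy(labels)
    remainder = 0.0
    for v in set(values):
        subset = [lab for val, lab in zip(values, labels) if val == v]
        remainder += (len(subset) / len(labels)) * entropy(subset)
    return base - remainder

# Wind and PlayTennis columns for D1..D14
wind = ["Weak", "Strong", "Weak", "Weak", "Weak", "Strong", "Strong",
        "Weak", "Weak", "Weak", "Strong", "Strong", "Weak", "Strong"]
play = ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
        "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"]

print(round(gain(wind, play), 3))   # ≈ 0.048
```

Applying the same gain function to the Outlook, Humidity, and Temperature columns reproduces the values on the next slide (0.246, 0.151, and 0.029).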
Selecting the Next Attribute
• For the whole training set:

  Gain(S, Outlook)     = 0.246
  Gain(S, Humidity)    = 0.151
  Gain(S, Wind)        = 0.048
  Gain(S, Temperature) = 0.029

  → Outlook should be used to split the training set!

• Further down in the tree, Entropy(S) is computed locally
• Usually, the tree does not have to be minimized
• This is a reason for the good performance of ID3!
22
Next step in growing the decision tree
23
The Resulting Decision Tree & Its Rules
24
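Since each root-to-leaf path of the resulting tree is one IF-THEN rule, the PlayTennis tree can be encoded as a handful of rules; a minimal Python sketch (the function name is illustrative):

```python
# The PlayTennis tree as IF-THEN rules, one rule per root-to-leaf path (sketch).
def play_tennis(outlook, humidity, wind):
    if outlook == "Sunny" and humidity == "High":    return "No"
    if outlook == "Sunny" and humidity == "Normal":  return "Yes"
    if outlook == "Overcast":                        return "Yes"
    if outlook == "Rain" and wind == "Strong":       return "No"
    if outlook == "Rain" and wind == "Weak":         return "Yes"

print(play_tennis("Sunny", "Normal", "Weak"))   # Yes (matches example D9)
```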
Some issues: Real-Valued Attributes
Temperature = 82.5
Create discrete attributes to test continuous:
(Temperature > 54) = true or = false
Sort attribute values that occur in training set:
Temperature: 40 48 60 72 80 90
PlayTennis: No No Yes Yes Yes No
Determine points where the class changes
Candidates are (48 + 60) / 2 and (80 + 90) / 2
Select the best one using information gain (a sketch follows below)
Implemented in the system C4.5 (successor of ID3)
25
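A rough Python sketch of this threshold selection (illustrative code, not C4.5 itself), using the Temperature example above:

```python
# Choosing a split threshold for a continuous attribute by information gain (sketch).
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

temperature = [40, 48, 60, 72, 80, 90]          # already sorted
play        = ["No", "No", "Yes", "Yes", "Yes", "No"]

# candidate thresholds: midpoints where the class label changes
pairs = list(zip(zip(temperature, play), zip(temperature[1:], play[1:])))
candidates = [(a + b) / 2 for (a, la), (b, lb) in pairs if la != lb]
print(candidates)                               # [54.0, 85.0]

def gain_for_threshold(t):
    left  = [lab for x, lab in zip(temperature, play) if x <= t]
    right = [lab for x, lab in zip(temperature, play) if x > t]
    remainder = (len(left) / len(play)) * entropy(left) + (len(right) / len(play)) * entropy(right)
    return entropy(play) - remainder

best = max(candidates, key=gain_for_threshold)
print(best, round(gain_for_threshold(best), 3))  # 54.0 gives the larger gain (≈ 0.459)
```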
Some issues: Noise
Consider adding noisy (=wrongly labeled) training example #15:
Sunny, Mild, Normal, Weak, PlayTennis = No
(i.e. Outlook = Sunny, Humidity = Normal)

What effect on the earlier tree?

Outlook?
  Sunny    → Humidity?  (High → No,   Normal → Yes)
  Overcast → Yes
  Rain     → Wind?      (Strong → No, Weak → Yes)
26
Some issues: Overfitting

Outlook?
  Sunny    → Humidity?  (High → No,   Normal → Yes)
  Overcast → Yes
  Rain     → Wind?      (Strong → No, Weak → Yes)
• The algorithm will introduce a new test
• This is unnecessary, because the new example was erroneous due to the
presence of noise
→ Overfitting corresponds to learning coincidental regularities
• Unfortunately, we generally do not know which examples are noisy
• ... nor the amount, e.g. the percentage, of noisy examples
27
Some issues: Overfitting
An example: continuing to grow the tree can improve the accuracy
on the training data, but hurt the accuracy on the test data.
28
[Mitchell, 1997]
Overfitting: solutions
Some solutions:
Stop learning early: stop growing the tree before it fits the
training data perfectly.
Prune the full tree: grow the tree to its full size, and then
post-prune it.
It is hard to decide when to stop learning.
Post-pruning the tree empirically results in better
performance. But:
How do we decide the right size of the tree?
When do we stop pruning?
We can use a validation set for pruning, e.g.
reduced-error pruning or rule post-pruning (a sketch follows below)
29
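As one concrete way to use a validation set, here is a sketch based on scikit-learn's cost-complexity pruning (assuming scikit-learn and its built-in breast-cancer dataset are available). This is not reduced-error or rule post-pruning, but it follows the same grow-then-prune idea: grow a large tree, then keep the pruned version that performs best on held-out data.

```python
# Pruning a decision tree with a validation set via cost-complexity pruning (sketch).
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# candidate pruning strengths derived from the fully grown tree
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X_train, y_train)

best_alpha, best_score = 0.0, 0.0
for alpha in path.ccp_alphas:
    alpha = max(alpha, 0.0)                       # guard against tiny negative values
    tree = DecisionTreeClassifier(random_state=0, ccp_alpha=alpha).fit(X_train, y_train)
    score = tree.score(X_val, y_val)              # accuracy on the validation set
    if score > best_score:
        best_alpha, best_score = alpha, score

print(best_alpha, best_score)
```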
Summary
A decision tree is a Machine Learning method that
can perform both classification and regression
tasks.
A decision tree represents a function as a tree.
Each decision tree can be interpreted as a set of
IF-THEN rules.
Decision trees have been used in many practical
applications.
30
Advantages & Disadvantages
Advantages
It is simple to understand, as it follows the same
process a human follows when making a
decision in real life.
It can be very useful for solving decision-related
problems.
It helps to think about all the possible outcomes of
a problem.
It requires less data cleaning than many other
algorithms.
31
Advantages & Disadvantages
Disadvantages
A decision tree can contain many layers, which
makes it complex.
It may suffer from overfitting, which can be
mitigated by the Random Forest algorithm.
With more class labels, the computational
complexity of the decision tree may increase.
32
Random forests
Random forests (RF) is a method by Leo Breiman
(2001) for both classification and regression.
Main idea: the prediction is based on a combination of many
decision trees, by taking the average of all individual
predictions.
Each tree in an RF is simple but random.
Each tree is grown differently, depending on the
choice of attributes and training data.
33
Random forests
RF is currently one of the most popular and accurate
methods [Fernández-Delgado et al., 2014]
It is also very general.
RF can be implemented easily and efficiently.
It can work with problems of very high dimensions,
without overfitting
34
How Random Forests Work
35
RF: three basic ingredients
Randomization and no pruning:
For each tree and at each node, we randomly select a subset of
attributes.
Find the best split, and then grow the appropriate subtrees.
Every tree is grown to its largest size without pruning.
Combination: each prediction is made by taking the
average of the predictions of the individual trees.
Bagging: the training set for each tree is generated by
sampling (with replacement) from the original data.
(A sketch of these three ingredients follows below.)
36
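A minimal sketch of these three ingredients with scikit-learn (assumed available): RandomForestClassifier grows unpruned trees on bootstrap samples, picks a random attribute subset at each node, and combines the trees' predictions.

```python
# Random forest: bagging + random attribute subsets per node + combining many trees (sketch).
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

rf = RandomForestClassifier(
    n_estimators=100,      # combination: aggregate the predictions of 100 trees
    max_features="sqrt",   # randomization: random subset of attributes at each node
    bootstrap=True,        # bagging: each tree sees a bootstrap sample of the data
    random_state=0,
)
rf.fit(X_train, y_train)
print(rf.score(X_test, y_test))   # accuracy of the combined forest on held-out data
```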
Exercise
1. Build the decision tree.
2. Predict:
   Customer (15, youth, medium, no, fair)
   Customer (16, senior, low, yes, excellent)
37