Feature Selection
Machine learning
Prepared by Abdelrahman Hassan
Agenda
1. Introduction
2. What is feature selection?
3. Feature selection models
4. How to choose a feature selection model?
Introduction
• The input variables that we give to our machine learning models are called features; each column in our dataset constitutes a feature. To train an optimal model, we need to make sure that we use only the essential features. If we have too many features, the model can capture unimportant patterns and learn from noise. The method of choosing the important features of our data is called Feature Selection.
Cont.
• To train a model, we collect enormous quantities of data to help the machine learn better. Usually, a good portion of the data collected is noise, and some of the columns of our dataset might not contribute significantly to the performance of our model. Further, having a lot of data can slow down the training process and make the resulting model slower. The model may also learn from this irrelevant data and be inaccurate.
Cont.
• Consider a table that contains information on old cars; the model decides which cars must be crushed for spare parts.
[Table: car model, year of manufacture, miles traveled, previous owner's name]
Cont.
• In the above table, we can see that the model of the car, the year of manufacture, and the miles it has traveled are important for deciding whether the car is old enough to be crushed. However, the name of the previous owner has no bearing on whether the car should be crushed; worse, it can confuse the algorithm into finding spurious patterns between names and the other features. Hence, we can drop the column, as the sketch below illustrates.
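As a minimal sketch of dropping that column with pandas (the column names and values here are hypothetical, chosen to match the example above):

import pandas as pd

# Hypothetical car data matching the example above.
cars = pd.DataFrame({
    "model": ["Civic", "Corolla", "Beetle"],
    "year": [1998, 2003, 1975],
    "miles": [180_000, 95_000, 220_000],
    "previous_owner": ["Alice", "Bob", "Carol"],  # carries no signal
})

# Drop the irrelevant column before training.
cars = cars.drop(columns=["previous_owner"])
print(cars.columns.tolist())  # ['model', 'year', 'miles']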
What is feature selection?
• Feature Selection is the method of reducing the number of input variables to your model by using only relevant data and getting rid of noise in the data.
Feature selection models
• Feature selection models are of two types (see the sketch after this list):
1. Supervised Models: Supervised feature selection refers to methods that use the output label class for feature selection. They use the target variable to identify the features that can increase the efficiency of the model.
2. Unsupervised Models: Unsupervised feature selection refers to methods that do not need the output label class. We use them for unlabeled data.
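As a minimal sketch of the two flavors (using scikit-learn, with hypothetical data): a supervised selector scores features against the target y, while an unsupervised one, such as a variance threshold, never looks at y.

import numpy as np
from sklearn.feature_selection import SelectKBest, VarianceThreshold, f_classif

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))     # hypothetical feature matrix
y = (X[:, 0] > 0).astype(int)     # target depends only on feature 0
X[:, 4] = 0.0                     # make feature 4 constant (zero variance)

# Supervised: scores each feature against the target y.
supervised = SelectKBest(score_func=f_classif, k=2).fit(X, y)
print(supervised.get_support())   # feature 0 should be among the selected

# Unsupervised: no y needed; drops near-constant features.
unsupervised = VarianceThreshold(threshold=0.0).fit(X)
print(unsupervised.get_support()) # feature 4 is dropped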
Cont.
• Filter Method: In this method, features are dropped based on their relationship to the output, that is, how strongly they correlate with it. We use correlation to check whether the features are positively or negatively correlated with the output labels and drop features accordingly, as in the sketch below. E.g.: Information Gain, Fisher's Score, etc.
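As a minimal sketch of a filter method, scikit-learn's mutual information scorer corresponds to the information-gain criterion named above; the data here is hypothetical.

import numpy as np
from sklearn.feature_selection import mutual_info_classif

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4))    # hypothetical features
y = (X[:, 1] + 0.1 * rng.normal(size=200) > 0).astype(int)  # driven by feature 1

# Score every feature against the labels; higher means more informative.
scores = mutual_info_classif(X, y, random_state=0)
print(scores)                    # feature 1 should get the highest score

# Keep only the features whose score clears a chosen threshold.
keep = scores > 0.05
print(X[:, keep].shape)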
Cont.
• Wrapper Method: We split our features into subsets and train a model on each subset. Based on the model's performance, we add or remove features and train the model again. The method forms the subsets using a greedy approach and evaluates the accuracy of the candidate feature combinations, as in the sketch below. E.g.: Forward Selection, Backward Elimination, etc.
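A minimal sketch of Forward Selection (the wrapper style named above), using scikit-learn's SequentialFeatureSelector around a logistic regression; the data is hypothetical.

import numpy as np
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
X = rng.normal(size=(150, 6))              # hypothetical features
y = ((X[:, 0] + X[:, 3]) > 0).astype(int)  # depends on features 0 and 3

# Greedily add the feature that most improves cross-validated accuracy,
# retraining the wrapped model at every step (forward selection).
selector = SequentialFeatureSelector(
    LogisticRegression(), n_features_to_select=2, direction="forward", cv=3
)
selector.fit(X, y)
print(selector.get_support())              # features 0 and 3 should be selected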
Cont.
• Intrinsic Method: This method combines the qualities of both the Filter and Wrapper methods: the selection happens inside the model's own training, producing the best subset as a by-product, as in the sketch below.
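As a minimal sketch of an intrinsic method, L1-regularized (Lasso) regression is a standard example of selection happening during training itself; the data below is hypothetical.

import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(3)
X = rng.normal(size=(200, 5))    # hypothetical features
y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + 0.1 * rng.normal(size=200)

# The L1 penalty drives the coefficients of unhelpful features to
# exactly zero while the model trains, selecting features implicitly.
model = Lasso(alpha=0.1).fit(X, y)
print(model.coef_)                          # near zero for features 1, 3, 4
print(np.flatnonzero(model.coef_ != 0))     # likely [0, 2]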
How to choose a feature selection model?
• How do we know which feature selection model will work for our problem? The process is relatively simple: the choice depends on the types of the input and output variables.
Variables are of two main types:
• Numerical Variables: integers and floating-point numbers.
• Categorical Variables: labels, strings, Boolean variables, etc.
Cont.
• Based on whether we have numerical or categorical variables as inputs and outputs, we can choose our feature selection model as follows:
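The original slide shows this as a chart. A commonly cited mapping for filter-style statistical measures (the exact chart on the slide may differ) is:

Input variable   Output variable   Commonly suggested measure
Numerical        Numerical         Pearson's correlation (linear), Spearman's rank (nonlinear)
Numerical        Categorical       ANOVA F-test (linear), Kendall's rank (nonlinear)
Categorical      Numerical         ANOVA F-test (reversed), Kendall's rank (reversed)
Categorical      Categorical       Chi-squared test, Mutual information

As a minimal sketch of the last row (categorical inputs, categorical output), scikit-learn's chi-squared scorer can rank the features; the data below is hypothetical.

import numpy as np
from sklearn.feature_selection import SelectKBest, chi2

rng = np.random.default_rng(4)
# Hypothetical categorical inputs, encoded as non-negative integers
# (chi2 requires non-negative values), and a categorical output label.
X = rng.integers(0, 3, size=(100, 4))
y = (X[:, 2] > 0).astype(int)

# Chi-squared test: a common filter for categorical -> categorical.
selector = SelectKBest(score_func=chi2, k=1).fit(X, y)
print(selector.get_support())   # feature 2 should be the one selected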
Thank you