Support Vector Machine (SVM) Algorithm

Last Updated : 27 Jan, 2025

Support Vector Machine (SVM) is a supervised machine learning algorithm used for
classification and regression tasks. While it can handle regression problems, SVM is particularly
well-suited for classification tasks.

SVM aims to find the optimal hyperplane in an N-dimensional space to separate data points into
different classes. The algorithm maximizes the margin between the closest points of different
classes.

Support Vector Machine (SVM) Terminology

Hyperplane: A decision boundary separating different classes in feature space, represented by the equation $w^T x + b = 0$ in linear classification.

Support Vectors: The closest data points to the hyperplane, crucial for determining the
hyperplane and margin in SVM.

Margin: The distance between the hyperplane and the support vectors. SVM aims to maximize
this margin for better classification performance.

Kernel: A function that maps data to a higher-dimensional space, enabling SVM to handle non-
linearly separable data.

Hard Margin: A maximum-margin hyperplane that perfectly separates the data without
misclassifications.

Soft Margin: Allows some misclassifications by introducing slack variables, balancing margin
maximization and misclassification penalties when data is not perfectly separable.

C: A regularization term balancing margin maximization and misclassification penalties. A higher C value enforces a stricter penalty for misclassifications.

Hinge Loss: A loss function penalizing misclassified points or margin violations, combined with
regularization in SVM.

Dual Problem: Involves solving for Lagrange multipliers associated with support vectors,
facilitating the kernel trick and efficient computation.
How does Support Vector Machine Algorithm Work?

The key idea behind the SVM algorithm is to find the hyperplane that best separates two
classes by maximizing the margin between them. This margin is the distance from the
hyperplane to the nearest data points (support vectors) on each side.

[Figure: Multiple hyperplanes separating the data from two classes]

The best hyperplane, known as the maximum-margin or “hard margin” hyperplane, is the one that maximizes the distance between the hyperplane and the nearest data points from both classes. This ensures a clear separation between the classes. So, from the figure above, we choose L2 as the hard-margin hyperplane.
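As an illustrative sketch (not part of the original article), the snippet below fits a linear SVM on a small synthetic dataset with a large C value to approximate a hard margin, then prints the support vectors and the resulting margin width; the dataset and parameter values are assumptions chosen for demonstration.

Python

# Minimal sketch: fit a near-hard-margin linear SVM on synthetic blobs and
# inspect the support vectors and the margin width 2 / ||w||.
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=40, centers=2, random_state=6)  # well-separated blobs

clf = SVC(kernel="linear", C=1000)  # a large C approximates a hard margin
clf.fit(X, y)

w = clf.coef_[0]       # normal vector of the separating hyperplane
b = clf.intercept_[0]  # bias term
print("Support vectors:\n", clf.support_vectors_)
print("Margin width:", 2 / np.linalg.norm(w))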

Let’s consider a scenario like the one shown below:

[Figure: Selecting a hyperplane for data with an outlier]

Here, we have one blue ball inside the region of the red balls.

How does SVM classify the data?

The blue ball inside the region of the red ones is an outlier of the blue class. The SVM algorithm can ignore this outlier and still find the hyperplane that maximizes the margin, which makes SVM robust to outliers.

[Figure: The most optimized hyperplane]

A soft margin allows for some misclassifications or violations of the margin to improve
generalization. The SVM optimizes the following equation to balance margin maximization and
penalty minimization:

$\text{Objective Function} = \frac{1}{\text{margin}} + \lambda \sum \text{penalty}$

The penalty used for violations is often hinge loss, which has the following behavior:

If a data point is correctly classified and lies outside the margin, there is no penalty (loss = 0).

If a point is incorrectly classified or violates the margin, the hinge loss increases proportionally
to the distance of the violation.
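This behaviour can be sketched with a few lines of NumPy (an illustrative example, not from the original article), where the label t is +1 or -1 and the score is the raw value of $w^T x + b$:

Python

# Hinge loss: zero when t * score >= 1 (correct side, outside the margin),
# growing linearly with the size of the violation otherwise.
import numpy as np

def hinge_loss(t, score):
    return np.maximum(0, 1 - t * score)

print(hinge_loss(+1, 2.5))   # 0.0 -> correctly classified, outside the margin
print(hinge_loss(+1, 0.4))   # 0.6 -> correct side, but inside the margin
print(hinge_loss(+1, -1.0))  # 2.0 -> misclassified, larger penalty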

So far, we have been talking about linearly separable data (the groups of blue balls and red balls can be separated by a straight line).

What to do if data are not linearly separable?

When data is not linearly separable (i.e., it can’t be divided by a straight line), SVM uses a
technique called kernels to map the data into a higher-dimensional space where it becomes
separable. This transformation helps SVM find a decision boundary even for non-linear data.

[Figure: Original 1D dataset for classification]

A kernel is a function that maps data points into a higher-dimensional space without explicitly
computing the coordinates in that space. This allows SVM to work efficiently with non-linear
data by implicitly performing the mapping.

For example, consider data points that are not linearly separable. By applying a kernel function,
SVM transforms the data points into a higher-dimensional space where they become linearly
separable.
Linear Kernel: For linear separability.

Polynomial Kernel: Maps data into a polynomial space.

Radial Basis Function (RBF) Kernel: Transforms data into a space based on distances between
data points.

[Figure: Mapping 1D data to 2D to make the two classes separable]

In this case, the new variable y is created as a function of distance from the origin.
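A minimal sketch of this idea follows (the data values and the y = x² mapping below are assumptions for illustration): the 1D points cannot be separated by a single threshold, but after adding the squared distance from the origin as a second feature, a linear SVM separates them.

Python

# Explicitly map 1D data to 2D with y = x**2, then fit a linear SVM.
import numpy as np
from sklearn.svm import SVC

x = np.array([-4.0, -3.0, -2.0, 2.0, 3.0, 4.0, -1.0, 0.0, 1.0])
labels = np.array([1, 1, 1, 1, 1, 1, 0, 0, 0])  # outer points vs. inner points

X_2d = np.column_stack([x, x ** 2])             # explicit feature mapping
clf = SVC(kernel="linear").fit(X_2d, labels)

print(clf.predict([[0.5, 0.25], [3.5, 12.25]]))  # expected: [0 1]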

Mathematical Computation: SVM

Consider a binary classification problem with two classes, labeled as +1 and -1. We have a
training dataset consisting of input feature vectors X and their corresponding class labels Y.

The equation for the linear hyperplane can be written as:

$w^T x + b = 0$

Where:

$w$ is the normal vector to the hyperplane (the direction perpendicular to it).

$b$ is the offset or bias term, representing the distance of the hyperplane from the origin along the normal vector $w$.

Distance from a Data Point to the Hyperplane

The distance between a data point $x_i$ and the decision boundary can be calculated as:

$d_i = \frac{w^T x_i + b}{\|w\|}$

where $\|w\|$ is the Euclidean norm of the weight vector $w$.
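As a quick numeric check (the values of w, b, and the point below are hypothetical), the distance can be computed directly:

Python

# Evaluate d_i = (w^T x_i + b) / ||w|| for an assumed hyperplane and point.
import numpy as np

w = np.array([2.0, 1.0])    # hypothetical normal vector
b = -4.0                    # hypothetical bias term
x_i = np.array([3.0, 2.0])  # a sample data point

d_i = (w @ x_i + b) / np.linalg.norm(w)
print(d_i)  # (6 + 2 - 4) / sqrt(5) ≈ 1.79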

Linear SVM Classifier

For a linear SVM classifier, the prediction for a data point $x$ is:

$\hat{y} = \begin{cases} 1 & \text{if } w^T x + b \geq 0 \\ 0 & \text{if } w^T x + b < 0 \end{cases}$

Where $\hat{y}$ is the predicted label of the data point.
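This prediction rule can be written as a small helper function (an illustrative sketch; w and b stand for whatever values a trained model produced):

Python

# Predict 1 if the point lies on or above the hyperplane, 0 otherwise.
import numpy as np

def predict(x, w, b):
    return 1 if np.dot(w, x) + b >= 0 else 0

w, b = np.array([2.0, 1.0]), -4.0           # hypothetical trained parameters
print(predict(np.array([3.0, 2.0]), w, b))  # 1 (w^T x + b = 4 >= 0)
print(predict(np.array([0.5, 0.5]), w, b))  # 0 (w^T x + b = -2.5 < 0)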

Optimization Problem for SVM

For a linearly separable dataset, the goal is to find the hyperplane that maximizes the margin
between the two classes while ensuring that all data points are correctly classified. This leads
to the following optimization problem:

$\underset{w,b}{\text{minimize}} \; \frac{1}{2}\left\| w \right\|^{2}$

Subject to the constraint:

$y_i(w^T x_i + b) \geq 1 \quad \text{for } i = 1, 2, 3, \cdots, m$

Where:

$y_i$ is the class label (+1 or -1) for each training instance.

$x_i$ is the feature vector for the $i$-th training instance.

$m$ is the total number of training instances.


The condition $y_i (w^T x_i + b) \geq 1$ ensures that each data point is correctly classified and lies outside the margin.

Soft Margin Linear SVM Classifier

In the presence of outliers or non-separable data, the SVM allows some misclassification by introducing slack variables $\zeta_i$. The optimization problem is modified as:

$\underset{w, b}{\text{minimize}} \; \frac{1}{2} \|w\|^2 + C \sum_{i=1}^{m} \zeta_i$

Subject to the constraints:

$y_i (w^T x_i + b) \geq 1 - \zeta_i \quad \text{and} \quad \zeta_i \geq 0 \quad \text{for } i = 1, 2, \dots, m$

Where:

$C$ is a regularization parameter that controls the trade-off between margin maximization and the penalty for misclassifications.

$\zeta_i$ are slack variables that represent the degree of violation of the margin by each data point.
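A short sketch of the role of C follows (the synthetic dataset and C values are assumptions chosen for illustration): with overlapping classes, a small C tolerates more margin violations, while a large C penalizes them heavily, typically leaving fewer support vectors.

Python

# Compare the number of support vectors for different regularization strengths C.
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=2, n_redundant=0,
                           n_clusters_per_class=1, flip_y=0.1, random_state=0)

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    print(f"C={C}: {len(clf.support_)} support vectors")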

Dual Problem for SVM

The dual problem involves maximizing the dual objective with respect to the Lagrange multipliers associated with the support vectors. This transformation allows the SVM optimization to be solved with kernel functions, enabling non-linear classification.

The dual objective function is given by:

$\underset{\alpha}{\text{maximize}} \; \sum_{i=1}^{m} \alpha_i - \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m} \alpha_i \alpha_j t_i t_j K(x_i, x_j)$
Where:

$\alpha_i$ are the Lagrange multipliers associated with the $i$-th training sample.

$t_i$ is the class label for the $i$-th training sample (+1 or -1).

$K(x_i, x_j)$ is the kernel function that computes the similarity between data points $x_i$ and $x_j$. The kernel allows SVM to handle non-linear classification problems by mapping data into a higher-dimensional space.

The dual formulation optimizes the Lagrange multipliers $\alpha_i$, and the support vectors are those training samples with $\alpha_i > 0$.
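In scikit-learn, a fitted SVC object exposes this dual solution directly; the small sketch below (synthetic data assumed) shows where the support vector indices, the products $\alpha_i t_i$, and the bias term live.

Python

# Inspect the dual solution of a fitted SVC: indices, coefficients, and bias.
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

X, y = make_blobs(n_samples=60, centers=2, random_state=0)
clf = SVC(kernel="rbf", C=1.0).fit(X, y)

print("Indices of support vectors:", clf.support_)
print("alpha_i * t_i per support vector:", clf.dual_coef_)
print("Bias term b:", clf.intercept_)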

SVM Decision Boundary

Once the dual problem is solved, the decision function for a test data point $x$ is given by:

$f(x) = \sum_{i=1}^{m} \alpha_i t_i K(x_i, x) + b$

Where $\alpha_i$ are the learned Lagrange multipliers, $t_i$ are the class labels, $b$ is the bias term, and the sign of $f(x)$ gives the predicted class.

Finally, the bias term $b$ is determined from the support vectors, which satisfy:

$t_i (w^T x_i + b) = 1 \quad \Rightarrow \quad b = t_i - w^T x_i$

Where $x_i$ is any support vector.

This completes the mathematical framework of the Support Vector Machine algorithm, which
allows for both linear and non-linear classification using the dual problem and kernel trick.
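As a hedged sanity check (synthetic data and parameter values assumed, not taken from the article), the dual expansion above can be reproduced from a fitted scikit-learn SVC and compared with its built-in decision_function:

Python

# decision_function(x) should equal sum_i (alpha_i * t_i) K(x_i, x) + b
# over the support vectors of a fitted RBF SVC.
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.metrics.pairwise import rbf_kernel
from sklearn.svm import SVC

X, y = make_blobs(n_samples=60, centers=2, random_state=0)
clf = SVC(kernel="rbf", gamma=0.5, C=1.0).fit(X, y)

x_test = X[:5]
K = rbf_kernel(clf.support_vectors_, x_test, gamma=0.5)  # K(x_i, x)
manual = clf.dual_coef_ @ K + clf.intercept_             # dual expansion
print(np.allclose(manual.ravel(), clf.decision_function(x_test)))  # True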
Types of Support Vector Machine

Based on the nature of the decision boundary, Support Vector Machines (SVM) can be divided
into two main parts:

Linear SVM: Linear SVMs use a linear decision boundary to separate the data points of different
classes. When the data can be precisely linearly separated, linear SVMs are very suitable. This
means that a single straight line (in 2D) or a hyperplane (in higher dimensions) can entirely
divide the data points into their respective classes. A hyperplane that maximizes the margin
between the classes is the decision boundary.

Non-Linear SVM: A non-linear SVM is used when the data cannot be separated by a straight line (in the 2D case). By using kernel functions, non-linear SVMs can handle non-linearly separable data: the original input data is transformed into a higher-dimensional feature space where the data points become linearly separable. A linear hyperplane found in this transformed space corresponds to a non-linear decision boundary in the original space.
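A brief comparison sketch (the make_moons dataset and parameter choices are assumptions for illustration): on data with a curved class boundary, a linear SVM typically underfits, while an RBF-kernel SVM separates the classes well.

Python

# Compare linear and RBF-kernel SVMs on non-linearly separable data.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.2, random_state=42)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=42)

linear_svm = SVC(kernel="linear").fit(X_tr, y_tr)
rbf_svm = SVC(kernel="rbf", gamma=2.0).fit(X_tr, y_tr)

print("Linear SVM accuracy:", linear_svm.score(X_te, y_te))
print("RBF SVM accuracy:   ", rbf_svm.score(X_te, y_te))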

Implementing SVM Algorithm in Python

Predict whether a tumor is benign or malignant. Historical data about patients diagnosed with cancer, described by a set of independent attributes, allows the model to distinguish malignant cases from benign ones.

Load the breast cancer dataset from sklearn.datasets

Separate input features and target variables.

Build and train an SVM classifier using the RBF kernel.

Plot the scatter plot of the input features.

Python

# Load the important packages
from sklearn.datasets import load_breast_cancer
import matplotlib.pyplot as plt
from sklearn.inspection import DecisionBoundaryDisplay
from sklearn.svm import SVC

# Load the dataset and keep the first two features for a 2D plot
cancer = load_breast_cancer()
X = cancer.data[:, :2]
y = cancer.target

# Build the model
svm = SVC(kernel="rbf", gamma=0.5, C=1.0)

# Train the model
svm.fit(X, y)

# Plot the decision boundary
DecisionBoundaryDisplay.from_estimator(
    svm,
    X,
    response_method="predict",
    cmap=plt.cm.Spectral,
    alpha=0.8,
    xlabel=cancer.feature_names[0],
    ylabel=cancer.feature_names[1],
)

# Scatter plot of the input features
plt.scatter(X[:, 0], X[:, 1], c=y, s=20, edgecolors="k")
plt.show()

Output:

[Figure: Breast cancer classification with an SVM RBF kernel]
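As an optional extension of the example above (not in the original article), the same kind of model can be evaluated on a held-out test set using all 30 features rather than only the first two; the split parameters below are arbitrary choices.

Python

# Train on a split of the full feature set and report held-out accuracy.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

cancer = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    cancer.data, cancer.target, test_size=0.2, random_state=42)

svm = SVC(kernel="rbf", gamma="scale", C=1.0).fit(X_train, y_train)
print("Test accuracy:", svm.score(X_test, y_test))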

Advantages of Support Vector Machine (SVM)

High-Dimensional Performance: SVM excels in high-dimensional spaces, making it suitable for image classification and gene expression analysis.

Nonlinear Capability: Utilizing kernel functions like RBF and polynomial, SVM effectively handles
nonlinear relationships.

Outlier Resilience: The soft margin feature allows SVM to ignore outliers, enhancing robustness
in spam detection and anomaly detection.

Binary and Multiclass Support: SVM is effective for both binary classification and multiclass
classification, suitable for applications in text classification.

Memory Efficiency: SVM focuses on support vectors, making it memory efficient compared to
other algorithms.

Disadvantages of Support Vector Machine (SVM)

Slow Training: SVM can be slow to train on large datasets, which limits its use in large-scale data mining tasks.

Parameter Tuning Difficulty: Selecting the right kernel and adjusting parameters like C requires careful tuning, which can be time-consuming.

Noise Sensitivity: SVM struggles with noisy datasets and overlapping classes, limiting
effectiveness in real-world scenarios.

Limited Interpretability: The complexity of the hyperplane in higher dimensions makes SVM less
interpretable than other models.

Feature Scaling Sensitivity: Proper feature scaling is essential; otherwise, SVM models may perform poorly (see the scaling sketch after this list).
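A hedged sketch of how scaling sensitivity is usually addressed (reusing the breast cancer data from the example above; the pipeline setup is an assumption, not part of the original article): standardizing features before the SVM typically improves cross-validated accuracy for the RBF kernel.

Python

# Compare cross-validated accuracy with and without feature standardization.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

cancer = load_breast_cancer()

unscaled = SVC(kernel="rbf")
scaled = make_pipeline(StandardScaler(), SVC(kernel="rbf"))

print("Without scaling:", cross_val_score(unscaled, cancer.data, cancer.target).mean())
print("With scaling:   ", cross_val_score(scaled, cancer.data, cancer.target).mean())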
Support Vector Machine (SVM) Algorithm- FAQs

How does SVM work in machine learning?

SVM works by finding the maximum-margin hyperplane that best separates the data points of
different classes. It uses support vectors, which are the closest data points to the hyperplane, to
define this boundary.

What are the key advantages of using SVM in machine learning?

SVMs are effective for high-dimensional data, robust to outliers, and versatile due to kernel
functions, allowing them to handle both linear and nonlinear relationships.

What is the difference between hard margin and soft margin SVM?

A hard margin SVM perfectly separates classes without misclassification, while a soft margin
SVM allows some misclassifications to better accommodate outliers, balancing the margin and
penalties.

What types of kernel functions are used in SVM?

Common kernel functions in SVM include linear, polynomial, radial basis function (RBF), and
sigmoid, each mapping input data into higher-dimensional spaces for better separation.

When should I use SVM in data mining?

Use SVM in data mining when dealing with complex datasets, especially when you need to
classify data with high dimensions, non-linear boundaries, or when robustness to outliers is
important.
