SYSC4415
Introduction to Machine Learning
Lecture 5
Prof James Green
jrgreen@sce.carleton.ca
Systems and Computer Engineering, Carleton University
Learning Objectives for Lecture 5
• Understand how SVM and KNN models classify data.
• Understand how to train an SVM model from labelled data.
• Introduce concept of SVM kernel to achieve nonlinear decision boundary.
• Define a "support vector"
• Be able to classify data by hand using a KNN classifier.
Pre-lecture Assignment:
• Read Section 3.4-3.5 of the 100pMLB
In-class activities:
• Poll Everywhere review questions
• Review fundamental equations governing SVM
• Discuss impact of kernel function
• Implement a KNN classifier by hand using sample data
Key terms
• Hinge loss function, kernel trick, kernel functions (kernels), RBF
kernel, Euclidean distance, k-Nearest Neighbors, cosine similarity,
support vector.
Support Vector Machines - Review
• Review of support vector machines (SVM)
Note that 100pMLbook doesn’t
show dot product
f(x) = sign(wᵀx - b) = sign(w·x - b)
Learn w, b such that: y_i(w·x_i - b) ≥ 1 for every training example, while minimizing ||w||
How is this equivalent to that?
Why minimize ||w||?
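As a quick check of the review above, here is a minimal sketch (assuming NumPy and scikit-learn are available; the toy data points are made up for illustration) that fits a linear SVM and confirms its predictions match sign(w·x - b):
```python
# Minimal sketch: fit a linear SVM and verify f(x) = sign(w.x - b).
# Assumes scikit-learn and NumPy are installed; the toy data is made up.
import numpy as np
from sklearn.svm import SVC

# Two linearly separable clusters with labels -1 and +1
X = np.array([[1.0, 1.0], [2.0, 1.5], [1.5, 2.0],
              [5.0, 5.0], [6.0, 5.5], [5.5, 6.0]])
y = np.array([-1, -1, -1, 1, 1, 1])

clf = SVC(kernel="linear", C=1.0).fit(X, y)

w = clf.coef_[0]          # learned weight vector w
b = -clf.intercept_[0]    # scikit-learn's decision function is w.x + intercept, so b = -intercept

f = np.sign(X @ w - b)       # f(x) = sign(w.x - b)
print(f)                     # matches clf.predict(X): [-1 -1 -1  1  1  1]
print(clf.predict(X))
print(clf.support_vectors_)  # the support vectors are the points that define the margin
```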
Support Vector Machines – new concepts
• Data can be linearly inseparable due to 1) noise or 2) inherent structure
• Hinge loss function: max(0, 1 - y_i(w·x_i - b))
  • = 0 if the prediction is right (x_i lies on the correct side of the margin)
  • > 0 if the prediction is wrong
• Cost function (L2 regularization + hinge loss):
  C||w||^2 + (1/N) Σ_i max(0, 1 - y_i(w·x_i - b))
• Hyperparameter C determines the trade-off between increasing the size of the decision boundary and correct classification of the training data; it regulates empirical risk
  • Large C → width of margin matters most
  • Small C → classifying the training data matters more, generalization suffers => TRADE-OFF
• Soft-margin SVM: optimizes the hinge-loss cost function
• Hard-margin SVM: no hinge loss
• Inherent non-linearity: implicitly transform the original space into a higher dimension during the cost function optimization. This is called the kernel trick.
1) Noisy/mislabelled data
• Use hinge loss function to allow for incorrectly classified training data
• Hinge loss: max(0, 1 - y_i(w·x_i - b))
  • For correctly classified samples (beyond the margin), 1 - y_i(w·x_i - b) < 0, so the max returns 0: these points contribute nothing to the cost
  • For incorrectly classified samples, the penalty is positive and grows with distance from the margin, so misclassified points dominate the cost
• Minimize ||w|| to maximize the margin
• Minimize the hinge loss, i.e. for each misclassified sample, minimize its distance from the decision margin
• Large C → width of margin matters most; small C → classifying the training data matters more
• Hard vs. soft margin SVM
• C hyperparameter
• Either way, solve the 'optimization under constraints' problem using Lagrange multipliers and then quadratic programming
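To see how the hinge loss behaves, here is a minimal sketch (NumPy assumed; the weights, bias and sample points are made-up illustration values) that evaluates the soft-margin cost C||w||^2 + (1/N) Σ max(0, 1 - y_i(w·x_i - b)) for a fixed w and b:
```python
# Minimal sketch: evaluate the soft-margin SVM cost for fixed w, b.
# All numbers below are made up for illustration.
import numpy as np

X = np.array([[3.0, 3.0],     # correctly classified, beyond the margin
              [1.2, 1.2],     # correctly classified, but inside the margin
              [-1.0, -1.0]])  # misclassified
y = np.array([1, 1, 1])
w = np.array([1.0, 1.0])
b = 2.0
C = 0.1

margins = y * (X @ w - b)              # y_i * (w.x_i - b)
hinge = np.maximum(0.0, 1.0 - margins)
print(hinge)   # [0.  0.6 5. ]  -> the point beyond the margin contributes nothing

cost = C * np.dot(w, w) + hinge.mean()
print(cost)    # ~2.07 = 0.1*||w||^2 + average hinge loss
```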
2) Inherent structure of data
• Can introduce nonlinear mapping to map data to higher dimensional
space where data become linearly separable
• “Kernel trick” (computational trick)
• It turns out that we only need to compute dot products between pairs of
vectors (training/testing instances) in the new space, and not the actual
transformed vectors explicitly
• Therefore, define a “kernel function” to bypass mapping:
From: http://www.eric-kim.net/eric-kim-net/posts/1/kernel_trick.html
For x, y ∈ R^N: K(x, y) = ⟨φ(x), φ(y)⟩,
where ⟨·, ·⟩ is an inner product in R^M, M > N,
and φ transforms R^N to R^M: φ: R^N → R^M
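A minimal sketch of this idea, using the degree-2 polynomial kernel K(x, y) = (x·y)^2 on R^2 and the standard explicit feature map φ(x) = (x1^2, √2·x1·x2, x2^2); this particular φ is assumed here for illustration rather than taken from the slide:
```python
# Minimal sketch of the kernel trick for K(x, y) = (x.y)^2 on R^2,
# whose explicit feature map is phi(x) = (x1^2, sqrt(2)*x1*x2, x2^2) in R^3 (M=3 > N=2).
import numpy as np

def phi(x):
    """Explicit mapping R^2 -> R^3 (never needed in practice)."""
    return np.array([x[0]**2, np.sqrt(2) * x[0] * x[1], x[1]**2])

def K(x, y):
    """Kernel: the same inner product, computed directly in R^2."""
    return np.dot(x, y) ** 2

x = np.array([1.0, 2.0])
y = np.array([3.0, 4.0])

print(np.dot(phi(x), phi(y)))  # 121.0 -> inner product in the transformed space
print(K(x, y))                 # 121.0 -> same value, without ever computing phi
```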
2) Inherent structure of data (example)
From http://www.eric-kim.net/eric-kim-net/posts/1/kernel_trick.html
2) Inherent structure of data (example cont'd)
From http://www.eric-kim.net/eric-kim-net/posts/1/kernel_trick.html
2) Inherent structure of data
Common kernels:
1) Linear kernel: K(x, x') = x·x'
No mapping to higher dimensional space…
2) Polynomial kernel: K(x, x') = (x·x' + 1)^d (dth-order polynomial)
3) Radial basis function kernel: K(x, x') = exp(-||x - x'||^2 / (2σ^2))
• ||x - x'||^2 is the Euclidean distance (squared)
• Infinite dimensions, but chance of overfitting only depends on # of support vectors
• Varying σ controls smooth/curvy nature of decision boundary
• For the math lovers, https://www.youtube.com/watch?v=_PwhiWxHK8o shows a near-intuitive
derivation of SVMs (50 min MIT grad lecture)
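For illustration, a minimal sketch (assuming scikit-learn; the dataset parameters are made up) comparing a linear kernel and an RBF kernel on data that is not linearly separable in the original 2-D space:
```python
# Minimal sketch: linear vs. RBF kernel SVM on an inner cluster surrounded by a ring.
# Assumes scikit-learn is installed; dataset parameters are made up for illustration.
import numpy as np
from sklearn.svm import SVC
from sklearn.datasets import make_circles

X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear_svm = SVC(kernel="linear", C=1.0).fit(X, y)
rbf_svm = SVC(kernel="rbf", C=1.0, gamma=2.0).fit(X, y)  # gamma plays the role of 1/(2*sigma^2)

print("linear kernel accuracy:", linear_svm.score(X, y))  # ~0.5, no linear boundary works
print("RBF kernel accuracy:   ", rbf_svm.score(X, y))     # ~1.0, nonlinear boundary found
print("support vectors per class:", rbf_svm.n_support_)
```
Increasing gamma (i.e. shrinking σ) makes the decision boundary curvier; decreasing it makes the boundary smoother.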
K-Nearest Neighbour classifier
• Non-parametric
• Vote among classes of K nearest training points to test point
• Hyperparameters:
• K, # neighbours to examine
• Choice of distance metric
• City block distance, Euclidean distance, Mahalanobis distance, etc.
• Cosine similarity often used for comparing two documents (binary feature vectors):
cos(x, x') = (x·x') / (||x|| ||x'||)
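A minimal sketch of the cosine similarity computation for two made-up binary bag-of-words vectors (NumPy assumed):
```python
# Minimal sketch: cosine similarity between two binary document vectors.
# The example vectors are made up for illustration.
import numpy as np

def cosine_similarity(a, b):
    """cos(a, b) = (a.b) / (||a|| * ||b||)"""
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

doc1 = np.array([1, 1, 0, 1, 0])  # which vocabulary words appear in document 1
doc2 = np.array([1, 0, 0, 1, 1])  # which vocabulary words appear in document 2

print(cosine_similarity(doc1, doc2))  # 2 / (sqrt(3)*sqrt(3)) = 0.666...
```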
K-Nearest Neighbour Example
• Consider the following table of training data:
Weight Length Fish Type
4 19 Salmon
6 17 Salmon
8 15 Salmon
8 12 Sea Bass
7 13 Sea Bass
9 15 Sea Bass
• For K=3, use city-block distance to classify the following test point:
• Weight = 8, length = 12
[Hand-drawn plot: weight vs. length scatter of the Salmon and Sea Bass training points, plus the unknown test point at weight = 8, length = 12]
• Among the K = 3 nearest neighbours: 2 neighbours = Sea Bass, 1 neighbour = Salmon
• Majority vote → classify the unknown fish as Sea Bass
• Note: if K equals the number of training points, you don't have to compute any distances at all; the vote always returns the majority class (e.g. if there are more apples than oranges in the training set, the prediction will always be apple)
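A minimal sketch (plain Python) that reproduces this example: compute city-block distances from the test point (weight = 8, length = 12), then take a majority vote among the K = 3 nearest training points:
```python
# Minimal sketch: K-NN by hand with city-block (L1) distance, using the table above.
from collections import Counter

train = [
    ((4, 19), "Salmon"),
    ((6, 17), "Salmon"),
    ((8, 15), "Salmon"),
    ((8, 12), "Sea Bass"),
    ((7, 13), "Sea Bass"),
    ((9, 15), "Sea Bass"),
]
test = (8, 12)
K = 3

def cityblock(a, b):
    """L1 (city-block) distance between two feature vectors."""
    return sum(abs(ai - bi) for ai, bi in zip(a, b))

# Sort the training points by distance to the test point and vote among the K nearest
neighbours = sorted(train, key=lambda item: cityblock(item[0], test))[:K]
print(neighbours)  # distances 0, 2, 3 -> two Sea Bass, one Salmon
print(Counter(label for _, label in neighbours).most_common(1)[0][0])  # Sea Bass
```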