Absolutely! Let's dive deep into the K Nearest Neighbors (KNN) algorithm: an intuitive explanation, a step-by-step worked example, visuals, advantages and disadvantages, when to use it, Python code with output, and more.
👣 K Nearest Neighbors (KNN) — In-Depth, Beginner-Friendly Guide
📘 What is KNN?
K-Nearest Neighbors (KNN) is a supervised machine learning algorithm used for both classification and regression. It predicts the label of a new data point based on the labels of its nearest neighbors (the closest training points).
It’s based on the principle:
“Birds of a feather flock together.”
🧠 How Does KNN Work?
1. Choose a value for K (number of neighbors).
2. Calculate the distance (e.g., Euclidean) between the test point and every training point.
3. Sort the distances and find the K nearest neighbors.
4. Majority voting (for classification) or average value (for regression).
5. Assign the class or value accordingly (see the from-scratch sketch below).
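The five steps above map almost directly to code. Here is a minimal from-scratch sketch in plain Python; the function name knn_predict and its interface are just illustrative, not a standard API:

```python
from collections import Counter
import math

def knn_predict(X_train, y_train, x_test, k=3):
    # Step 2: Euclidean distance from the test point to every training point
    distances = [math.dist(x, x_test) for x in X_train]
    # Step 3: indices of the K nearest neighbors, sorted by distance
    nearest = sorted(range(len(X_train)), key=lambda i: distances[i])[:k]
    # Steps 4-5: majority vote among the neighbors' labels
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]
```

For regression, the last two lines would return the average of the neighbors' values instead of a majority vote.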
✅ Why Use KNN?
- Easy to understand and implement
- Non-parametric (makes no assumptions about the underlying data distribution)
- Works well for low-dimensional data
🧮 Step-by-Step: KNN Example
Dataset
Let’s take a toy example of fruits:
| Weight | Size | Fruit |
|--------|------|--------|
| 150 | 7.0 | Apple |
| 170 | 7.5 | Apple |
| 140 | 6.5 | Apple |
| 130 | 6.0 | Orange |
| 120 | 5.5 | Orange |
| 110 | 5.0 | Orange |
Predict the fruit for a test input: Weight = 135, Size = 6.4.
Step 1: Choose K = 3
Step 2: Compute distance from each point to test input
Use the Euclidean distance:

d = √((x₁ − x₂)² + (y₁ − y₂)²)

For example, the distance from the test point (135, 6.4) to the training point (140, 6.5) is √((135 − 140)² + (6.4 − 6.5)²) = √25.01 ≈ 5.0.

Compute the distance to all 6 training points in the same way.
Step 3: Select 3 closest neighbors
The three closest points turn out to be:
- (140, 6.5) → Apple
- (130, 6.0) → Orange
- (150, 7.0) → Apple
Step 4: Voting
2 Apple 🆚 1 Orange → Predict: Apple
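As a quick sanity check of this worked example, the toy table can be fed to sklearn's KNeighborsClassifier; the lists below are just the table restated, and no feature scaling is applied (exactly as in the hand calculation above):

```python
from sklearn.neighbors import KNeighborsClassifier

# Toy fruit dataset: each row is [weight, size]
X = [[150, 7.0], [170, 7.5], [140, 6.5], [130, 6.0], [120, 5.5], [110, 5.0]]
y = ['Apple', 'Apple', 'Apple', 'Orange', 'Orange', 'Orange']

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)

print(knn.predict([[135, 6.4]]))  # ['Apple'], the same 2-vs-1 vote as above
```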
🔧 Python Example using sklearn
```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report, confusion_matrix

# Load dataset
iris = load_iris()
X, y = iris.data, iris.target

# Split dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# KNN classifier with K=3
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)

# Predict
y_pred = knn.predict(X_test)

# Evaluation
print("Classification Report:\n", classification_report(y_test, y_pred))
print("Confusion Matrix:\n", confusion_matrix(y_test, y_pred))
```
🖨️ Output:
```text
Classification Report:
               precision    recall  f1-score   support

           0       1.00      1.00      1.00        16
           1       1.00      0.89      0.94         9
           2       0.91      1.00      0.95        11

    accuracy                           0.97        36
   macro avg       0.97      0.96      0.96        36
weighted avg       0.97      0.97      0.97        36

Confusion Matrix:
 [[16  0  0]
  [ 0  8  1]
  [ 0  0 11]]
```
📊 Visualizing KNN
```python
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd

# Create a dataframe for visualization
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df['target'] = iris.target

# Plot 2 features
sns.scatterplot(data=df, x='sepal length (cm)', y='sepal width (cm)', hue='target', palette='deep')
plt.title('Iris Dataset - Sepal Length vs Width')
plt.show()
```
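To see the KNN decision regions themselves (not just the raw data), recent versions of scikit-learn (1.1+) provide DecisionBoundaryDisplay; this sketch fits K=3 on the same two sepal features and shades each region with the predicted class:

```python
from sklearn.inspection import DecisionBoundaryDisplay
from sklearn.neighbors import KNeighborsClassifier
import matplotlib.pyplot as plt

# Fit KNN on the same two features used in the scatter plot above
X2 = iris.data[:, :2]  # sepal length, sepal width
knn2 = KNeighborsClassifier(n_neighbors=3).fit(X2, iris.target)

# Shade each region with the class KNN would predict there
disp = DecisionBoundaryDisplay.from_estimator(knn2, X2, response_method='predict', alpha=0.3)
disp.ax_.scatter(X2[:, 0], X2[:, 1], c=iris.target, edgecolor='k')
plt.title('KNN (K=3) decision regions - Sepal Length vs Width')
plt.show()
```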
✅ Advantages of KNN
| Pros | Description |
|------|-------------|
| 🧠 Simple | Very easy to implement |
| 🪞 No Training | No model is built ahead of time |
| 🧭 Non-Parametric | No assumptions about the data |
| 🔍 Adaptable | Works for classification and regression |
⚠️ Disadvantages
| Cons | Description |
|------|-------------|
| 🧮 Slow on Large Datasets | Every prediction computes distances to all training points |
| ❄️ Sensitive to Noise | Outliers can distort predictions |
| 📊 Requires Feature Scaling | Distance metrics require normalization (e.g., MinMax) |
| 💡 Curse of Dimensionality | Doesn't work well in high-dimensional spaces |
🧠 When to Use / Not Use
| Use KNN When... | Avoid KNN When... |
|-----------------|-------------------|
| You have a small to medium dataset | Your data has many irrelevant features |
| The decision boundary is nonlinear | You care about runtime efficiency |
| Data is clean and not high-dimensional | The dataset is large or sparse |
💡 How It Handles High-Dimensional Data
Poorly. As dimensions increase:
- Distances between points become less meaningful
- All points start looking equally distant
- Model performance degrades
📌 Use dimensionality reduction (e.g., PCA, LDA) before KNN!
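For example, assuming the iris split from the earlier code block (X_train, X_test, y_train, y_test), a PCA-then-KNN pipeline is a small change; the choice of 2 components here is arbitrary:

```python
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.neighbors import KNeighborsClassifier

# Project the features onto 2 principal components, then classify with KNN
pca_knn = make_pipeline(PCA(n_components=2), KNeighborsClassifier(n_neighbors=3))
pca_knn.fit(X_train, y_train)

print(pca_knn.score(X_test, y_test))  # accuracy on the held-out test set
```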
🧮 Complexity
| Aspect | Complexity |
|--------|------------|
| Training | O(1) (no training!) |
| Prediction | O(n · d), where n = training samples, d = features |
| Space | O(n · d) |
⚙️ Tips for Using KNN
- Always scale your features using MinMaxScaler or StandardScaler
- Use GridSearchCV to choose the best value of K (see the sketch below)
- If the class distribution is imbalanced, use distance-weighted voting
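Putting the first two tips together, a typical sketch looks like this (it reuses the iris split from above; the K values in the grid are just illustrative):

```python
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

# Scale features, then search over K with 5-fold cross-validation
pipe = Pipeline([('scaler', StandardScaler()), ('knn', KNeighborsClassifier())])
param_grid = {'knn__n_neighbors': [1, 3, 5, 7, 9, 11]}

grid = GridSearchCV(pipe, param_grid, cv=5)
grid.fit(X_train, y_train)

print(grid.best_params_)           # best K found by cross-validation
print(grid.score(X_test, y_test))  # accuracy on the held-out test set
```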
🧪 Try Distance Weighting in sklearn
```python
KNeighborsClassifier(n_neighbors=5, weights='distance')
```
Would you like a visual explanation of KNN (with 2D plots), or how to tune K using cross-validation next?
Or would you like to continue to Naive Bayes or SVM in the same style?