0% found this document useful (0 votes)

31 views11 pages

Updated K-Nearest Neighbors in Machine Learning

Uploaded by

cskumar2019

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views11 pages

Updated K-Nearest Neighbors in Machine Learning

Uploaded by

cskumar2019

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Page 1 of 11

Home Whiteboard AI Assistant Online Compilers Jobs Tools Art

SQL HTML CSS Javascript Python Java C C++ PHP Scala C#

K-Nearest Neighbors (KNN) in Machine

Learning

K-Nearest Neighbors (KNN) Algorithm

K-nearest neighbors (KNN) algorithm is a type of supervised ML algorithm which can be
used for both classification as well as regression predictive problems. However, it is
mainly used for classification predictive problems in industry. The main idea behind KNN
is to find the k-nearest data points to a given test data point and use these nearest
neighbors to make a prediction. The value of k is a hyperparameter that needs to be
tuned, and it represents the number of neighbors to consider.

For classification problems, the KNN algorithm assigns the test data point to the class
that appears most frequently among the k-nearest neighbors. In other words, the class
with the highest number of neighbors is the predicted class.

For regression problems, the KNN algorithm assigns the test data point the average of
the k-nearest neighbors' values.

The distance metric used to measure the similarity between two data points is an
essential factor that affects the KNN algorithm's performance. The most commonly used
distance metrics are Euclidean distance, Manhattan distance, and Minkowski distance.

The following two properties would define KNN well −

Lazy learning algorithm − KNN is a lazy learning algorithm because it does not
have a specialized training phase and uses all the data for training while
classification.

Non-parametric learning algorithm − KNN is also a non-parametric learning

algorithm because it doesn't assume anything about the underlying data.

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 1/11
Page 2 of 11

How Does K-Nearest Neighbors Algorithm Work?

K-nearest neighbors (KNN) algorithm uses 'feature similarity' to predict the values of
new datapoints which further means that the new data point will be assigned a value
based on how closely it matches the points in the training set. We can understand its
working with the help of following steps −

Step 1 − For implementing any algorithm, we need dataset. So during the first
step of KNN, we must load the training as well as test data.

Step 2 − Next, we need to choose the value of K i.e. the nearest data points. K
can be any integer.

Step 3 − For each point in the test data do the following −

3.1 − Calculate the distance between test data and each row of training data
with the help of any of the method namely: Euclidean, Manhattan or Hamming
distance. The most commonly used method to calculate distance is Euclidean.
3.2 − Now, based on the distance value, sort them in ascending order.
3.3 − Next, it will choose the top K rows from the sorted array.
3.4 − Now, it will assign a class to the test point based on most frequent class of
these rows.

Step 4 − End

Example

The following is an example to understand the concept of K and working of KNN

algorithm −

Suppose we have a dataset which can be plotted as follows −

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 2/11
Page 3 of 11

Now, we need to classify new data point with black dot (at point 60,60) into blue or red
class. We are assuming K = 3 i.e. it would find three nearest data points. It is shown in
the next diagram −

We can see in the above diagram the three nearest neighbors of the data point with
black dot. Among those three, two of them lies in Red class hence the black dot will also
be assigned in red class.

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 3/11
Page 4 of 11

Building a K Nearest Neighbors Model

We can follow the below steps to build a KNN model −

Load the data − The first step is to load the dataset into memory. This can be
done using various libraries such as pandas or numpy.

Split the data − The next step is to split the data into training and test sets. The
training set is used to train the KNN algorithm, while the test set is used to evaluate
its performance.
Normalize the data − Before training the KNN algorithm, it is essential to
normalize the data to ensure that each feature contributes equally to the distance
metric calculation.

Calculate distances − Once the data is normalized, the KNN algorithm calculates
the distances between the test data point and each data point in the training set.

Select k-nearest neighbors − The KNN algorithm selects the k-nearest neighbors
based on the distances calculated in the previous step.

Make a prediction − For classification problems, the KNN algorithm assigns the
test data point to the class that appears most frequently among the k-nearest
neighbors. For regression problems, the KNN algorithm assigns the test data point
the average of the k-nearest neighbors' values.

Evaluate performance − Finally, the KNN algorithm's performance is evaluated

using various metrics such as accuracy, precision, recall, and F1-score.

Implementation of KNN Algorithm in Python

As we know K-nearest neighbors (KNN) algorithm can be used for both classification as
well as regression. The following are the recipes in Python to use KNN as classifier as
well as regressor −

KNN as Classifier

First, start with importing necessary python packages −

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

Next, download the iris dataset from its weblink as follows −

path = "https://archive.ics.uci.edu/ml/machine-learning-

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 4/11
Page 5 of 11

databases/iris/iris.data"

Next, we need to assign column names to the dataset as follows −

headernames = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width',

'Class']

Now, we need to read dataset to pandas dataframe as follows −

dataset = pd.read_csv(path, names=headernames)

dataset.head()

slno. sepal-length sepal-width petal-length petal-width Class

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

Data Preprocessing will be done with the help of following script lines −

X = dataset.iloc[:, :-1].values
y = dataset.iloc[:, 4].values

Next, we will divide the data into train and test split. Following code will split the dataset
into 60% training data and 40% of testing data −

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.40)

Next, data scaling will be done as follows −

from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
scaler.fit(X_train)

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 5/11
Page 6 of 11

X_train = scaler.transform(X_train)
X_test = scaler.transform(X_test)

Next, train the model with the help of KNeighborsClassifier class of sklearn as follows −

from sklearn.neighbors import KNeighborsClassifier

classifier = KNeighborsClassifier(n_neighbors=8)
classifier.fit(X_train, y_train)

At last we need to make prediction. It can be done with the help of following script −

y_pred = classifier.predict(X_test)

Next, print the results as follows −

from sklearn.metrics import classification_report, confusion_matrix,

accuracy_score
result = confusion_matrix(y_test, y_pred)
print("Confusion Matrix:")
print(result)
result1 = classification_report(y_test, y_pred)
print("Classification Report:",)
print (result1)
result2 = accuracy_score(y_test,y_pred)
print("Accuracy:",result2)

Output

Confusion Matrix:
[[21 0 0]
[ 0 16 0]
[ 0 7 16]]
Classification Report:
precision recall f1-score support
Iris-setosa 1.00 1.00 1.00 21
Iris-versicolor 0.70 1.00 0.82 16
Iris-virginica 1.00 0.70 0.82 23
micro avg 0.88 0.88 0.88 60
macro avg 0.90 0.90 0.88 60

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 6/11
Page 7 of 11

weighted avg 0.92 0.88 0.88 60

Accuracy: 0.8833333333333333

KNN as Regressor
First, start with importing necessary Python packages −

import numpy as np
import pandas as pd

Next, download the iris dataset from its weblink as follows −

path = "https://archive.ics.uci.edu/ml/machine-learning-
databases/iris/iris.data"

Next, we need to assign column names to the dataset as follows −

headernames = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width',

'Class']

Now, we need to read dataset to pandas dataframe as follows −

data = pd.read_csv(url, names=headernames)

array = data.values
X = array[:,:2]
Y = array[:,2]
data.shape

output:(150, 5)

Next, import KNeighborsRegressor from sklearn to fit the model −

from sklearn.neighbors import KNeighborsRegressor

knnr = KNeighborsRegressor(n_neighbors=10)
knnr.fit(X, y)

At last, we can find the MSE as follows −

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 7/11
Page 8 of 11

print ("The MSE is:",format(np.power(y-knnr.predict(X),2).mean()))

Output

The MSE is: 0.12226666666666669

Pros and Cons of KNN

Pros

It is very simple algorithm to understand and interpret.

It is very useful for nonlinear data because there is no assumption about data in
this algorithm.

It is a versatile algorithm as we can use it for classification as well as regression.

It has relatively high accuracy but there are much better supervised learning
models than KNN.

Cons

It is computationally a bit expensive algorithm because it stores all the training

data.
High memory storage required as compared to other supervised learning
algorithms.
Prediction is slow in case of big N.

It is very sensitive to the scale of data as well as irrelevant features.

Applications of KNN
The following are some of the areas in which KNN can be applied successfully −

Banking System
Chapters
KNN can Categories
be used in banking system to predict weather an individual is fit for loan
approval? Does that individual have the characteristics similar to the defaulters one?

Calculating Credit Ratings

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 8/11
Page 9 of 11

KNN algorithms can be used to find an individual's credit rating by comparing with the
persons having similar traits.

Politics
With the help of KNN algorithms, we can classify a potential voter into various classes
like "Will Vote", "Will not Vote", "Will Vote to Party 'Congress', "Will Vote to Party 'BJP'.

Other areas in which KNN algorithm can be used are Speech Recognition, Handwriting
Detection, Image Recognition and Video Recognition.

Cloud Computing Tutorial

Amazon Web Services Tutorial

Microsoft Azure Tutorial

Git Tutorial
Ethical Hacking Tutorial

Docker Tutorial
Kubernetes Tutorial
DSA Tutorial

Spring Boot Tutorial

SDLC Tutorial
Unix Tutorial

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 9/11
Page 10 of 11

CERTIFICATIONS

Business Analytics Certification

Java & Spring Boot Advanced Certification

Data Science Advanced Certification
Cloud Computing And DevOps

Advanced Certification In Business Analytics

Artificial Intelligence And Machine Learning
DevOps Certification

Game Development Certification

Front-End Developer Certification
AWS Certification Training

Python Programming Certification

COMPILERS & EDITORS

Online Java Compiler

Online Python Compiler
Online Go Compiler

Online C Compiler
Online C++ Compiler
Online C# Compiler

Online PHP Compiler

Online MATLAB Compiler
Online Bash Compiler

Online SQL Compiler

Online Html Editor

ABOUT US | OUR TEAM | CAREERS | JOBS | CONTACT US | TERMS OF USE |

PRIVACY POLICY | REFUND POLICY | COOKIES POLICY | FAQ'S

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 10/11
Page 11 of 11

Tutorials Point is a leading Ed Tech company striving to provide the best learning material on
technical and non-technical subjects.

https://www.tutorialspoint.com/machine_learning/machine_learning_knn_nearest_neighbors.htm 11/11

CSL0777 L22
No ratings yet
CSL0777 L22
35 pages
KNN Algorithm Guide with Python
No ratings yet
KNN Algorithm Guide with Python
15 pages
KNN Classifier
No ratings yet
KNN Classifier
5 pages
KNN Algorithm: Basics and Python Guide
No ratings yet
KNN Algorithm: Basics and Python Guide
17 pages
K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
18 pages
Introduction To K-Nearest Neighbors: Simplified (With Implementation in Python)
100% (1)
Introduction To K-Nearest Neighbors: Simplified (With Implementation in Python)
125 pages
Rahul Raj - Ipynb - Colab
No ratings yet
Rahul Raj - Ipynb - Colab
50 pages
ML Lab2 PGM
No ratings yet
ML Lab2 PGM
3 pages
Experiment No 7 ML
No ratings yet
Experiment No 7 ML
4 pages
ML KN
No ratings yet
ML KN
12 pages
Part A 3. KNN Classification
No ratings yet
Part A 3. KNN Classification
35 pages
K-Nearest Neighbor (KNN) 6
No ratings yet
K-Nearest Neighbor (KNN) 6
46 pages
Unit 2
No ratings yet
Unit 2
30 pages
Untitled 9
No ratings yet
Untitled 9
17 pages
K-Nearest Neighbor
No ratings yet
K-Nearest Neighbor
22 pages
ML Notes
100% (2)
ML Notes
125 pages
K-Nearest Neighbor On Python Ken Ocuma
100% (2)
K-Nearest Neighbor On Python Ken Ocuma
9 pages
Machine Learning Lab Manual 7
100% (1)
Machine Learning Lab Manual 7
8 pages
KNN Algorithm
No ratings yet
KNN Algorithm
11 pages
K-Nearest Neighbors Algorithm
No ratings yet
K-Nearest Neighbors Algorithm
7 pages
KMEANS
No ratings yet
KMEANS
9 pages
Machine Learning - K-Nearest Neighbors (KNN)
No ratings yet
Machine Learning - K-Nearest Neighbors (KNN)
3 pages
KNN Algorithm for Car Classification
No ratings yet
KNN Algorithm for Car Classification
9 pages
K-Nearest Neighbor Classification-Algorithm and Characteristics
No ratings yet
K-Nearest Neighbor Classification-Algorithm and Characteristics
6 pages
B-56 Sanket Jambhulkar MLA-7
No ratings yet
B-56 Sanket Jambhulkar MLA-7
9 pages
Seminar Report File On KNN Models: University Institute of Engineering and Technology, Kurukshetra University
No ratings yet
Seminar Report File On KNN Models: University Institute of Engineering and Technology, Kurukshetra University
24 pages
k-NN Algorithm: Basics, Applications, and Advantages
No ratings yet
k-NN Algorithm: Basics, Applications, and Advantages
42 pages
k-NN Algorithm Overview & Applications
No ratings yet
k-NN Algorithm Overview & Applications
35 pages
KNN Algorithm Guide with Python
No ratings yet
KNN Algorithm Guide with Python
13 pages
K-NN Algorithm in Machine Learning
No ratings yet
K-NN Algorithm in Machine Learning
11 pages
K - Nearest Neighbor
No ratings yet
K - Nearest Neighbor
22 pages
Week 07
No ratings yet
Week 07
24 pages
KNN Updated
No ratings yet
KNN Updated
30 pages
KNN Colab Illustration
No ratings yet
KNN Colab Illustration
5 pages
K Nearest Neighbour's (KNN) (1) Using R
No ratings yet
K Nearest Neighbour's (KNN) (1) Using R
9 pages
'Machine Learning (Nagarjun)
No ratings yet
'Machine Learning (Nagarjun)
10 pages
6 - KNN Classifier
No ratings yet
6 - KNN Classifier
10 pages
Bài nhóm tìm hiểu về KNN
No ratings yet
Bài nhóm tìm hiểu về KNN
5 pages
ML-Unit 5
No ratings yet
ML-Unit 5
40 pages
Machine Learning Unit-3.1
No ratings yet
Machine Learning Unit-3.1
20 pages
Intro to KNN for Data Science
No ratings yet
Intro to KNN for Data Science
37 pages
KNN Lab
No ratings yet
KNN Lab
4 pages
K-Nearest Neighbor Algorithm
100% (1)
K-Nearest Neighbor Algorithm
6 pages
KNN Algorithm Guide for Students
No ratings yet
KNN Algorithm Guide for Students
7 pages
Unit V Non Parametric Machine Learning
No ratings yet
Unit V Non Parametric Machine Learning
47 pages
ML Unit-2
No ratings yet
ML Unit-2
24 pages
AML Lab No.04
No ratings yet
AML Lab No.04
7 pages
2 KNN
No ratings yet
2 KNN
67 pages
K - Nearest Neighbor
No ratings yet
K - Nearest Neighbor
2 pages
K-Nearest Neighbors: KNN Algorithm Pseudocode
No ratings yet
K-Nearest Neighbors: KNN Algorithm Pseudocode
2 pages
ML Lec07 KNN
100% (2)
ML Lec07 KNN
37 pages
K-Nearest Neighbors: Marcel Van Velzen Junior Marte Garcia
No ratings yet
K-Nearest Neighbors: Marcel Van Velzen Junior Marte Garcia
8 pages
K-Nearest Neighbor (KNN) Algorithm: Last Updated: 14 May, 2025
No ratings yet
K-Nearest Neighbor (KNN) Algorithm: Last Updated: 14 May, 2025
14 pages
KNN Lecture Presentation
No ratings yet
KNN Lecture Presentation
9 pages
A Complete Guide To KNN
No ratings yet
A Complete Guide To KNN
16 pages
12 ML KNN
No ratings yet
12 ML KNN
28 pages
k-Nearest Neighbors Lecture Notes
No ratings yet
k-Nearest Neighbors Lecture Notes
23 pages
ML 4
No ratings yet
ML 4
33 pages
ML Unit 5..
No ratings yet
ML Unit 5..
40 pages
Linux 2025
No ratings yet
Linux 2025
3 pages
Gcu Online Thesis
100% (2)
Gcu Online Thesis
8 pages
IBM Watson Analytics Automating Visualization Desc
No ratings yet
IBM Watson Analytics Automating Visualization Desc
12 pages
Wires Desssa
No ratings yet
Wires Desssa
2 pages
Matrikon OPC UA Explorer: Datasheet
No ratings yet
Matrikon OPC UA Explorer: Datasheet
3 pages
18CS734 - UID Module 4 Notes
No ratings yet
18CS734 - UID Module 4 Notes
31 pages
School Enrollment Support Guide
No ratings yet
School Enrollment Support Guide
3 pages
HackerRank SQL Certification Roadmap
No ratings yet
HackerRank SQL Certification Roadmap
3 pages
Unit
No ratings yet
Unit
7 pages
E3d Command List
No ratings yet
E3d Command List
13 pages
Lab Report Cse
No ratings yet
Lab Report Cse
5 pages
Unit 3
No ratings yet
Unit 3
64 pages
Messari Report Crypto Theses For 2022
No ratings yet
Messari Report Crypto Theses For 2022
180 pages
Auto Insurance Fraud Detection
No ratings yet
Auto Insurance Fraud Detection
27 pages
Word Chapter 5 Study Guide
No ratings yet
Word Chapter 5 Study Guide
3 pages
Onshape College Lesson 10
No ratings yet
Onshape College Lesson 10
43 pages
GED Math Fact Sheets
No ratings yet
GED Math Fact Sheets
21 pages
Easy2forge Open Die Forge Software
No ratings yet
Easy2forge Open Die Forge Software
20 pages
EL1TNTT Training v0.1 31 02
No ratings yet
EL1TNTT Training v0.1 31 02
154 pages
Interview Questions Servicenow
No ratings yet
Interview Questions Servicenow
8 pages
Vigilohm Insulation Monitor
No ratings yet
Vigilohm Insulation Monitor
95 pages
QuickBooks Enterprise Contact Information
No ratings yet
QuickBooks Enterprise Contact Information
11 pages
Python Monthly Expense
No ratings yet
Python Monthly Expense
10 pages
Alka Tiwari
No ratings yet
Alka Tiwari
37 pages
Project Presentation
No ratings yet
Project Presentation
16 pages
MM Sap Transactions
No ratings yet
MM Sap Transactions
47 pages
Ie400 Project S2020
No ratings yet
Ie400 Project S2020
2 pages
Unit Converter: CM To Inches Converter - Rapidtables
No ratings yet
Unit Converter: CM To Inches Converter - Rapidtables
4 pages
Acfroga7lh 3qkjyenivl01jo7ajbmipe Nvvlmfdrm53id0o2x7hq Evlyzkpsyz0wydsfreraso3q6nvj8jqj7ke0uhlglzplv0j9dvprlkrcaaib0z 1dhbx1ywi
No ratings yet
Acfroga7lh 3qkjyenivl01jo7ajbmipe Nvvlmfdrm53id0o2x7hq Evlyzkpsyz0wydsfreraso3q6nvj8jqj7ke0uhlglzplv0j9dvprlkrcaaib0z 1dhbx1ywi
1 page
Sahil Sharma (Resume) PDF
No ratings yet
Sahil Sharma (Resume) PDF
1 page

Updated K-Nearest Neighbors in Machine Learning

Uploaded by

Updated K-Nearest Neighbors in Machine Learning

Uploaded by

Page 1 of 11

Home Whiteboard AI Assistant Online Compilers Jobs Tools Art

SQL HTML CSS Javascript Python Java C C++ PHP Scala C#

K-Nearest Neighbors (KNN) in Machine

K-Nearest Neighbors (KNN) Algorithm

The following two properties would define KNN well −

Non-parametric learning algorithm − KNN is also a non-parametric learning

How Does K-Nearest Neighbors Algorithm Work?

Step 3 − For each point in the test data do the following −

The following is an example to understand the concept of K and working of KNN

Suppose we have a dataset which can be plotted as follows −

Building a K Nearest Neighbors Model

Evaluate performance − Finally, the KNN algorithm's performance is evaluated

Implementation of KNN Algorithm in Python

First, start with importing necessary python packages −

Next, download the iris dataset from its weblink as follows −

Next, we need to assign column names to the dataset as follows −

headernames = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width',

Now, we need to read dataset to pandas dataframe as follows −

dataset = pd.read_csv(path, names=headernames)

slno. sepal-length sepal-width petal-length petal-width Class

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

from sklearn.model_selection import train_test_split

Next, data scaling will be done as follows −

from sklearn.preprocessing import StandardScaler

from sklearn.neighbors import KNeighborsClassifier

Next, print the results as follows −

from sklearn.metrics import classification_report, confusion_matrix,

weighted avg 0.92 0.88 0.88 60

Next, download the iris dataset from its weblink as follows −

Next, we need to assign column names to the dataset as follows −

headernames = ['sepal-length', 'sepal-width', 'petal-length', 'petal-width',

Now, we need to read dataset to pandas dataframe as follows −

data = pd.read_csv(url, names=headernames)

Next, import KNeighborsRegressor from sklearn to fit the model −

from sklearn.neighbors import KNeighborsRegressor

At last, we can find the MSE as follows −

print ("The MSE is:",format(np.power(y-knnr.predict(X),2).mean()))

The MSE is: 0.12226666666666669

Pros and Cons of KNN

It is very simple algorithm to understand and interpret.

It is a versatile algorithm as we can use it for classification as well as regression.

It is computationally a bit expensive algorithm because it stores all the training

It is very sensitive to the scale of data as well as irrelevant features.

Calculating Credit Ratings

Cloud Computing Tutorial

Microsoft Azure Tutorial

Spring Boot Tutorial

Business Analytics Certification

Java & Spring Boot Advanced Certification

Advanced Certification In Business Analytics

Game Development Certification

Python Programming Certification

COMPILERS & EDITORS

Online Java Compiler

Online PHP Compiler

Online SQL Compiler

ABOUT US | OUR TEAM | CAREERS | JOBS | CONTACT US | TERMS OF USE |

PRIVACY POLICY | REFUND POLICY | COOKIES POLICY | FAQ'S

© Copyright 2025. All Rights Reserved.

You might also like