Lecture 11 - K Nearest Neighbors - Part 2
K Nearest Neighbors with Python
Import Libraries
In [10]: 1 import pandas as pd
2 import seaborn as sns
3 import matplotlib.pyplot as plt
4 import numpy as np
5 %matplotlib inline
Get the Data
In [11]: 1 df = pd.read_csv('Downloads/KNN_Project_Data')
In [12]: 1 df.head()
Out[12]:
          XVPM         GWYH         TRAT        TLLZ         IGGA         HYKR         EDFS  ...
0  1636.670614   817.988525  2565.995189  358.347163   550.417491  1618.870897  2147.641254  ...
1  1013.402760   577.587332  2644.141273  280.428203  1161.873391  2084.107872   853.404981  ...
2  1300.035501   820.518697  2025.854469  525.562292   922.206261  2552.355407   818.676686  ...
3  1059.347542  1066.866418   612.000041  480.827789   419.467495   685.666983   852.867810  ...
4  1018.340526  1313.679056   950.622661  724.742174   843.065903  1370.554164   905.469453  ...
Standardize the Variables
In [4]: 1 from sklearn.preprocessing import StandardScaler
In [5]: 1 scaler = StandardScaler()
In [13]: 1 scaler.fit(df.drop('TARGET CLASS',axis=1))
Out[13]: StandardScaler()
In [16]: 1 scaled_features = scaler.transform(df.drop('TARGET CLASS',axis=1))
In [20]: 1 df_feat = pd.DataFrame(scaled_features,columns=df.columns[:-1])
2 df_feat.head()
Out[20]:
XVPM GWYH TRAT TLLZ IGGA HYKR EDFS GUUB MGJM
0 1.568522 -0.443435 1.619808 -0.958255 -1.128481 0.138336 0.980493 -0.932794 1.008313
1 -0.112376 -1.056574 1.741918 -1.504220 0.640009 1.081552 -1.182663 -0.461864 0.258321
2 0.660647 -0.436981 0.775793 0.213394 -0.053171 2.030872 -1.240707 1.149298 2.184784
3 0.011533 0.191324 -1.433473 -0.100053 -1.507223 -1.753632 -1.183561 -0.888557 0.162310
4 -0.099059 0.820815 -0.904346 1.609015 -0.282065 -0.365099 -1.095644 0.391419 -1.365603
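As a quick sanity check (a sketch, not part of the original notebook), every column of scaled_features should now have a mean of roughly 0 and a standard deviation of roughly 1, since StandardScaler simply computes (x - mean) / std for each feature:
In [ ]: 1 # Sketch (not from the lecture): StandardScaler output should have
        2 # mean ~0 and standard deviation ~1 in every column.
        3 print(np.round(scaled_features.mean(axis=0), 3))
        4 print(np.round(scaled_features.std(axis=0), 3))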
Train Test Split
In [21]: 1 from sklearn.model_selection import train_test_split
In [22]: 1 X_train, X_test, y_train, y_test = train_test_split(scaled_features,df['TARGET CLASS'],
         2                                                      test_size=0.30)
Using KNN
Remember that we are trying to build a model that predicts whether an observation belongs to the TARGET
CLASS or not. We'll start with k=1.
In [23]: 1 from sklearn.neighbors import KNeighborsClassifier
In [24]: 1 knn = KNeighborsClassifier(n_neighbors=1)
In [25]: 1 knn.fit(X_train,y_train)
Out[25]: KNeighborsClassifier(n_neighbors=1)
In [26]: 1 pred = knn.predict(X_test)
Predictions and Evaluations
Let's evaluate our KNN model!
In [27]: 1 from sklearn.metrics import classification_report,confusion_matrix
In [28]: 1 print(confusion_matrix(y_test,pred))
[[109 45]
[ 33 113]]
In [29]: 1 print(classification_report(y_test,pred))
precision recall f1-score support
0 0.77 0.71 0.74 154
1 0.72 0.77 0.74 146
accuracy 0.74 300
macro avg 0.74 0.74 0.74 300
weighted avg 0.74 0.74 0.74 300
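The overall accuracy in the report above can also be computed directly. The following cell is a small sketch (not part of the original notebook) that reuses the same y_test and pred from above:
In [ ]: 1 # Sketch: cross-check the 0.74 accuracy reported above.
        2 from sklearn.metrics import accuracy_score
        3 print(accuracy_score(y_test, pred))   # overall accuracy
        4 print(np.mean(pred == y_test))        # the same value computed by hand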
Choosing a K Value
Let's go ahead and use the elbow method to pick a good K Value:
In [30]: 1 error_rate = []
2
3 # Will take some time
4 for i in range(1,40):
5
6 knn = KNeighborsClassifier(n_neighbors=i)
7 knn.fit(X_train,y_train)
8 pred_i = knn.predict(X_test)
9 error_rate.append(np.mean(pred_i != y_test))
In [31]: 1 plt.figure(figsize=(10,6))
2 plt.plot(range(1,40),error_rate,color='blue', linestyle='dashed', marker='o',
3 markerfacecolor='red', markersize=10)
4 plt.title('Error Rate vs. K Value')
5 plt.xlabel('K')
6 plt.ylabel('Error Rate')
Out[31]: Text(0, 0.5, 'Error Rate')
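Rather than reading the elbow off the plot by eye, the K with the lowest test error can also be pulled straight out of error_rate. This is a sketch, not part of the original notebook:
In [ ]: 1 # Sketch: find the K value with the smallest error rate from the loop above.
        2 k_values = list(range(1,40))
        3 best_k = k_values[int(np.argmin(error_rate))]
        4 print('Lowest error rate:', min(error_rate), 'at K =', best_k)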
In [32]: 1 # FIRST A QUICK COMPARISON TO OUR ORIGINAL K=1
2 knn = KNeighborsClassifier(n_neighbors=1)
3
4 knn.fit(X_train,y_train)
5 pred = knn.predict(X_test)
6
7 print('WITH K=1')
8 print('\n')
9 print(confusion_matrix(y_test,pred))
10 print('\n')
11 print(classification_report(y_test,pred))
WITH K=1
[[109 45]
[ 33 113]]
precision recall f1-score support
0 0.77 0.71 0.74 154
1 0.72 0.77 0.74 146
accuracy 0.74 300
macro avg 0.74 0.74 0.74 300
weighted avg 0.74 0.74 0.74 300
In [35]: 1 # NOW WITH K=23
2 knn = KNeighborsClassifier(n_neighbors=23)
3
4 knn.fit(X_train,y_train)
5 pred = knn.predict(X_test)
6
7 print('WITH K=23')
8 print('\n')
9 print(confusion_matrix(y_test,pred))
10 print('\n')
11 print(classification_report(y_test,pred))
WITH K=23
[[114 40]
[ 20 126]]
precision recall f1-score support
0 0.85 0.74 0.79 154
1 0.76 0.86 0.81 146
accuracy 0.80 300
macro avg 0.80 0.80 0.80 300
weighted avg 0.81 0.80 0.80 300
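As a closing note, the scaling, splitting, and fitting steps above can be bundled into a single scikit-learn Pipeline so that the scaler is fit on the training data only. This is a minimal sketch (not from the original lecture) that assumes the raw df loaded at the start of the notebook:
In [ ]:  1 # Sketch: the same workflow wrapped in a Pipeline.
         2 from sklearn.pipeline import make_pipeline
         3
         4 X = df.drop('TARGET CLASS', axis=1)
         5 y = df['TARGET CLASS']
         6 X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.30)
         7
         8 pipe = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=23))
         9 pipe.fit(X_tr, y_tr)
        10 print(pipe.score(X_te, y_te))   # mean accuracy on the held-out data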