100% found this document useful (1 vote)

123 views5 pages

Lab7.ipynb - Colaboratory

This document implements a K-Nearest Neighbors model on an Iris flower dataset to classify species. It loads and explores the data, splits it into training and test sets, trains a KNN model with an optimized K value of 3, predicts species classifications on the test set, and evaluates the model's accuracy at 97.78%.

Uploaded by

PRAGASM PROG

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

123 views5 pages

Lab7.ipynb - Colaboratory

Uploaded by

PRAGASM PROG

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

5/10/22, 2:58 PM Lab7.

ipynb - Colaboratory

Implementing K Nearest Naighbour for a dataset.

Importing Libraries and Dataset: -

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

import seaborn as sns

from google.colab import files

uploaded = files.upload()

Choose Files Iris.csv

Iris.csv(text/csv) - 5107 bytes, last modified: 3/17/2022 - 100% done
Saving Iris.csv to Iris.csv

Creating Data frame: -

df=pd.read_csv('Iris.csv')

Printing first 10 values: -

df.head(10)

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

6 7 4.6 3.4 1.4 0.3 Iris-setosa

7 8 5.0 3.4 1.5 0.2 Iris-setosa

8 9 4.4 2.9 1.4 0.2 Iris-setosa

9 10 4.9 3.1 1.5 0.1 Iris-setosa

Printing the all information of the dataset: -

df.info()

https://colab.research.google.com/drive/17EdAX0gZZGyDlojce0QA0Dn3jdLQt0Fa?authuser=1#scrollTo=IDas4r15mL2H&printMode=true 1/5
5/10/22, 2:58 PM Lab7.ipynb - Colaboratory

RangeIndex: 150 entries, 0 to 149

Data columns (total 6 columns):

# Column Non-Null Count Dtype

--- ------ -------------- -----

0 Id 150 non-null int64

1 SepalLengthCm 150 non-null float64

2 SepalWidthCm 150 non-null float64

3 PetalLengthCm 150 non-null float64

4 PetalWidthCm 150 non-null float64

5 Species 150 non-null object

dtypes: float64(4), int64(1), object(1)

memory usage: 7.2+ KB

Checking is there exists any null values in the dataset or not: -

df[df.isnull().any(axis=1)].head()

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

Creating independent variable: -

X=df.iloc[:,[1,2,3,4]].values

Creating dependent variable: -

Y=df.iloc[:,5]

Splitting the dataset: -

from sklearn.model_selection import train_test_split

train_X,test_X,train_Y,test_Y = train_test_split(X, Y, test_size=0.3, random_state=0)

Standardizing the dataset: -

from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
train_X = sc.fit_transform(train_X)
test_X = sc.transform(test_X)

Finding the optimised value of K: -

import math
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score
https://colab.research.google.com/drive/17EdAX0gZZGyDlojce0QA0Dn3jdLQt0Fa?authuser=1#scrollTo=IDas4r15mL2H&printMode=true 2/5
5/10/22, 2:58 PM Lab7.ipynb - Colaboratory

n=len(df.index)
li=list()
li2=list()
for i in range(1,int(pow(n,1/2))):
  kclass = KNeighborsClassifier(n_neighbors = i, metric = 'minkowski', p = 2)
  kclass.fit(train_X, train_Y)
  y_pred = kclass.predict(test_X)
  ac = accuracy_score(test_Y,y_pred)
  li.append(ac)
  li2.append(i)

max = li[0]
index = 0
for i in range(1,len(li)):
    if li[i] > max:
        max = li[i]
        index = i

k=li2[index]
print("The value of K is = ",k)

plt.plot(li2,li)
plt.title("Graph showing the Accuracy with K",size=15,fontweight="bold")
plt.xlabel("Value of K",size=12,fontweight="bold")
plt.ylabel("Accuracy",size=12,fontweight="bold")
plt.show()

The value of K is = 3

Importing the KNN classifier for implementing the model: -

from sklearn.neighbors import KNeighborsClassifier

kclass = KNeighborsClassifier(n_neighbors = k, metric = 'minkowski', p = 2)

Training the model: -

https://colab.research.google.com/drive/17EdAX0gZZGyDlojce0QA0Dn3jdLQt0Fa?authuser=1#scrollTo=IDas4r15mL2H&printMode=true 3/5
5/10/22, 2:58 PM Lab7.ipynb - Colaboratory

kclass.fit(train_X, train_Y)

KNeighborsClassifier(n_neighbors=3)

Predicting the values of the Y(y_pred): -

y_pred = kclass.predict(test_X)

The values of the predicted y are : -

y_pred

array(['Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa', 'Iris-virginica', 'Iris-setosa',

'Iris-versicolor', 'Iris-versicolor', 'Iris-versicolor',

'Iris-virginica', 'Iris-versicolor', 'Iris-versicolor',

'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',

'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa', 'Iris-setosa',

'Iris-virginica', 'Iris-versicolor', 'Iris-setosa', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa', 'Iris-setosa', 'Iris-versicolor',

'Iris-versicolor', 'Iris-setosa', 'Iris-virginica',

'Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-versicolor', 'Iris-versicolor',

'Iris-virginica', 'Iris-setosa', 'Iris-virginica', 'Iris-setosa',

'Iris-setosa'], dtype=object)

Performance Measure: -

from sklearn.metrics import confusion_matrix,accuracy_score

cm = confusion_matrix(test_Y, y_pred)
print("Confusion Matrix: -\n",cm)

ac = accuracy_score(test_Y,y_pred)
print("\nAccuracy of the model(in %) is = ",ac*100)

Confusion Matrix: -

[[16 0 0]

[ 0 17 1]

[ 0 0 11]]

Accuracy of the model(in %) is = 97.77777777777777

https://colab.research.google.com/drive/17EdAX0gZZGyDlojce0QA0Dn3jdLQt0Fa?authuser=1#scrollTo=IDas4r15mL2H&printMode=true 4/5
5/10/22, 2:58 PM Lab7.ipynb - Colaboratory

check 0s completed at 2:57 PM

https://colab.research.google.com/drive/17EdAX0gZZGyDlojce0QA0Dn3jdLQt0Fa?authuser=1#scrollTo=IDas4r15mL2H&printMode=true 5/5

NYC Taxi Fare Data Cleaning
100% (1)
NYC Taxi Fare Data Cleaning
8 pages
HW1
100% (1)
HW1
8 pages
SAT and GPA Regression Analysis
100% (1)
SAT and GPA Regression Analysis
1 page
KNN for Telecom Customer Segmentation
100% (1)
KNN for Telecom Customer Segmentation
11 pages
Logistics Regression
100% (1)
Logistics Regression
5 pages
ML0101EN Clas Logistic Reg Churn Py v1
100% (1)
ML0101EN Clas Logistic Reg Churn Py v1
13 pages
Neural Network Based Rainfall Prediction System
100% (1)
Neural Network Based Rainfall Prediction System
6 pages
IRIS BPNN - Ipynb - Colaboratory
100% (1)
IRIS BPNN - Ipynb - Colaboratory
4 pages
An Introduction To Feature Selection
No ratings yet
An Introduction To Feature Selection
45 pages
Multicollinearity Exercise
100% (1)
Multicollinearity Exercise
6 pages
Machine Learning and Data Analytics Using Python Lab
No ratings yet
Machine Learning and Data Analytics Using Python Lab
36 pages
Linear - Regression
100% (1)
Linear - Regression
39 pages
Outliers, Hypothesis and Natural Language Processing
100% (1)
Outliers, Hypothesis and Natural Language Processing
7 pages
Glass Classification
100% (2)
Glass Classification
3 pages
Intro to Machine Learning Basics
100% (1)
Intro to Machine Learning Basics
52 pages
Credit Card Fraud Detection Using Machine Learning
100% (1)
Credit Card Fraud Detection Using Machine Learning
82 pages
Patient Data Management System
100% (1)
Patient Data Management System
27 pages
K-NN (Nearest Neighbor)
100% (1)
K-NN (Nearest Neighbor)
17 pages
Importing Libraries: Import As Import As Import As From Import As From Import From Import Import
100% (1)
Importing Libraries: Import As Import As Import As From Import As From Import From Import Import
11 pages
Unit V - Classification and Prediction 2020-21
100% (1)
Unit V - Classification and Prediction 2020-21
68 pages
(IJETA-V8I5P1) :yew Kee Wong
No ratings yet
(IJETA-V8I5P1) :yew Kee Wong
5 pages
A) What Is Motivation Behind Ensemble Methods? Give Your Answer in Probabilistic Terms
100% (1)
A) What Is Motivation Behind Ensemble Methods? Give Your Answer in Probabilistic Terms
6 pages
Unsupervised Feature Extraction With Autoencoders For EEG Based Multiclass Motor Imagery BCI
No ratings yet
Unsupervised Feature Extraction With Autoencoders For EEG Based Multiclass Motor Imagery BCI
10 pages
Outlines: Statements of Problems Objectives Bagging Random Forest Boosting Adaboost
100% (1)
Outlines: Statements of Problems Objectives Bagging Random Forest Boosting Adaboost
14 pages
Vinee
100% (1)
Vinee
28 pages
ML Lect1
100% (1)
ML Lect1
51 pages
Book
100% (1)
Book
480 pages
Linear Regression Models Guide
100% (1)
Linear Regression Models Guide
61 pages
Thinkcspy 3
100% (1)
Thinkcspy 3
415 pages
Regression Anallysis Hands0n 1
100% (1)
Regression Anallysis Hands0n 1
3 pages
Logistic Regression
100% (1)
Logistic Regression
29 pages
CS550 Regression Aug12
100% (1)
CS550 Regression Aug12
63 pages
Linear Regression: What Is Regression Analysis?
100% (1)
Linear Regression: What Is Regression Analysis?
21 pages
0.1 Stock Data
100% (1)
0.1 Stock Data
4 pages
Assignment10 4
100% (1)
Assignment10 4
3 pages
Lab 3. Linear Regression 230223
100% (1)
Lab 3. Linear Regression 230223
7 pages
SVM Guide for Data Science Enthusiasts
100% (1)
SVM Guide for Data Science Enthusiasts
28 pages
Charmi Shah 20bcp299 Lab2
100% (1)
Charmi Shah 20bcp299 Lab2
7 pages
Real-Time Face Detection On A "Dual-Sensor" Smart Camera Using Smooth-Edges Technique
No ratings yet
Real-Time Face Detection On A "Dual-Sensor" Smart Camera Using Smooth-Edges Technique
5 pages
ECG Image Classification with ML
100% (1)
ECG Image Classification with ML
16 pages
Artificial Neural Network (ANN)
No ratings yet
Artificial Neural Network (ANN)
34 pages
Econ209 f2024 Lab 4 Truong Gia Han
No ratings yet
Econ209 f2024 Lab 4 Truong Gia Han
11 pages
Churn Modeling
100% (1)
Churn Modeling
11 pages
Classification Problems
100% (1)
Classification Problems
25 pages
Currency Recognition On Mobile Phones Proposed System Modules
No ratings yet
Currency Recognition On Mobile Phones Proposed System Modules
26 pages
Csi 5155 ML Project Report
100% (1)
Csi 5155 ML Project Report
24 pages
Python Setup For Machine Learning
100% (1)
Python Setup For Machine Learning
3 pages
SMS Spam Detection with ML Algorithms
No ratings yet
SMS Spam Detection with ML Algorithms
4 pages
Decision Trees: at Some Point of Time You Have To Take A Decision Sitting On A Tree
100% (1)
Decision Trees: at Some Point of Time You Have To Take A Decision Sitting On A Tree
19 pages
Numerical Analysis MCQs for NET/SET
No ratings yet
Numerical Analysis MCQs for NET/SET
25 pages
RBF, KNN, SVM, DT
No ratings yet
RBF, KNN, SVM, DT
9 pages
PR01
100% (1)
PR01
41 pages
Computer Science Project
No ratings yet
Computer Science Project
19 pages
Assignment Updated 101
100% (1)
Assignment Updated 101
24 pages
Student Movie Ticket System Report
No ratings yet
Student Movie Ticket System Report
14 pages
01-Introduction Machine Learning
100% (1)
01-Introduction Machine Learning
48 pages
Computer Aided Technology Based On Graph Sample and Aggregate Attention Network Optimized For Soccer Teaching and Training
No ratings yet
Computer Aided Technology Based On Graph Sample and Aggregate Attention Network Optimized For Soccer Teaching and Training
18 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
It - S All About Neighbors - Completed
No ratings yet
It - S All About Neighbors - Completed
14 pages
Practice Problems
No ratings yet
Practice Problems
4 pages
Measures of Centrality
No ratings yet
Measures of Centrality
13 pages
Connectivity
No ratings yet
Connectivity
28 pages
Unit 4 - IAPM
No ratings yet
Unit 4 - IAPM
17 pages
Unit 3 - BA - July 2022
No ratings yet
Unit 3 - BA - July 2022
94 pages
Business Analytics Essentials
No ratings yet
Business Analytics Essentials
43 pages
Enable Two-Finger Scroll in Windows
No ratings yet
Enable Two-Finger Scroll in Windows
9 pages
Series 90-30 System Manual For Windows Users
No ratings yet
Series 90-30 System Manual For Windows Users
116 pages
Manual Kronnus Mih61m-D
No ratings yet
Manual Kronnus Mih61m-D
19 pages
Interface Software: Entec
No ratings yet
Interface Software: Entec
58 pages
Quantum: Clock Module 140 DCF 077 00 User Manual
No ratings yet
Quantum: Clock Module 140 DCF 077 00 User Manual
28 pages
.NET Core, SDLC, Agile & C# Course
No ratings yet
.NET Core, SDLC, Agile & C# Course
34 pages
Odoo Sample Exercises
No ratings yet
Odoo Sample Exercises
6 pages
Robot ICT - RPA E-Book
No ratings yet
Robot ICT - RPA E-Book
45 pages
Exp 1 (Ismdr)
No ratings yet
Exp 1 (Ismdr)
5 pages
Java Collection Notes.
No ratings yet
Java Collection Notes.
37 pages
628 630
No ratings yet
628 630
3 pages
Logfile
No ratings yet
Logfile
2 pages
FPGA Interview Prep Guide
No ratings yet
FPGA Interview Prep Guide
8 pages
DQ Top20 - Meet India's Top 100 IT CompaniesDATAQUEST
No ratings yet
DQ Top20 - Meet India's Top 100 IT CompaniesDATAQUEST
6 pages
Concurrency: Deadlock and Starvation: William Stallings
No ratings yet
Concurrency: Deadlock and Starvation: William Stallings
87 pages
Mesa (Programming Language)
No ratings yet
Mesa (Programming Language)
5 pages
Iot Based Smart Electricity Meter and Power Theft Detection
No ratings yet
Iot Based Smart Electricity Meter and Power Theft Detection
6 pages
Object-Oriented Programming (CS F213) : BITS Pilani
No ratings yet
Object-Oriented Programming (CS F213) : BITS Pilani
14 pages
Dedicated Server
No ratings yet
Dedicated Server
3 pages
LX3V-4AD User Manual
No ratings yet
LX3V-4AD User Manual
12 pages
Smartphone Enterprise Applications
No ratings yet
Smartphone Enterprise Applications
3 pages
History and Evolution of Computers
No ratings yet
History and Evolution of Computers
3 pages
Lect 4 Introduction To PACS Copy-1
No ratings yet
Lect 4 Introduction To PACS Copy-1
40 pages
Open Rails Log
No ratings yet
Open Rails Log
27 pages
Newest - Booklet Theory 1 EDPM
No ratings yet
Newest - Booklet Theory 1 EDPM
30 pages
Jjeb Mock S6 Ict 1
100% (2)
Jjeb Mock S6 Ict 1
11 pages
Create Maintenance Notification IW21
No ratings yet
Create Maintenance Notification IW21
1 page
Fundamentals of Computer and Digital Systems
No ratings yet
Fundamentals of Computer and Digital Systems
418 pages
User Manual DUO
No ratings yet
User Manual DUO
254 pages
Database Management System
No ratings yet
Database Management System
9 pages

Lab7.ipynb - Colaboratory

Uploaded by

Lab7.ipynb - Colaboratory

Uploaded by

5/10/22, 2:58 PM Lab7.

Implementing K Nearest Naighbour for a dataset.

Importing Libraries and Dataset: -

Choose Files Iris.csv

Creating Data frame: -

Printing first 10 values: -

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

6 7 4.6 3.4 1.4 0.3 Iris-setosa

7 8 5.0 3.4 1.5 0.2 Iris-setosa

8 9 4.4 2.9 1.4 0.2 Iris-setosa

9 10 4.9 3.1 1.5 0.1 Iris-setosa

Printing the all information of the dataset: -

RangeIndex: 150 entries, 0 to 149

# Column Non-Null Count Dtype

--- ------ -------------- -----

0 Id 150 non-null int64

1 SepalLengthCm 150 non-null float64

2 SepalWidthCm 150 non-null float64

3 PetalLengthCm 150 non-null float64

4 PetalWidthCm 150 non-null float64

5 Species 150 non-null object

dtypes: float64(4), int64(1), object(1)

memory usage: 7.2+ KB

Checking is there exists any null values in the dataset or not: -

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

Creating independent variable: -

Creating dependent variable: -

Splitting the dataset: -

Standardizing the dataset: -

Finding the optimised value of K: -

Importing the KNN classifier for implementing the model: -

Training the model: -

Predicting the values of the Y(y_pred): -

The values of the predicted y are : -

array(['Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa', 'Iris-virginica', 'Iris-setosa',

'Iris-versicolor', 'Iris-versicolor', 'Iris-versicolor',

'Iris-virginica', 'Iris-versicolor', 'Iris-versicolor',

'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',

'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa', 'Iris-setosa',

'Iris-virginica', 'Iris-versicolor', 'Iris-setosa', 'Iris-setosa',

'Iris-virginica', 'Iris-setosa', 'Iris-setosa', 'Iris-versicolor',

'Iris-versicolor', 'Iris-setosa', 'Iris-virginica',

'Iris-versicolor', 'Iris-setosa', 'Iris-virginica',

'Iris-virginica', 'Iris-versicolor', 'Iris-setosa',

'Iris-virginica', 'Iris-versicolor', 'Iris-versicolor',

'Iris-virginica', 'Iris-setosa', 'Iris-virginica', 'Iris-setosa',

Accuracy of the model(in %) is = 97.77777777777777

check 0s completed at 2:57 PM

You might also like