SVM and K-Means - Iris dataset.ipynb - Colab
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
import matplotlib.pyplot as plt
import seaborn as sns
!kaggle datasets download -d uciml/iris
Dataset URL: https://www.kaggle.com/datasets/uciml/iris
License(s): CC0-1.0
Downloading iris.zip to /content
0% 0.00/3.60k [00:00<?, ?B/s]
100% 3.60k/3.60k [00:00<00:00, 7.28MB/s]
Loading the dataset and creating a DataFrame
!unzip iris.zip
Archive: iris.zip
inflating: Iris.csv
inflating: database.sqlite
df = pd.read_csv('Iris.csv')
print(df.head())
Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species
0 1 5.1 3.5 1.4 0.2 Iris-setosa
1 2 4.9 3.0 1.4 0.2 Iris-setosa
2 3 4.7 3.2 1.3 0.2 Iris-setosa
3 4 4.6 3.1 1.5 0.2 Iris-setosa
4 5 5.0 3.6 1.4 0.2 Iris-setosa
Converting the categorical Species labels to numeric codes
df['Species'] = df['Species'].astype('category').cat.codes
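For reference, cat.codes assigns codes in the alphabetical order of the category names. A small sketch to print that mapping (it re-reads Iris.csv only because df['Species'] has already been overwritten with the codes):
species_cat = pd.read_csv('Iris.csv')['Species'].astype('category')
print(dict(enumerate(species_cat.cat.categories)))  # {0: 'Iris-setosa', 1: 'Iris-versicolor', 2: 'Iris-virginica'}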
Selecting feature columns and assigning them to X and y
X = df.iloc[:, :-1].values   # all columns except Species (note: this also keeps the Id column as a feature)
y = df.iloc[:, -1].values    # the encoded Species column
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
print("Training set shape:", X_train.shape)
print("Test set shape:", X_test.shape)
Training set shape: (120, 5)
Test set shape: (30, 5)
Training the SVM model
svm_model = SVC(kernel='linear', C=1.0, random_state=42)
Model fitting and prediction
svm_model.fit(X_train, y_train)
y_pred = svm_model.predict(X_test)
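SVMs are sensitive to feature scale; the linear kernel happens to separate Iris well without scaling, but a common pattern is to standardise the features inside a pipeline so the same scaling is applied at prediction time. A small illustrative sketch, not part of the results below:
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
# Scale the features, then fit the same linear SVM; the pipeline reuses the scaler at predict time
scaled_svm = make_pipeline(StandardScaler(), SVC(kernel='linear', C=1.0, random_state=42))
scaled_svm.fit(X_train, y_train)
print("Scaled-pipeline test accuracy:", scaled_svm.score(X_test, y_test))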
Evaluation metrics
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)
print("\nClassification Report:")
print(classification_report(y_test, y_pred))
Accuracy: 1.0
Classification Report:
precision recall f1-score support
0 1.00 1.00 1.00 10
1 1.00 1.00 1.00 9
2 1.00 1.00 1.00 11
accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30
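The 1.0 accuracy above comes from a single 30-sample test split; a quick sanity check is k-fold cross-validation on the full data. A minimal sketch using scikit-learn's cross_val_score, with the same linear SVM settings (illustrative only):
from sklearn.model_selection import cross_val_score
# 5-fold cross-validation of the same linear SVM on all 150 samples
cv_scores = cross_val_score(SVC(kernel='linear', C=1.0, random_state=42), X, y, cv=5)
print("Fold accuracies:", cv_scores)
print("Mean CV accuracy:", cv_scores.mean())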
Confusion Matrix
conf_matrix = confusion_matrix(y_test, y_pred)
print("\nConfusion Matrix:")
print(conf_matrix)
Confusion Matrix:
[[10 0 0]
[ 0 9 0]
[ 0 0 11]]
Heatmap of the confusion matrix
sns.heatmap(conf_matrix, annot=True, cmap="YlGnBu", fmt='g')
plt.title("Confusion Matrix")
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()
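As an alternative to seaborn, the same plot can be produced directly from the predictions; a short sketch, assuming scikit-learn 1.0 or newer (which provides ConfusionMatrixDisplay.from_predictions):
from sklearn.metrics import ConfusionMatrixDisplay
# Draw the confusion matrix straight from the true and predicted labels
ConfusionMatrixDisplay.from_predictions(y_test, y_pred)
plt.title("Confusion Matrix")
plt.show()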
K-Means implementation
import numpy as np
df = pd.read_csv('/content/Iris.csv')  # reload the original data, with Species as strings, for clustering
df.head()
Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species
0 1 5.1 3.5 1.4 0.2 Iris-setosa
1 2 4.9 3.0 1.4 0.2 Iris-setosa
2 3 4.7 3.2 1.3 0.2 Iris-setosa
3 4 4.6 3.1 1.5 0.2 Iris-setosa
4 5 5.0 3.6 1.4 0.2 Iris-setosa
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 150 entries, 0 to 149
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 150 non-null int64
1 SepalLengthCm 150 non-null float64
2 SepalWidthCm 150 non-null float64
3 PetalLengthCm 150 non-null float64
4 PetalWidthCm 150 non-null float64
5 Species 150 non-null object
dtypes: float64(4), int64(1), object(1)
memory usage: 7.2+ KB
df.drop(['Id'], axis=1, inplace=True)
df.isnull().sum()
SepalLengthCm 0
SepalWidthCm 0
PetalLengthCm 0
PetalWidthCm 0
Species 0
dtype: int64
df.describe()
SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm
count 150.000000 150.000000 150.000000 150.000000
mean 5.843333 3.054000 3.758667 1.198667
std 0.828066 0.433594 1.764420 0.763161
min 4.300000 2.000000 1.000000 0.100000
25% 5.100000 2.800000 1.600000 0.300000
50% 5.800000 3.000000 4.350000 1.300000
75% 6.400000 3.300000 5.100000 1.800000
max 7.900000 4.400000 6.900000 2.500000
df.head()
SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species
0 5.1 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 4.7 3.2 1.3 0.2 Iris-setosa
3 4.6 3.1 1.5 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
df_imp = df.iloc[:, 0:4]   # keep only the four numeric feature columns for clustering
from sklearn.cluster import KMeans
# Elbow method: fit K-Means for k = 1..9 and record the SSE (inertia) for each k
k_meansclus = range(1, 10)
sse = []
for k in k_meansclus:
    km = KMeans(n_clusters=k)
    km.fit(df_imp)
    sse.append(km.inertia_)
plt.title('The Elbow Method')
plt.plot(k_meansclus, sse)
plt.xlabel('Number of clusters (k)')
plt.ylabel('SSE (inertia)')
plt.show()
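Reading the elbow off the SSE curve is a visual judgment; a complementary numeric check is the silhouette score, which tends to peak near a good k. A minimal sketch using sklearn.metrics.silhouette_score (illustrative only):
from sklearn.metrics import silhouette_score
# Silhouette score for k = 2..9 (the score needs at least 2 clusters); higher is better
for k in range(2, 10):
    labels = KMeans(n_clusters=k, random_state=0).fit_predict(df_imp)
    print(k, round(silhouette_score(df_imp, labels), 3))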
km1 = KMeans(n_clusters=3, max_iter=300, random_state=0)   # final model with k = 3 (one cluster per species)
y_means = km1.fit_predict(df_imp)   # fit_predict fits the model and returns each sample's cluster label
km1.cluster_centers_
array([[5.88360656, 2.74098361, 4.38852459, 1.43442623],
[5.006 , 3.418 , 1.464 , 0.244 ],
[6.85384615, 3.07692308, 5.71538462, 2.05384615]])
df_imp = np.array(df_imp)   # convert to a NumPy array so boolean masks can be used for column indexing
# Petal length (column 2) vs petal width (column 3), coloured by cluster;
# the species names in the legend follow from the cluster centres printed above
plt.scatter(df_imp[y_means == 0, 2], df_imp[y_means == 0, 3], color='g', label='Iris-versicolor')
plt.scatter(df_imp[y_means == 1, 2], df_imp[y_means == 1, 3], color='r', label='Iris-setosa')
plt.scatter(df_imp[y_means == 2, 2], df_imp[y_means == 2, 3], color='b', label='Iris-virginica')
plt.xlabel('PetalLengthCm')
plt.ylabel('PetalWidthCm')
plt.legend()
plt.show()
# Sepal length (column 0) vs sepal width (column 1), coloured by cluster
plt.scatter(df_imp[y_means == 0, 0], df_imp[y_means == 0, 1], color='g', label='Iris-versicolor')
plt.scatter(df_imp[y_means == 1, 0], df_imp[y_means == 1, 1], color='r', label='Iris-setosa')
plt.scatter(df_imp[y_means == 2, 0], df_imp[y_means == 2, 1], color='b', label='Iris-virginica')
plt.xlabel('SepalLengthCm')
plt.ylabel('SepalWidthCm')
plt.legend()
plt.show()
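The legend labels above assume a particular cluster-to-species correspondence inferred from the cluster centres; it can be checked by cross-tabulating the cluster assignments against the true Species column. A short sketch using pandas.crosstab:
# Compare K-Means cluster assignments with the actual species labels
print(pd.crosstab(df['Species'], y_means, rownames=['Species'], colnames=['Cluster']))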