Lecture #7
Unsupervised Learning
(Clustering)
What is Cluster Analysis?
Finding groups of objects such that the objects in a group
will be similar (or related) to one another and different
from (or unrelated to) the objects in other groups
– Intra-cluster distances are minimized
– Inter-cluster distances are maximized
Applications of Cluster Analysis
Understanding
– Group students who succeed and fail in the same exercises
Summarization
– Reduce the size of large data sets (e.g., clustering precipitation in Australia)
What is not Cluster Analysis?
Supervised classification
– Uses class label information
Simple segmentation
– Dividing students into different registration groups
alphabetically, by last name
Types of Clusterings
A clustering is a set of clusters
Important distinction between hierarchical and
partitional sets of clusters
Partitional Clustering
– A division of data objects into non-overlapping subsets
(clusters) such that each data object is in exactly
one subset
Hierarchical clustering
– A set of nested clusters organized as a hierarchical
tree
K-means Clustering
Partitional clustering approach
Each cluster is associated with a centroid (center
point)
Each point is assigned to the cluster with the closest
centroid
Number of clusters, K, must be specified
The basic algorithm is very simple
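The basic algorithm can be sketched in a few lines (a minimal illustration in Python/NumPy; the function and variable names are my own, not from the lecture, and an empty cluster simply keeps its previous centre):

```python
import numpy as np

def kmeans(points, k, n_iter=100, seed=0):
    """Minimal K-means sketch: assign points to the nearest centroid, recompute."""
    rng = np.random.default_rng(seed)
    # 1. Select K data points at random as the initial centroids.
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # 2. Assign each point to the cluster with the closest centroid.
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3. Recompute each centroid as the mean of the points assigned to it
        #    (an empty cluster keeps its previous centroid).
        new_centroids = centroids.copy()
        for i in range(k):
            if (labels == i).any():
                new_centroids[i] = points[labels == i].mean(axis=0)
        # 4. Stop when the centroids no longer move.
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids
```

Steps 2–4 repeat until the assignments stabilize; only K and a distance measure (Euclidean here) need to be chosen in advance.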
K-means Clustering: Example
[Figure sequence: original points → original points with initial centres → clusters after iteration 1 → new centres recomputed → clusters and new centres after iterations 2 and 3 → final clusters and centres]
Two different K-means Clusterings
[Figure: the same original points clustered two ways; the left panel shows an optimal clustering, the right panel a sub-optimal clustering (axes x, y)]
Property of K-means
The Sum of Squared Error (SSE) decreases (or stays the same) after each iteration.
The final SSE is not necessarily the global optimum.

SSE = \sum_{i=1}^{K} \sum_{x \in C_i} \mathrm{dist}^2(m_i, x)

where m_i is the centroid of cluster C_i.
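This property is easy to check empirically (a sketch assuming NumPy; the helper names are illustrative, not from the lecture): record the SSE at every assignment step and verify that the sequence never increases.

```python
import numpy as np

def sse(points, labels, centroids):
    # SSE = sum over clusters i of sum over x in C_i of dist(m_i, x)^2
    diffs = points - centroids[labels]
    return float((diffs ** 2).sum())

def kmeans_sse_trace(points, k, n_iter=20, seed=0):
    """Run K-means and record the SSE at every iteration."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    trace = []
    for _ in range(n_iter):
        # Assignment step: nearest centroid per point.
        d = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        trace.append(sse(points, labels, centroids))
        # Update step: move each centroid to the mean of its assigned points
        # (empty clusters keep their centroid).
        for i in range(k):
            if (labels == i).any():
                centroids[i] = points[labels == i].mean(axis=0)
    return trace
```

Each assignment step can only lower (or keep) the SSE for the current centroids, and each centroid update minimizes the SSE for the current assignment, so the trace is non-increasing; it can still settle on a local minimum.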
Advantages of K-means
– Efficient
– Can be computed in a distributed way
– Easy to apply
Limitations of K-means
– How to determine the best K?
– May give a sub-optimal solution
K-means has problems when clusters have differing
– Sizes
– Densities
– Non-globular shapes
K-means is sensitive to outliers.
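A common mitigation for the sub-optimal-solution limitation (not covered in the slides; sketched here with an illustrative, self-contained mini K-means) is to restart the algorithm from several random initial centres and keep the run with the lowest SSE:

```python
import numpy as np

def one_kmeans(points, k, seed, n_iter=50):
    """One illustrative K-means run; returns (SSE, labels, centroids)."""
    rng = np.random.default_rng(seed)
    c = points[rng.choice(len(points), size=k, replace=False)].copy()
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        labels = np.linalg.norm(points[:, None, :] - c[None, :, :],
                                axis=2).argmin(axis=1)
        for i in range(k):
            if (labels == i).any():
                c[i] = points[labels == i].mean(axis=0)
    return float(((points - c[labels]) ** 2).sum()), labels, c

def best_of(points, k, restarts=10):
    """Keep the restart with the lowest SSE to reduce the risk of a bad local minimum."""
    return min((one_kmeans(points, k, seed=s) for s in range(restarts)),
               key=lambda run: run[0])
```

Restarts address only the initialization problem; differing sizes, densities, non-globular shapes, and outliers still require other methods (e.g., density-based clustering).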