Machine Learning

Clustering is an unsupervised machine learning technique used to group data points based on similarities, with applications in customer segmentation, image segmentation, and anomaly detection. The K-Means algorithm is a popular method within centroid-based clustering, which involves selecting initial centroids, assigning data points to clusters, and iterating until convergence. While K-Means is easy to implement and efficient, it requires prior specification of the number of clusters and struggles with outliers and large datasets.

Uploaded by

Angelo Vita


Clustering

Clustering is an unsupervised machine learning technique that groups data points
based on similarities or patterns. It can be used in data mining during the initial data exploration stage
as well as in the data processing stage.

Applications of Clustering

Some of the most common applications of clustering include:

• Customer Segmentation
• Image Segmentation
• Anomaly Detection
• Document Processing and Classification

Categories of Clustering Techniques

The literature does not agree on a fixed number of categories of clustering methods or techniques.

The most common categories include:

• Centroid-Based Clustering
o K-Means Clustering
• Hierarchical Clustering
o Agglomerative Clustering
o Divisive Clustering
• Density-Based Clustering
o DBSCAN

K-Means Clustering

The K-Means clustering algorithm is one of the most popular centroid-based clustering
algorithms. The goal of K-Means clustering is to partition the dataset into K clusters so that the
total squared distance between each data point and the centroid of the cluster it is assigned to is
as small as possible.
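
One common way to write this goal, assuming Euclidean distance, is the within-cluster sum of squares. The symbols below are not in the original notes and are introduced only for illustration: $C_k$ is the set of points assigned to cluster $k$, $\mu_k$ is its centroid, and $J$ is the quantity K-Means tries to make small.

```latex
J = \sum_{k=1}^{K} \sum_{x_i \in C_k} \lVert x_i - \mu_k \rVert^2,
\qquad
\mu_k = \frac{1}{|C_k|} \sum_{x_i \in C_k} x_i
```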

The K-Means Clustering Algorithm

1. Randomly select K points from the dataset to act as the initial cluster centroids.

2. For each data point in the dataset, calculate the distance between that point and each of the
K centroids.

3. Assign the data point to the cluster whose centroid is closest to it.

4. After the data points have been assigned to clusters, recalculate the centroids of the clusters
by taking the mean (average) of all data points assigned to each cluster.

5. Repeat steps 2 to 4 until the centroids no longer change significantly or a specified
number of iterations is reached.

6. Once convergence is achieved, the algorithm outputs the final cluster centroids and the
assignment of each data point to a cluster.
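
The steps above can be sketched in plain Python as follows. This is a minimal illustration, not a reference implementation: the function name `kmeans`, its parameters (`max_iters`, `tol`, `seed`), the use of Euclidean distance, and the rule that an empty cluster keeps its previous centroid are all assumptions made for the example.

```python
import math
import random


def kmeans(points, k, max_iters=100, tol=1e-6, seed=0):
    """Cluster `points` (a list of equal-length tuples) into k groups."""
    rng = random.Random(seed)
    # Step 1: pick K distinct data points as the initial centroids.
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(max_iters):
        # Steps 2-3: assign each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Step 4: recompute each centroid as the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                mean = tuple(sum(c) / len(members) for c in zip(*members))
                new_centroids.append(mean)
            else:
                # Assumption: an empty cluster keeps its previous centroid.
                new_centroids.append(centroids[j])
        # Step 5: stop once no centroid moved by more than `tol`.
        shift = max(math.dist(a, b) for a, b in zip(centroids, new_centroids))
        centroids = new_centroids
        if shift <= tol:
            break
    # Step 6: return the final centroids and the cluster of each point.
    return centroids, labels


# Two visibly separate groups of made-up 2-D points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5),
          (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, labels = kmeans(points, k=2)
print(centroids)
print(labels)
```

Which group is numbered 0 and which is numbered 1 depends on the random initial centroids, so only the grouping itself is meaningful.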

You are granted access to this material for your personal use only. Unauthorized
distribution, reproduction, modification, transmission, or exploitation of this material in any way
without the written permission of the author is strictly prohibited. – Harold L. Costales, April 2024.

Illustrative Example: Not yet available

Elbow Method

The elbow method is a graphical method for choosing the K value in the K-Means clustering
algorithm. The elbow graph shows the within-cluster sum of squares (WCSS) on the y-axis
against the different values of K on the x-axis. WCSS generally falls as K grows, so the suggested K
is the point at which the graph forms an elbow, that is, where adding another cluster no longer
reduces WCSS by much.
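
The WCSS values behind an elbow graph can be computed with a sketch like the one below. It assumes Euclidean distance and a basic K-Means written inline so the example is self-contained. The helper names `kmeans_wcss` and `elbow_curve`, the `n_init` restarts, and the sample points are hypothetical choices for illustration. Plotting is left out; the printed values would go on the y-axis of the elbow graph.

```python
import math
import random


def kmeans_wcss(points, k, seed, max_iters=100):
    """Run a basic K-Means once and return its WCSS."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # K data points as initial centroids
    for _ in range(max_iters):
        # Assign every point to its nearest centroid.
        labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                  for p in points]
        # Recompute each centroid as the mean of its members.
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            updated.append(
                tuple(sum(c) / len(members) for c in zip(*members))
                if members else centroids[j]  # empty cluster: keep old centroid
            )
        converged = updated == centroids
        centroids = updated
        if converged:
            break
    # WCSS: squared distance from each point to its own cluster centroid.
    return sum(math.dist(p, centroids[lab]) ** 2
               for p, lab in zip(points, labels))


def elbow_curve(points, k_values, n_init=5):
    """Return {K: best WCSS over n_init random restarts}."""
    return {k: min(kmeans_wcss(points, k, seed) for seed in range(n_init))
            for k in k_values}


# Made-up data: three tight groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10),
          (20, 0), (20, 1), (21, 0)]
curve = elbow_curve(points, range(1, 6))
for k, wcss in curve.items():
    print(f"K={k}: WCSS={wcss:.2f}")
```

For data like this, one would expect the WCSS to drop steeply up to K = 3 and to change little afterwards, which is the elbow the method looks for.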

Silhouette Score

The silhouette score and plot are used to evaluate the quality of a clustering solution, such as one
produced by the K-Means algorithm. The silhouette score measures how similar each point is to its own
cluster compared to other clusters and ranges from -1 to 1; the silhouette plot visualizes these scores
for each sample. A score close to 1 indicates that the clusters are well separated and that each sample
is more similar to the samples in its own cluster than to samples in other clusters. A score close to 0
suggests overlapping clusters, and a negative score suggests that samples may have been assigned to the
wrong cluster.
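
The score can be computed by hand with a short sketch like this one. It uses the usual per-point definition s = (b - a) / max(a, b), where a is the mean distance to the other points in the same cluster and b is the smallest mean distance to the points of any other cluster. Euclidean distance, the function name `silhouette_score`, the convention that single-point clusters score 0, and the sample data are assumptions made for the example.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (Euclidean distance)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # assumed convention: singletons score 0
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the point's distance to itself is 0, so dividing by len - 1 works).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items() if other_lab != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Four made-up 1-D points forming two obvious pairs.
points = [(0.0,), (1.0,), (10.0,), (11.0,)]
good = silhouette_score(points, [0, 0, 1, 1])  # pairs kept together
bad = silhouette_score(points, [0, 1, 0, 1])   # pairs split apart
print(f"good grouping: {good:.3f}")
print(f"bad grouping:  {bad:.3f}")
```

The sensible grouping scores close to 1, while the grouping that splits each pair gets a negative score, matching the interpretation given above.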

Advantages

• This algorithm is very easy to understand and implement.

• This algorithm is efficient, robust, and flexible.
• It gives the best results when the clusters in the data are distinct and roughly spherical.

Disadvantages

• This algorithm needs the number of clusters, that is, the value of K, to be specified in
advance.
• It is sensitive to outliers and noisy data, which pull the centroids away from the bulk of their clusters.
• It can be slow on very large datasets, because every iteration computes the distance from each data point to each of the K centroids.
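
The effect of an outlier on a centroid can be seen with a small sketch. The points and the helper name `centroid` are made up for illustration; the helper computes the mean of the points, which is how K-Means updates a centroid.

```python
def centroid(points):
    """Mean of a list of equal-length tuples, coordinate by coordinate."""
    return tuple(sum(c) / len(points) for c in zip(*points))


# A tight made-up cluster and one far-away outlier.
cluster = [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0), (2.0, 2.0)]
outlier = (50.0, 50.0)

clean = centroid(cluster)
with_outlier = centroid(cluster + [outlier])
print("without outlier:", clean)
print("with outlier:   ", with_outlier)
```

A single distant point moves the centroid from the middle of the cluster to a location where none of the original points lie, which is why outliers are often removed or handled separately before running K-Means.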

