KEMBAR78
Lec 06 Clustering | PDF | Cluster Analysis | Statistical Data Types
0% found this document useful (0 votes)
71 views44 pages

Lec 06 Clustering

Clustering is the process of grouping similar data points together. There are two main types: hierarchical clustering, which creates nested groups, and partitional clustering like K-means, which divides data into a set number of groups. K-means clustering works by randomly assigning data to K clusters and then iteratively updating cluster means until the clusters are stable or a certain number of iterations is reached. It uses distance measures like Euclidean distance to assign data to the closest cluster mean.

Uploaded by

Zarfa Masood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
71 views44 pages

Lec 06 Clustering

Clustering is the process of grouping similar data points together. There are two main types: hierarchical clustering, which creates nested groups, and partitional clustering like K-means, which divides data into a set number of groups. K-means clustering works by randomly assigning data to K clusters and then iteratively updating cluster means until the clusters are stable or a certain number of iterations is reached. It uses distance measures like Euclidean distance to assign data to the closest cluster mean.

Uploaded by

Zarfa Masood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 44

11/30/2022

Clustering
Dr. Muhammad Shoaib Farooq
11/30/2022
11/30/2022

What is Clustering?
Clustering is the task of dividing the population or data points into a number of
groups such that data points in the same groups are more similar to other data points
in the same group and dissimilar to the data points in other groups. It is basically a
collection of objects on the basis of similarity and dissimilarity between them.
11/30/2022

Types of Clustering
11/30/2022

Hierarchical Clustering
11/30/2022

Hierarchical Clustering (Agglomerative)?


11/30/2022

Hierarchical Clustering (Divisive)?


11/30/2022

Partitional Clustering (K-Means)?


11/30/2022

Partitional Clustering (Fuzzy C-Means)?


11/30/2022

K-Means Clustering?
The K-Means clustering categorize the items into k groups of similarity.

The algorithm works as follows:

1- Initialize k points, called means, randomly.


2- Categorize each item to its closest mean and we update the mean’s
coordinates, which are the averages of the items categorized in that
mean so far.
3- Repeat the process for a given number of iterations and at the end, we
have our clusters.
11/30/2022
11/30/2022
11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022

K-Means Clustering (Example)


11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022
11/30/2022

K-Means Clustering (Distance Measure)


11/30/2022

K-Means Clustering (Distance Measure)


11/30/2022

K-Means Clustering (Distance Measure)


11/30/2022

K-Means Clustering (Distance Measure)


11/30/2022

K-Means Clustering (Distance Measure)

You might also like