Hierarchical Clustering in Machine Learning
Hierarchical clustering is another unsupervised machine learning algorithm, which is used to
group unlabeled datasets into clusters; it is also known as hierarchical cluster analysis (HCA).
In this algorithm, we develop the hierarchy of clusters in the form of a tree, and this tree-
shaped structure is known as the dendrogram.
Sometimes the results of K-means clustering and hierarchical clustering may look similar, but
the two differ in how they work: in hierarchical clustering there is no requirement to
predetermine the number of clusters, as there is in the K-means algorithm.
The hierarchical clustering technique has two approaches:
1. Agglomerative: Agglomerative is a bottom-up approach, in which the algorithm starts
by treating every data point as a single cluster and keeps merging the closest pairs until
only one cluster is left.
2. Divisive: The divisive algorithm is the reverse of the agglomerative algorithm, as it is a
top-down approach: it starts with all the data points in a single cluster and splits it
recursively.
Why hierarchical clustering?
As we already have other clustering algorithms such as K-means clustering, why do we need
hierarchical clustering? As we have seen, K-means clustering comes with some challenges: it
requires a predetermined number of clusters, and it tends to create clusters of roughly the
same size. To solve these two challenges, we can opt for the hierarchical clustering algorithm,
because it does not require any prior knowledge of the number of clusters.
Agglomerative Hierarchical Clustering Algorithm
The agglomerative hierarchical clustering algorithm is a popular example of HCA. To group the
data points into clusters, it follows the bottom-up approach. This means the algorithm treats
each data point as a single cluster at the beginning and then starts combining the closest pair
of clusters. It does this until all the clusters are merged into a single cluster that contains the
entire dataset.
This hierarchy of clusters is represented in the form of the dendrogram.
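As a concrete illustration, here is a minimal sketch of agglomerative clustering using scikit-learn's AgglomerativeClustering class; the toy data points and the choice of two clusters are assumptions made for the example.

import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Six toy points forming two well-separated groups (invented for illustration)
X = np.array([[1, 2], [1, 4], [1, 0],
              [10, 2], [10, 4], [10, 0]])

# Merge clusters bottom-up; stop when the requested number of clusters remains
model = AgglomerativeClustering(n_clusters=2, linkage='ward')
labels = model.fit_predict(X)
print(labels)   # e.g. [1 1 1 0 0 0] -- one label per data point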
How Does the Agglomerative Hierarchical Clustering Algorithm Work?
The working of the AHC algorithm can be explained using the following steps:
o Step-1: Treat each data point as a single cluster. Let's say there are N data points, so
the number of clusters will also be N.
o Step-2: Take the two closest data points or clusters and merge them to form one cluster.
There will now be N-1 clusters.
o Step-3: Again, take the two closest clusters and merge them together to form one
cluster. There will be N-2 clusters.
o Step-4: Repeat Step 3 until only one cluster is left.
o Step-5: Once all the clusters are combined into one big cluster, develop the
dendrogram and cut it to divide the clusters as per the problem.
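To make these steps concrete, here is a naive teaching sketch of the merge loop in plain NumPy. Single linkage is assumed for the cluster-to-cluster distance, and the toy coordinates are invented; real libraries implement this far more efficiently.

import numpy as np

def agglomerate(X):
    # Step-1: every data point starts as its own cluster (N clusters)
    clusters = [[i] for i in range(len(X))]
    # Step-4: repeat the merging until only one cluster is left
    while len(clusters) > 1:
        # Steps 2-3: find the two closest clusters (single linkage:
        # the smallest distance between any pair of their points)
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = min(np.linalg.norm(X[i] - X[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        print(f"merge {clusters[a]} + {clusters[b]} at distance {d:.2f}")
        clusters[a] += clusters.pop(b)   # merge b into a: one cluster fewer
    return clusters[0]

# Five toy points (invented); the printout records the dendrogram's merge order
agglomerate(np.array([[1.0, 1.0], [1.5, 1.0], [5.0, 5.0], [5.5, 5.0], [9.0, 1.0]]))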
Measures for the distance between two clusters
As we have seen, finding the closest pair of clusters is crucial for hierarchical clustering, and
it depends on how the distance between two clusters is measured. There are various ways to
calculate this distance, and each one defines a different rule for clustering. These measures
are called linkage methods. Some of the popular linkage methods are given below:
1. Single Linkage: It is the shortest distance between the closest points of the two clusters.
2. Complete Linkage: It is the farthest distance between two points of two different
clusters. It is one of the popular linkage methods, as it tends to form tighter clusters
than single linkage.
3. Average Linkage: It is the linkage method in which the distance between every pair of
points, one from each cluster, is added up and then divided by the total number of
pairs to calculate the average distance between two clusters. It is also one of the most
popular linkage methods.
4. Centroid Linkage: It is the linkage method in which the distance between the centroids
of the two clusters is calculated.
From the above approaches, we can apply any one of them according to the type of problem
or business requirement.
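The sketch below compares these four linkage methods with SciPy, whose linkage function accepts 'single', 'complete', 'average', and 'centroid' as method names; the toy data points and the choice of cutting into two clusters are assumptions made for the example.

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Five toy points (invented for illustration)
X = np.array([[1, 1], [2, 1], [8, 8], [9, 8], [5, 0]])

for method in ['single', 'complete', 'average', 'centroid']:
    Z = linkage(X, method=method)                     # (N-1) x 4 merge table
    labels = fcluster(Z, t=2, criterion='maxclust')   # cut the tree into 2 clusters
    print(method, labels)                             # one label per data point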
Working of the Dendrogram in Hierarchical Clustering
The dendrogram is a tree-like structure that records each merge step the HC algorithm
performs. In the dendrogram plot, the y-axis shows the Euclidean distance at which clusters
are merged, and the x-axis shows all the data points of the given dataset.
The working of the dendrogram can be explained with a small example of six data points, P1
to P6: as agglomerative clustering merges the clusters step by step, the corresponding
dendrogram grows link by link.
o As we have discussed above, the data points P2 and P3 combine first and form a cluster;
correspondingly, a dendrogram link is created, which connects P2 and P3 with a
rectangular shape. The height of the link is decided by the Euclidean distance between
the data points.
o In the next step, P5 and P6 form a cluster, and the corresponding dendrogram link is
created. It is higher than the previous one, as the Euclidean distance between P5 and
P6 is a little greater than that between P2 and P3.
o Again, two new links are created: one combines P1 with the P2-P3 cluster, and the
other combines P4 with the P5-P6 cluster.
o At last, the final dendrogram is created that combines all the data points together.
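The sketch below reproduces this kind of picture with SciPy and Matplotlib for six points labelled P1 to P6; the coordinates are invented so that P2/P3 and P5/P6 are the closest pairs, as in the description above.

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

# Invented coordinates: P2/P3 merge first, then P5/P6 at a slightly larger distance
X = np.array([[1.0, 1.0],    # P1
              [2.0, 1.0],    # P2
              [2.2, 1.2],    # P3
              [8.0, 8.0],    # P4
              [9.0, 8.0],    # P5
              [9.1, 8.4]])   # P6

Z = linkage(X, method='single')
dendrogram(Z, labels=['P1', 'P2', 'P3', 'P4', 'P5', 'P6'])
plt.ylabel('Euclidean distance')   # the height of each link is the merge distance
plt.show()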