UNIT-4: Unsupervised Learning
Introduction to Clustering
Unsupervised vs Supervised Learning
Different types of clustering techniques:
Partitioning Methods
Hierarchical Methods
Density-Based Methods
Grid-Based Methods
Model-Based Clustering Methods
K-Means clustering
Apriori algorithm and Association Rules
Hierarchical clustering, K-Medoids
Density-Based Methods: DBSCAN
Overview of Clustering
Introduction to Clustering
Definition 1:
Clustering is the task of dividing the population or data points into a number of groups such that data points in the same group are more similar to one another and dissimilar to the data points in other groups.
It is basically a grouping of objects on the basis of the similarity and dissimilarity between them.
Definition 2:
"A way of grouping the data points into different clusters, consisting of similar data points. The objects with possible similarities remain in a group that has few or no similarities with another group."
What is clustering?
Grouping unlabeled examples is called clustering.
Because the examples are unlabeled, clustering relies on unsupervised machine learning.
If the examples were labeled, the grouping task would be classification instead.
Figure 1: Unlabeled examples grouped into three clusters.
Examples
Differentiate between Classification and Clustering
What are the Uses of Clustering?
Common applications for clustering include the following:
market segmentation
social network analysis
search result grouping
medical imaging
image segmentation
anomaly detection
Examples of clustering (grouping):
Group stars by brightness.
Group organisms by genetic information into a taxonomy.
Group documents by topic.
Types of Clustering Methods
The clustering methods are broadly divided into Hard clustering (data point belongs to only one
group) and Soft Clustering (data points can belong to another group also).
Partitioning Clustering
Hierarchical Clustering
Density-Based Clustering
Model-Based Clustering
Grid-Based Methods
Partitioning Clustering
It is a type of clustering that divides the data into non-hierarchical groups.
It is also known as the centroid-based method.
The most common example of partitioning clustering is the K-Means Clustering algorithm.
In this type, the dataset is divided into a set of K groups, where K denotes the number of pre-defined groups.
The cluster centers are chosen in such a way that each data point is closer to the centroid of its own cluster than to the centroid of any other cluster.
Working of K-Means Algorithm:
The following stages will help us understand how the K-Means clustering technique works:
Step 1: First, we need to provide the number of clusters, K.
Step 2: Next, choose K data points at random and assign each data point to a cluster.
Step 3: The cluster centroids will now be computed.
Step 4: Iterate the steps below until the ideal centroids are found, that is, until the assignment of data points to clusters no longer changes:
4.1 First, calculate the sum of squared distances between the data points and the centroids.
4.2 Assign each data point to the cluster whose centroid is closest to it.
4.3 Finally, compute the centroid of each cluster by averaging all of that cluster's data points.
The working of the K-Means algorithm is given below:
Step-1: Select the number K to decide the number of clusters.
Step-2: Select K random points as centroids (these can be points other than those from the input dataset).
Step-3: Assign each data point to its closest centroid, which will form the predefined K clusters.
Step-4: Calculate the distance between each data point and each centroid, allocate every data point to the cluster with the closest centroid, and then place a new centroid for each cluster.
Step-5: Repeat the third step, i.e., reassign each data point to the new closest centroid of its cluster.
Step-6: If any reassignment occurs, then go to step-4 else go
to FINISH.
Step-7: The model is ready.
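To make these steps concrete, below is a minimal K-Means sketch in Python/NumPy; it is only an illustrative implementation, not the only way to code the algorithm. The dataset X, the function name kmeans, and the choice K=2 are placeholders chosen just for this example.

import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 2: pick K data points at random as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Steps 4.1/4.2: distance of every point to every centroid, then assign to the closest one
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 4.3: recompute each centroid as the mean of its assigned points
        new_centroids = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                  else centroids[j] for j in range(k)])
        # Stop when the centroids (and hence the assignments) no longer change
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

# Illustrative usage on a tiny 2-D dataset with K=2
X = np.array([[1.0, 1.0], [1.5, 2.0], [1.0, 0.5],
              [8.0, 8.0], [9.0, 9.5], [8.5, 9.0]])
labels, centroids = kmeans(X, k=2)
print(labels)      # cluster index of each point
print(centroids)   # final cluster centers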
Let's understand the above steps by considering the visual plots:
Step-1: Let us pick the number of clusters, K=2, so the data points will be grouped into two different clusters.
Step-2: We need to select K random data points or centroids to form the clusters.
These points can be either points from the dataset or any other points.
So, here we are selecting the two points below as the K points; they are not part of our dataset.
Consider the image below:
Now we will assign each data point of the scatter plot to its closest K-point or centroid.
We will compute it by applying some mathematics that we have studied to calculate the distance
between two points.
So, we will draw a median line between the two centroids (the perpendicular bisector of the segment joining them).
Consider the below image:
From the above image, it is clear that the points on the left side of the line are nearer to the blue centroid (K1), and the points on the right side of the line are closer to the yellow centroid.
Let's color them blue and yellow for clear visualization.
The cluster on the left thus has the blue centroid, whereas the cluster on the right has the yellow centroid.
We now repeat the procedure by selecting new centroids.
To choose the new centroids, we compute the center of gravity (mean) of the points in each cluster, and we find the new centroids as shown below:
Next, we will reassign each data point to the new closest centroid. For this, we repeat the same process of finding a median line. The median line will look like the image below:
From the above image, we can see that one yellow point is on the left side of the line and two blue points are to the right of the line, so these three points will be reassigned to new clusters.
As reassignment has taken place, we again go to step-4, which is finding new centroids or K-points.
We repeat the process by finding the center of gravity of each cluster, so the new centroids will be as shown in the image below:
As we have the new centroids, we again draw the median line and reassign the data points. The result is shown below:
We can see in the above image that there are no mismatched data points on either side of the line, so no further reassignment is needed and our model is formed. Consider the image below:
As our model is ready, we can now remove the assumed centroids, and the two final clusters are as shown in the figure below:
How to choose the value of "K number of clusters" in K-means Clustering?
Choosing the optimal number of clusters is a big task.
There are many different ways to find the optimal number of clusters, but here we are discussing the
most appropriate method to find the number of clusters or value of K.
The method is Elbow Method:
The Elbow method is one of the most popular ways to find the optimal number of clusters.
This method uses the concept of WCSS value.
WCSS stands for Within Cluster Sum of Squares, which defines the total variations within a cluster.
The formula to calculate the value of WCSS (for 3 clusters) is given below:
WCSS = Σ(Pi in Cluster1) distance(Pi, C1)² + Σ(Pi in Cluster2) distance(Pi, C2)² + Σ(Pi in Cluster3) distance(Pi, C3)²
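As an illustration of the Elbow method, the sketch below computes the WCSS for K = 1 to 10, assuming scikit-learn and matplotlib are installed; scikit-learn exposes the WCSS of a fitted K-Means model as the inertia_ attribute, and the random data used here is only a placeholder.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

X = np.random.RandomState(0).rand(200, 2)     # placeholder data for illustration

wcss = []
k_values = range(1, 11)
for k in k_values:
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    wcss.append(km.inertia_)                   # within-cluster sum of squares for this K

plt.plot(list(k_values), wcss, marker="o")
plt.xlabel("Number of clusters K")
plt.ylabel("WCSS")
plt.title("Elbow method: look for the bend (elbow) in the curve")
plt.show()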
Association rule
Association rule learning is a kind of unsupervised learning technique that checks for the dependency of one data item on another data item and maps them accordingly, so that the result can be used more profitably.
Association rule learning is an important approach in machine learning, and it is employed in market basket analysis, web usage mining, continuous production, etc. In market basket analysis, it is used by several big retailers to find the relations between items.
Association rule learning can be divided into three types of algorithms:
1. Apriori algorithm
2. Eclat algorithm
3. F-P Growth algorithm
Associations: Market Basket Analysis
How does Association Rule Learning work?
Association rule learning works on the concept of if-then rules, such as: if A, then B.
To measure the associations between thousands of data items, there are several metrics.
These metrics are given below:
Support
Confidence
Lift
Learning Associations:
Support Count (σ): the frequency of occurrence of an itemset. Here σ({Milk, Diaper, Beer}) = 2 and σ({Bread, Diaper, Beer}) = 2.
Association Rule: an implication expression of the form X -> Y, where X and Y are any two itemsets.
Example: {Milk, Diaper} -> {Beer}
Definition of Support:
Support is how frequently an itemset appears in the dataset: the support count of the itemset divided by the total number of transactions.
From the above table:
Support(s) = σ({Milk, Diaper, Beer}) / |T| = 2/5 = 0.4
where |T| is the total number of transactions.
Definition of Confidence:
Confidence indicates how often the items X and Y occur together in the dataset, given that X already occurs.
It is the ratio of the number of transactions that contain both X and Y to the number of transactions that contain X.
For the example rule {Milk, Diaper} -> {Beer}:
Confidence(c) = σ({Milk, Diaper, Beer}) / σ({Milk, Diaper}) = 2/3 = 0.67
Definition of Lift (l): The lift of the rule X -> Y is the confidence of the rule divided by the expected confidence, where the expected confidence is the support (frequency) of {Y}, i.e., the confidence we would expect if the itemsets X and Y were independent of each other.
If Lift(l) = 1: X and Y appear together about as often as expected,
If Lift(l) > 1: they appear together more often than expected, and
If Lift(l) < 1: they appear together less often than expected. Greater lift values indicate a stronger association.
Lift(l) = Support(X, Y) / (Support(X) * Support(Y))
l = Supp({Milk, Diaper, Beer}) / (Supp({Milk, Diaper}) * Supp({Beer}))
= 0.4 / (0.6 * 0.6)
= 1.11
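To tie the three metrics together, here is a small Python sketch that reproduces the numbers above for the rule {Milk, Diaper} -> {Beer}. The five transactions are an assumed, illustrative basket chosen to be consistent with the counts quoted in the text (the original transaction table is not reproduced here), and the helper functions support, confidence, and lift are written only for this example.

# Assumed illustrative transactions, consistent with the counts quoted above
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]

def support(itemset):
    # fraction of transactions containing every item of the itemset
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(X, Y):
    # support of X and Y together, divided by the support of X
    return support(X | Y) / support(X)

def lift(X, Y):
    # confidence of X -> Y divided by the support (expected confidence) of Y
    return support(X | Y) / (support(X) * support(Y))

X, Y = {"Milk", "Diaper"}, {"Beer"}
print(round(support(X | Y), 2))    # 0.4
print(round(confidence(X, Y), 2))  # 0.67
print(round(lift(X, Y), 2))        # 1.11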
Applications of Association Rule Learning
Below are some popular applications of association rule learning:
Market Basket Analysis: This is one of the most popular examples and applications of association rule mining. The technique is commonly used by big retailers to determine the associations between items; by discovering such associations, retailers can design marketing strategies based on which items are frequently purchased together by customers.
Medical Diagnosis: With the help of association rules, patients can be treated more effectively, as the rules help in identifying the probability of illness for a particular disease.
Protein Sequences: Association rules help in determining the synthesis of artificial proteins.
Web Usage Mining: Web usage mining is the extraction of various kinds of interesting patterns from the vast collection of web pages and usage data available on the Internet.
Hierarchical clustering
The hierarchical clustering methods are used to group the data into a hierarchy or tree-like structure.
For example, in a machine learning problem of organizing employees of a university in different
departments, first the employees are grouped under the different departments in the university, and
then within each department, the employees can be grouped according to their roles such as
professors, assistant professors, supervisors, lab assistants, etc.
This creates a hierarchical structure of the employee data and eases visualization and analysis.
Similarly, there may be a data set with an underlying hierarchical structure that we want to discover, and we can use the hierarchical clustering methods to achieve that.
Types of hierarchical clustering
There are two main hierarchical clustering methods: Agglomerative clustering and Divisive
clustering.
Agglomerative clustering: is a bottom-up technique which starts with individual objects as clusters
and then iteratively merges them to form larger clusters.
Divisive clustering: starts with one cluster with all given objects and then splits it iteratively to form
smaller clusters.
Agglomerative Clustering:
It uses a bottom-up approach.
It starts with each object forming its own cluster and then iteratively merges the clusters according to
their similarity to form large clusters.
It terminates either when a certain clustering condition imposed by the user is achieved or when all clusters merge into a single cluster.
Divisive Clustering:
Divisive clustering is the opposite of Agglomerative clustering.
It uses the top-down approach.
The starting point is a single cluster containing all the objects, which is then split recursively to form smaller and smaller clusters.
It terminates when a user-defined condition is achieved or when the final clusters contain only one object each.
A dendrogram, which is a tree-like structure, is used to represent hierarchical clustering.
Individual objects are represented by leaf nodes, the clusters are represented by internal nodes, and the root node represents the single cluster containing all objects. A representation of a dendrogram is shown in this figure:
Hierarchical Clustering
Produce a nested sequence of clusters.
One approach: recursive application of a partitional clustering algorithm.
Types of hierarchical clustering
Agglomerative (bottom-up) clustering: builds the dendrogram (tree) from the bottom level by repeatedly merging the most similar (or nearest) pair of clusters; it stops when all the data points are merged into a single cluster (i.e., the root cluster).
Divisive (top-down) clustering: starts with all data points in one cluster, the root, and splits the root into a set of child clusters; each child cluster is recursively divided further, stopping when only singleton clusters of individual data points remain, i.e., each cluster contains only a single point.
Dendrogram: Hierarchical Clustering
A dendrogram is built over an input set S; its nodes represent subsets of S.
Features of the tree:
The root is the whole input set S.
The leaves are the individual elements of S.
The internal nodes are defined as the union of their children.
Each level of the tree represents a partition of the input data into several (nested) clusters or groups.
The tree may be cut at any level: each connected component then forms a cluster.
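As an illustration of how such a dendrogram can be produced, here is a minimal sketch assuming SciPy and matplotlib are installed; the six 2-D sample points are placeholders chosen only for this example.

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

# Placeholder 2-D points chosen only for illustration
X = np.array([[1.0, 2.0], [1.5, 1.8], [1.0, 0.6],
              [5.0, 8.0], [8.0, 8.0], [9.0, 11.0]])

# Agglomerative (bottom-up) clustering; each row of Z records one merge
Z = linkage(X, method="average", metric="euclidean")

# Leaves are the individual points, internal nodes are the merged clusters;
# cutting the tree at any height yields a flat clustering
dendrogram(Z)
plt.ylabel("Merge distance")
plt.show()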
Hierarchical clustering
Initialization: each individual point is taken as a cluster, and the distance/proximity matrix is constructed.
Intermediate state: after some merging steps, we have a number of clusters (for example C1, C2, C3, C4, C5) together with the current distance/proximity matrix between them.
Merge step: merge the two closest clusters (say C2 and C5) into a new cluster C2 U C5 and update the distance matrix.
After merging: the distances between the new cluster C2 U C5 and the remaining clusters (C1, C3, C4) are recomputed in the distance matrix, then the next closest pair is found and the process repeats.
There are a few ways to measure the distance between two clusters:
Single-link: the similarity of the most similar pair of points, one from each cluster.
Complete-link: the similarity of the least similar pair of points.
Centroid: the distance between the clusters' centroids (centers of gravity).
Average-link: the average similarity (for example, average distance or average cosine) over all pairs of elements, one from each cluster.
Single-Link Example
Single-link clustering can result in straggly (long and thin) clusters due to the chaining effect.
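A short sketch of how these four criteria could be selected with SciPy's linkage function (the random data array X is only a placeholder):

import numpy as np
from scipy.cluster.hierarchy import linkage

X = np.random.default_rng(0).random((20, 2))   # placeholder data

Z_single   = linkage(X, method="single")       # distance of the closest pair (can chain)
Z_complete = linkage(X, method="complete")     # distance of the farthest pair
Z_centroid = linkage(X, method="centroid")     # distance between cluster centroids
Z_average  = linkage(X, method="average")      # average distance over all cross-cluster pairs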
Agglomerative Clustering Example
Consider the points A(1, 1), B(2, 3), C(3, 5), D(4, 5), E(6, 6), and F(7, 5) and try to cluster them.
To perform clustering, we first create a distance matrix consisting of the distance between each pair of points in the dataset. The distance matrix looks as follows.
Consider the following sample data (the numeric walkthrough below uses a sample set of six points p1, ..., p6):
The distance matrix is:
For convenience, we consider only the lower triangular part of the matrix, as shown below, since the distance matrix is symmetric and its diagonal entries are zero.
Step 1: Calculate the Euclidean distances and create the distance matrix.
Step 2: Find the minimum-value entry in the distance matrix.
The minimum entry is at (p3, p6) with value 0.11, so these two points are merged into the cluster (p3, p6).
Step 3: Recalculate (update) the distance matrix for the cluster (p3, p6) using the single-link rule. The same formula is applied for p2, p4, and p5:
dist((p3, p6), p2) = min[dist(p3, p2), dist(p6, p2)] = min(0.15, 0.25) = 0.15
dist((p3, p6), p4) = min[dist(p3, p4), dist(p6, p4)] = min(0.15, 0.22) = 0.15
dist((p3, p6), p5) = min[dist(p3, p5), dist(p6, p5)] = min(0.28, 0.39) = 0.28
Updated distance matrix:
Step 4: Repeat steps 2 & 3.
The minimum entry is at (p2, p5) with value 0.14, so these two points are merged into the cluster (p2, p5).
Recalculate (update) the distance matrix for the cluster (p2, p5). The same formula is applied for (p3, p6) and p4:
dist((p2, p5), (p3, p6)) = min[dist(p2, (p3, p6)), dist(p5, (p3, p6))] = min(0.15, 0.28) = 0.15
dist((p2, p5), p4) = min[dist(p2, p4), dist(p5, p4)] = min(0.20, 0.29) = 0.20
Updated distance matrix:
Step 5: Repeat steps 2 & 3.
The minimum value is 0.15, between (p2, p5) and (p3, p6).
(Two entries are equal to this minimum; in that case the first one is chosen.) Merging them gives the cluster (p2, p5, p3, p6).
Recalculate (update) the distance matrix for the cluster (p2, p5, p3, p6):
dist((p2, p5, p3, p6), p1) = min[dist((p2, p5), p1), dist((p3, p6), p1)] = min(0.23, 0.22) = 0.22
The same formula is applied for p4:
dist((p2, p5, p3, p6), p4) = min[dist((p2, p5), p4), dist((p3, p6), p4)] = min(0.20, 0.15) = 0.15
Updated distance matrix:
Step 6: Repeat steps 2 & 3.
The minimum value is 0.15, between (p2, p5, p3, p6) and p4, so p4 is merged in, giving our 4th cluster (p2, p5, p3, p6, p4).
Recalculate (update) the distance matrix for the cluster (p2, p5, p3, p6, p4).
Updated distance matrix:
Step 7: Repeat steps 2 & 3.
The minimum value is 0.22, between (p2, p5, p3, p6, p4) and p1, giving our 5th cluster (p2, p5, p3, p6, p4, p1).
Only one entry remains in the distance matrix at this step, so this final merge happens by default and all points now belong to a single cluster.
Step 8: Drawing the dendrogram.
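The numeric walkthrough above uses the sample distance matrix for p1, ..., p6, which is not reproduced here. As an illustration, the sketch below shows how the same single-link procedure and the final dendrogram could be produced with SciPy for the coordinate points A(1, 1), ..., F(7, 5) listed at the start of this example (assuming SciPy and matplotlib are installed).

import numpy as np
import matplotlib.pyplot as plt
from scipy.spatial.distance import pdist
from scipy.cluster.hierarchy import linkage, dendrogram

# Coordinate points from the start of the example
points = np.array([[1, 1], [2, 3], [3, 5], [4, 5], [6, 6], [7, 5]], dtype=float)
labels = ["A", "B", "C", "D", "E", "F"]

# Step 1: pairwise Euclidean distances (condensed form of the distance matrix)
D = pdist(points, metric="euclidean")

# Steps 2-7: repeatedly merge the two closest clusters with the single-link rule
# dist(A U B, C) = min(dist(A, C), dist(B, C))
Z = linkage(D, method="single")

# Step 8: draw the dendrogram
dendrogram(Z, labels=labels)
plt.ylabel("Merge distance")
plt.show()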