# Unit IV: Ensemble Learning & Unsupervised Learning – Study Material
## Ensemble Learning
Ensemble Learning is a technique where multiple models are combined to improve
overall performance. It reduces errors, increases accuracy, and handles data variability
better than individual models.
### Key Features:
1. Combines multiple weak learners to create a strong learner.
2. Improves generalization and reduces overfitting.
3. Works well for both classification and regression tasks.
### Types of Ensemble Learning:
- **Bagging**: Reduces variance by training multiple models on random subsets (e.g.,
Random Forest).
- **Boosting**: Reduces bias by training models sequentially, giving more weight to
misclassified instances (e.g., AdaBoost, Gradient Boosting).
- **Stacking**: Combines multiple models using a meta-learner for final predictions.
## Model Combination Schemes
Different strategies exist for combining multiple models in ensemble learning; a short voting and stacking sketch follows the list below.
1. **Voting**: In classification, multiple models vote, and the majority class is selected.
2. **Error-Correcting Output Codes (ECOC)**: Decomposes multi-class problems into
multiple binary classifications.
3. **Bagging (Bootstrap Aggregating)**: Trains models independently on different
subsets of data and averages results.
4. **Boosting**: Models are trained sequentially, correcting errors from previous
models.
5. **Stacking**: Outputs from base learners are combined using another model (meta-
learner) for final predictions.
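As a minimal sketch of the voting and stacking schemes, assuming scikit-learn is available: the synthetic dataset, choice of base learners, and meta-learner below are illustrative assumptions, not prescriptions from these notes.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

base_learners = [
    ("lr", LogisticRegression(max_iter=1000)),
    ("dt", DecisionTreeClassifier(max_depth=3)),
    ("nb", GaussianNB()),
]

# Voting: each base model predicts, and the majority class is selected.
voting = VotingClassifier(estimators=base_learners, voting="hard")
voting.fit(X_train, y_train)
print("Voting accuracy:", voting.score(X_test, y_test))

# Stacking: a meta-learner combines the base models' outputs.
stacking = StackingClassifier(estimators=base_learners,
                              final_estimator=LogisticRegression(max_iter=1000))
stacking.fit(X_train, y_train)
print("Stacking accuracy:", stacking.score(X_test, y_test))
```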
## Bagging: Random Forest
Bagging (Bootstrap Aggregating) improves stability and accuracy by training models independently on bootstrap samples and aggregating their predictions, which reduces variance and overfitting. A Random Forest sketch follows the lists below.
### **Random Forest**:
- Uses multiple Decision Trees trained on different subsets of data.
- Predictions are averaged (regression) or majority-voted (classification).
- Handles missing values and large datasets well.
### **Advantages**:
- Reduces overfitting.
- Works well with high-dimensional data.
- Can be used for feature importance ranking.
### **Disadvantages**:
- Requires more computational power.
- Loses interpretability compared to individual Decision Trees.
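A minimal Random Forest sketch with scikit-learn; the Iris dataset and hyperparameters are illustrative assumptions.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 100 Decision Trees, each trained on a bootstrap sample of the data.
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, y_train)

print("Test accuracy:", rf.score(X_test, y_test))
# Feature importance ranking, one of the advantages noted above.
print("Feature importances:", rf.feature_importances_)
```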
## Boosting: AdaBoost
Boosting combines weak models sequentially, giving more weight to misclassified instances; an AdaBoost sketch appears after the lists below.
### **AdaBoost (Adaptive Boosting)**:
- Assigns weights to each sample and updates them iteratively.
- Focuses on misclassified samples to improve predictions.
- Uses weak classifiers like Decision Stumps.
### **Advantages**:
- Reduces bias, improving weak classifiers.
- Often achieves higher accuracy than bagging on complex datasets.
### **Disadvantages**:
- Sensitive to noise in the dataset.
- Slower training due to sequential model building.
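A minimal AdaBoost sketch with scikit-learn, whose default weak learner is a decision stump (depth-1 tree); the dataset and parameters are illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=1)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

# Each new stump focuses on the samples that earlier stumps misclassified.
ada = AdaBoostClassifier(n_estimators=50, learning_rate=1.0, random_state=1)
ada.fit(X_train, y_train)
print("Test accuracy:", ada.score(X_test, y_test))
```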
## Unsupervised Learning
Unsupervised Learning finds patterns in **unlabeled data**. Unlike supervised learning,
it does not rely on predefined outputs.
### **Key Features**:
1. Works with **unlabeled** data.
2. Groups similar data points or reduces dimensionality.
3. Used in anomaly detection, recommendation systems, and exploratory data analysis.
### **Main Types**:
- **Clustering**: Groups similar data points.
- **Dimensionality Reduction**: Reduces dataset complexity while preserving essential
information (e.g., PCA, LLE, Factor Analysis).
## Clustering: Introduction
Clustering is an unsupervised learning technique that **groups similar data points**
based on some similarity measure.
### **Types of Clustering**:
1. **Hierarchical Clustering**: Builds a hierarchy of clusters (e.g., AGNES, DIANA).
2. **Partitional Clustering**: Divides data into distinct clusters (e.g., K-Means, K-Modes).
3. **Density-Based Clustering**: Identifies clusters based on dense regions (e.g., DBSCAN, Mean-Shift); a short DBSCAN sketch follows this list.
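As a quick illustration of density-based clustering, the sketch below runs DBSCAN on a toy two-moons dataset; the `eps` and `min_samples` values are illustrative assumptions for this data.

```python
from sklearn.datasets import make_moons
from sklearn.cluster import DBSCAN

X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

# Points in dense regions form clusters; sparse points are labelled -1 (noise).
labels = DBSCAN(eps=0.2, min_samples=5).fit_predict(X)
print("Cluster labels found:", set(labels))
```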
## Hierarchical Clustering: AGNES & DIANA
Hierarchical Clustering builds a nested structure of clusters; an agglomerative clustering sketch follows the lists below.
### **AGNES (Agglomerative Nesting)**:
- A **bottom-up** approach: Each data point starts as its own cluster and merges step by
step.
- Uses linkage methods (single, complete, average).
### **DIANA (Divisive Analysis)**:
- A **top-down** approach: All data points start in one cluster and are split iteratively.
### **Advantages**:
- No need to predefine the number of clusters.
- Dendrograms provide visual insights.
### **Disadvantages**:
- Computationally expensive for large datasets.
- Sensitive to noise and outliers.
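A minimal bottom-up (AGNES-style) sketch using scikit-learn's AgglomerativeClustering; the toy dataset, linkage method, and cluster count are illustrative assumptions, and only the agglomerative side is sketched here.

```python
from sklearn.datasets import make_blobs
from sklearn.cluster import AgglomerativeClustering

X, _ = make_blobs(n_samples=150, centers=3, random_state=0)

# Each point starts as its own cluster; clusters merge until 3 remain.
agnes = AgglomerativeClustering(n_clusters=3, linkage="average")
labels = agnes.fit_predict(X)
print("Cluster sizes:", [list(labels).count(c) for c in range(3)])
```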
## Partitional Clustering: K-Means & K-Modes
Partitional Clustering divides data into a **fixed number (K) of clusters**; a K-Means sketch follows the lists below.
### **K-Means Clustering**:
- Assigns data points to **K clusters** based on distance (usually Euclidean).
- Iteratively updates centroids to minimize variance.
### **K-Modes Clustering**:
- Used for categorical data instead of numerical values.
- Replaces means with **modes** (most frequent values).
### **Advantages**:
- Fast and scalable for large datasets.
- Works well when clusters are well-separated.
### **Disadvantages**:
- Sensitive to initial cluster centers.
- Does not handle outliers well.
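A minimal K-Means sketch with scikit-learn; K=3 and the toy blobs dataset are illustrative assumptions. (K-Modes is not in scikit-learn and is typically provided by the third-party `kmodes` package, so only K-Means is shown.)

```python
from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

# Centroids are updated iteratively to minimise within-cluster variance.
km = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = km.fit_predict(X)
print("Centroids:\n", km.cluster_centers_)
```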
## Dimensionality Reduction: PCA & LLE
Dimensionality reduction techniques reduce the number of features while preserving
important information; a PCA and LLE sketch follows the lists below.
### **Principal Component Analysis (PCA)**:
- Finds new feature axes (principal components) that maximize variance.
- Used in image compression, face recognition.
### **Locally Linear Embedding (LLE)**:
- A nonlinear technique preserving local relationships in data.
- Suitable for highly nonlinear structures.
### **Advantages**:
- Reduces noise and redundancy.
- Speeds up model training.
### **Disadvantages**:
- Can lose interpretability.
- Assumes linearity (for PCA).
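A minimal sketch of PCA and LLE with scikit-learn; the digits dataset, component counts, and neighbour count are illustrative assumptions.

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.manifold import LocallyLinearEmbedding

X, _ = load_digits(return_X_y=True)   # 64 pixel features per image

# PCA: linear projection onto the directions of maximum variance.
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X)
print("Variance explained by 2 components:", pca.explained_variance_ratio_.sum())

# LLE: nonlinear embedding that preserves each point's local neighbourhood.
lle = LocallyLinearEmbedding(n_components=2, n_neighbors=10)
X_lle = lle.fit_transform(X)
print("Embedded shape:", X_lle.shape)
```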