Shobhit Srivastava
Concept drift in Machine Learning
“The pessimist complains about the wind; the optimist expects it to
change; the realist adjusts the sails.” - William Arthur Ward
Everything changes with time, and data is no exception. Changes in data
degrade a machine learning model’s test performance over time, and
ultimately the wrong predictions coming out of the model can hurt its
business value.
The relationship between the input attributes and the output label does not
stay static; it changes with time, which hurts model performance because the
model cannot capture the new underlying pattern present in the new data.
This effect is termed concept drift in machine learning.
In this article, I will provide a brief overview of this concept, which comes up
quite frequently in machine learning and which every practitioner should be
aware of.
Here’s a brief outline of the points I will cover in this article:
What is concept drift?
How is the concept related to the data science life cycle?
Why do we need to monitor this effect?
How to address the issue?
Conclusion.
What is concept drift?
Concept drift is an effect that degrades a machine learning model’s
performance over time. The degradation happens because the underlying
pattern in the new data on which the model is tested differs from the pattern
in the data on which it was trained. Such a change can arise, for example,
from shifts in customers’ product-buying patterns or from weather
parameters changing over time.
We all would be quite familiar with this basic function concept:
Y= f(X)
Here we have a function f that captures the pattern, or relationship, between
the independent variable X and the dependent variable Y.
But when this pattern fades, the model gives out wrong outputs and becomes
effectively garbage.
This is where concept drift comes into effect.
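To make the idea concrete, here is a minimal, hypothetical sketch (my own illustration, not from any library): a one-parameter model is fit while the relationship is roughly y = 2x, and then the relationship drifts to y = -2x. The model’s error on the new data explodes even though nothing about the model itself changed.

```python
import random

random.seed(0)

# Old regime: y = 2x + noise. New regime: y = -2x + noise (the
# relationship between X and Y has drifted).
def make_data(slope, n=200):
    xs = [random.uniform(-1, 1) for _ in range(n)]
    return [(x, slope * x + random.gauss(0, 0.1)) for x in xs]

old_data = make_data(slope=2.0)
new_data = make_data(slope=-2.0)

# "Train" a one-parameter model: the least-squares slope through
# the origin, a stand-in for any real training routine.
def fit_slope(data):
    sxy = sum(x * y for x, y in data)
    sxx = sum(x * x for x, _ in data)
    return sxy / sxx

def mse(slope, data):
    return sum((y - slope * x) ** 2 for x, y in data) / len(data)

model = fit_slope(old_data)
print(f"error on old data: {mse(model, old_data):.3f}")  # small
print(f"error on new data: {mse(model, new_data):.3f}")  # large
```

The model still represents f perfectly well for the old pattern; it is the pattern itself that moved.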
How is the concept related to the data science life cycle?
We all are aware that the data science project is executed in various phases,
right? Starting with:
Problem identification and its business context.
Data set collection.
Data exploration and feature engineering.
Data visualizations.
Model training and development.
Model testing and deployment.
Model retraining and update process.
I assume we are all familiar with the first six phases. Concept drift comes
into play in the last phase, i.e., model retraining and updating. This is the
stage where the model is deployed at the customer’s end and is tested
frequently. To catch the model’s deviation, its predictions are monitored and
checked to verify that they are still correct, so that business productivity is
maintained.
Why do we need to monitor this effect?
We need to monitor this effect because it can cause huge problems for the
business the model serves. Wrong predictions can cost a company its
reputation as well as its loyal customers, since the model could be providing
recommendations that no longer match the users’ new buying patterns.
Take the Corona pandemic as an example: people experienced a major shift
in their buying patterns, restricting their purchases to essentials only, a
change the model is unaware of. The model therefore keeps recommending
products that don’t match the customers’ choices, and the business can lose a
major chunk of revenue.
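As a hedged sketch of what such monitoring could look like (the DriftMonitor class, its window size, and its threshold are all my own invention, not a standard API), one can track accuracy over a sliding window of recent predictions and raise a flag when it falls too low:

```python
from collections import deque

class DriftMonitor:
    """Track accuracy over the last `window` predictions and flag
    suspected drift when accuracy drops below `threshold`."""

    def __init__(self, window=100, threshold=0.8):
        self.results = deque(maxlen=window)
        self.threshold = threshold

    def record(self, predicted, actual):
        self.results.append(predicted == actual)

    def accuracy(self):
        return sum(self.results) / len(self.results)

    def drift_suspected(self):
        # Only alert once the window is full, to avoid noisy early alarms.
        return (len(self.results) == self.results.maxlen
                and self.accuracy() < self.threshold)

monitor = DriftMonitor(window=100, threshold=0.8)

# The model is right 95% of the time at first...
for i in range(100):
    monitor.record(1, 1 if i % 20 else 0)
print(monitor.drift_suspected())  # False: accuracy is 0.95

# ...then only 50% of the time after the buying pattern shifts.
for i in range(100):
    monitor.record(1, i % 2)
print(monitor.drift_suspected())  # True: accuracy fell to 0.50
```

In production this check would run against a live stream of predictions and ground-truth labels, and the alert would trigger one of the remedies discussed next.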
How to address the issue?
There are many methods to deal with this issue.
1. Do nothing (maintain a single static model).
No, I am not joking! We can simply assume that the underlying pattern in the
data doesn’t change over time, which in many cases is true. That lets us focus
on building one best model for future predictions and move on to other
projects.
2. Periodically re-fit the model.
This can be more effective than the first option. We retrain our outdated
model on the new incoming data, so that it captures the new underlying
pattern in the data set.
This saves the model from becoming ‘garbage’ and keeps it delivering
business value.
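A minimal sketch of this idea, assuming a simple slope-fitting routine stands in for real training: keep only the most recent observations and re-fit at a fixed interval, so the model always reflects the latest relationship in the data.

```python
from collections import deque

RETRAIN_EVERY = 50   # re-fit after this many new observations
WINDOW = 200         # train only on the most recent observations

recent = deque(maxlen=WINDOW)

def fit_slope(data):
    # Least-squares slope through the origin: a stand-in for any
    # real training routine.
    sxy = sum(x * y for x, y in data)
    sxx = sum(x * x for x, _ in data)
    return sxy / sxx if sxx else 0.0

# A stream of observations following the current pattern y = 3x.
stream = [(x / 100, 3 * x / 100) for x in range(1, 301)]

model = None
for i, (x, y) in enumerate(stream):
    recent.append((x, y))
    if (i + 1) % RETRAIN_EVERY == 0:
        model = fit_slope(recent)  # periodic re-fit on fresh data

print(round(model, 2))  # 3.0: the model tracks the current slope
```

The window size and retraining interval are tuning knobs: a short window adapts quickly but is noisy, a long one is stable but slow to react to drift.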
3. Periodically update the model.
Instead of re-fitting the outdated model on the new data set, we can train and
deploy a brand-new model from time to time, whenever our testing shows
that the previous model is giving wrong predictions.
This method can be more effective, since a new model can yield more
accurate predictions, but training and deploying it take significant time.
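One possible sketch, with an illustrative error threshold of my own choosing: evaluate the deployed model on recent data, and train and swap in a replacement only when the error has grown too large.

```python
def evaluate(model, data):
    # Mean squared error of a slope-style model y = m * x.
    return sum((y - model * x) ** 2 for x, y in data) / len(data)

def train(data):
    # Least-squares slope: a stand-in for full model training.
    sxy = sum(x * y for x, y in data)
    sxx = sum(x * x for x, _ in data)
    return sxy / sxx

current_model = 2.0  # deployed before the drift (y = 2x)

# Recent data follows the drifted pattern y = -2x.
recent = [(x / 10, -2.0 * x / 10) for x in range(1, 51)]

ERROR_THRESHOLD = 0.5
if evaluate(current_model, recent) > ERROR_THRESHOLD:
    # The old model has degraded: train and deploy a replacement.
    current_model = train(recent)

print(round(current_model, 6))  # -2.0: the replacement fits the new data
```

Unlike periodic re-fitting, nothing happens while the model still performs well, so retraining cost is only paid when testing actually detects a problem.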
4. Ensemble a new model with the old one.
In this method, we ensemble one or more new models trained on the new
data set with the outdated model. The new models work alongside the old
one, correcting its wrong predictions.
This approach is a bit more complex, but more effective than the ones
mentioned above.
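A minimal sketch of such an ensemble, assuming two simple models represented as plain functions and a weight I picked for illustration: average the old and new models’ predictions, weighting the new model more heavily so it corrects the old one.

```python
def make_ensemble(old_model, new_model, new_weight=0.7):
    # Weighted average of the two models' predictions; the new model
    # dominates, but the old one still contributes.
    def predict(x):
        return (1 - new_weight) * old_model(x) + new_weight * new_model(x)
    return predict

def old_model(x):
    return 2.0 * x   # fit before the drift

def new_model(x):
    return -2.0 * x  # fit on post-drift data

ensemble = make_ensemble(old_model, new_model, new_weight=0.7)
print(round(ensemble(1.0), 2))  # -0.8: pulled toward the new model
```

In practice the weights could themselves be tuned on recent data, so the ensemble gradually shifts trust from the old model to the new one as the drift is confirmed.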
[Edit] If you want to dive deeper into the topic, please go through this article
at neptune.ai.
Conclusion
All right, that’s it for today. I hope we have learned some new concepts. This
effect is often ignored by junior data scientists, who think their work is over
once a project is complete, but that’s not the case: their responsibilities don’t
end there. To maintain our business value, we must monitor and track what
value we are adding to the customer’s experience and whether the
recommender is still providing good recommendations.
For more articles like this, visit here.
If this article has benefited you in any way, please support me here:
https://www.buymeacoffee.com/shobhitsri
Please feel free to comment below if you are unclear on any point; I will
reply as soon as possible. You can connect with me on LinkedIn.
Thank you for reading. Have a good day.