
Notes for Introduction to Machine Learning in Production (MLOps1) on Coursera/Deeplearning.ai

2021-06-07

Week 1: Steps of an ML project


The ML project lifecycle
MLOps (Machine Learning Operations) comprises a set of tools and principles to
support progress through the ML project lifecycle.

• Decide to work on speech recognition for voice search
• Decide on key metrics:
◦ Accuracy, latency, throughput
• Estimate resources and timeline

• Is the data labeled consistently?
• How much silence before/after each clip?
• How to perform volume normalization?


Data drift: the input data has changed. The distribution of the variables is
meaningfully different. As a result, the trained model is no longer relevant for
this new data.

Concept drift occurs when the patterns the model learned no longer hold. In
contrast to data drift, the distributions (such as user demographics, frequency
of words, etc.) might even remain the same. Instead, the relationships between
the model inputs and outputs change.
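
Monitoring for data drift can start with a simple distribution test. A minimal sketch, assuming numeric features and using a two-sample Kolmogorov-Smirnov test (the window size and p-value threshold are illustrative choices, not from the course):

```python
import numpy as np
from scipy.stats import ks_2samp

def drifted(reference: np.ndarray, production: np.ndarray,
            p_threshold: float = 0.01) -> bool:
    """Flag drift if the two samples are unlikely to share a distribution."""
    _statistic, p_value = ks_2samp(reference, production)
    return p_value < p_threshold

rng = np.random.default_rng(0)
train_feature = rng.normal(loc=0.0, scale=1.0, size=5_000)  # training-time data
prod_feature = rng.normal(loc=0.5, scale=1.0, size=1_000)   # shifted production data
print(drifted(train_feature, prod_feature))  # True -> investigate / retrain
```

Concept drift is harder to catch this way, since the input distribution may be unchanged; it typically shows up later, in label-based metrics.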

• Real-time vs. Batch
• Cloud vs. Edge/Browser
• Compute resources (CPU/GPU/memory)
• Latency, throughput (QPS)
• Logging
• Security and privacy

1. New product/capability
2. Automate/assist with manual task
3. Replace previous ML system

Key ideas:

• Gradual ramp up with monitoring


• Rollback

• ML system shadows the human and runs in parallel.


• ML system’s output is not used for any decisions during this phase.

• Roll out to small fraction (say 5%) of traffic initially (see the routing
sketch below).
• Monitor system and ramp up traffic gradually.
• Easy way to enable rollback
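
A minimal sketch of the traffic-splitting part of such a canary rollout, assuming `old_model` and `new_model` are callables (both are stand-ins here). Hashing on the user ID keeps each user on a consistent variant, and rollback is just setting the fraction back to zero:

```python
import hashlib

def old_model(x): return "old prediction"   # stand-in
def new_model(x): return "new prediction"   # stand-in

CANARY_FRACTION = 0.05  # start at ~5% and ramp up while monitoring

def pick_model(user_id: str):
    # Stable bucket in [0, 1) derived from the user ID.
    bucket = int(hashlib.md5(user_id.encode()).hexdigest(), 16) % 10_000 / 10_000
    return new_model if bucket < CANARY_FRACTION else old_model

model = pick_model("user-42")
print(model("some input"))
```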


• Brainstorm the things that could go wrong.
• Brainstorm a few statistics/metrics that will detect the problem.
• It is ok to use many metrics initially and gradually remove the ones you find
not useful.

• Software metrics: memory, compute, latency, throughput, server load
• Input metrics: avg input length, avg input volume, num missing values, avg
image brightness
• Output metrics: # times return “” (null), # times user redoes the search

• Set thresholds for alarms
• Adapt metrics and thresholds over time (a minimal alarm-check sketch follows)
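
A minimal sketch of threshold-based alarms over such metrics; the metric names and limits are illustrative, not from the course:

```python
from statistics import mean

# Illustrative (lo, hi) bounds per metric; tune these over time.
THRESHOLDS = {
    "avg_input_length_sec": (1.0, 30.0),
    "fraction_null_output": (0.0, 0.05),
}

def check_alarms(metrics: dict) -> list:
    alarms = []
    for name, value in metrics.items():
        lo, hi = THRESHOLDS[name]
        if not lo <= value <= hi:
            alarms.append(f"ALARM: {name}={value:.3f} outside [{lo}, {hi}]")
    return alarms

outputs = ["hello", "", "turn left", ""]  # two null transcripts
metrics = {
    "avg_input_length_sec": mean([3.2, 2.8, 4.1, 3.0]),
    "fraction_null_output": sum(o == "" for o in outputs) / len(outputs),
}
for alarm in check_alarms(metrics):
    print(alarm)  # fraction_null_output=0.500 outside [0.0, 0.05]
```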

• Manual retraining
• Automatic retraining


• Monitor
◦ Software metrics
◦ Input metrics
◦ Output metrics
• How quickly do they change?
◦ User data generally has slower drift.
◦ Enterprise data (B2B applications) can shift fast.

Reading Material Week 1:

• Machine Learning in Production: Why You Should Care About Data and
Concept Drift
• Monitoring Machine Learning Models in Production
• A Chat with Andrew on MLOps: From Model-centric to Data-centric AI

Week 2: Select and train model


Selecting and Training a Model

1. Doing well on training set (usually measured by average training error)


2. Doing well on dev/test sets.
3. Doing well on business metrics/project goals.

Web search example

• Informational and transactional queries


• Navigational queries

• Example: ML for loan approval: make sure not to discriminate by ethnicity,
gender, location, language, or other protected attributes.
• Example: Product recommendations from retailers: Be careful to treat
fairly all major user, retailer, and product categories.

• Skewed data distribution


• Accuracy in rare classes

• I did well on the test set


• But this doesn’t work for my application


• Unstructured data: Image, Audio, Text (HLP is important)


• Structured data: a data frame

• Human-level performance (HLP)


• Literature search for state-of-the-art/open source
• Quick-and-dirty implementation
• Performance of older system

A baseline helps indicate what might be possible. In some cases (such as HLP),
it also gives a sense of the irreducible error/Bayes error.

• Literature search to see what’s possible (courses, blogs, open-source
projects).
• Find open-source implementations if available.
• A reasonable algorithm with good data will often outperform a great
algorithm with not-so-good data.

Should you take into account deployment constraints when picking a model?

• Yes, if baseline is already established and goal is to build and deploy.


• No (or not necessarily), if purpose is to establish a baseline and determine
what is possible and might be worth pursuing.

• Try to overfit a small training dataset before training on a large one (a
quick sanity check is sketched below).
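
A minimal sketch of that sanity check, using scikit-learn stand-ins (the model and data here are illustrative): if the model cannot drive training error to near zero on ~20 examples, suspect a bug in the pipeline or the labels before scaling up.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1_000, random_state=0)
X_small, y_small = X[:20], y[:20]  # deliberately tiny subset

model = LogisticRegression(max_iter=1_000).fit(X_small, y_small)
train_acc = model.score(X_small, y_small)  # accuracy on the same 20 examples
assert train_acc > 0.95, f"cannot overfit 20 examples (acc={train_acc:.2f})"
```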

Error analysis and performance auditing


• What fraction of errors has that tag?
• Of all data with that tag, what fraction is misclassified? (see the sketch
below)
• What fraction of all the data has that tag?
• How much room for improvement is there on data with that tag?
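
A minimal sketch of computing the first two of these fractions with pandas, assuming one boolean column per tag (the tags and values here are made up):

```python
import pandas as pd

df = pd.DataFrame({
    "correct":       [True, False, False, True, False, True],
    "car_noise":     [False, True, True, False, False, False],
    "low_bandwidth": [False, False, True, True, True, False],
})

for tag in ["car_noise", "low_bandwidth"]:
    errors = ~df["correct"]
    frac_of_errors = (errors & df[tag]).sum() / errors.sum()   # errors with tag
    frac_tag_wrong = (errors & df[tag]).sum() / df[tag].sum()  # tag misclassified
    print(f"{tag}: {frac_of_errors:.0%} of all errors; "
          f"{frac_tag_wrong:.0%} of tagged examples misclassified")
```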

Prioritizing what to work on

Decide on most important categories to work on based on:

• How much room for improvement there is


• How frequently that category appears
• How easy it is to improve accuracy in that category
• How important it is to improve in that category

For categories you want to prioritize:

• Collect more data


• Use data augmentation to get more data
• Improve label accuracy/data quality

Skewed datasets
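
The slides summarized under this heading contrast raw accuracy with precision/recall/F1: on a skewed dataset, a model that always predicts the majority class scores high accuracy while being useless. A minimal sketch with scikit-learn:

```python
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score)

y_true = [0] * 95 + [1] * 5   # 5% positive rate (e.g., defects)
y_pred = [0] * 100            # model that always predicts "no defect"

print(accuracy_score(y_true, y_pred))                     # 0.95 -- looks great
print(precision_score(y_true, y_pred, zero_division=0))   # 0.0
print(recall_score(y_true, y_pred, zero_division=0))      # 0.0
print(f1_score(y_true, y_pred, zero_division=0))          # 0.0 -- reveals the problem
```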


Performance auditing

Check for accuracy, fairness/bias, and other problems.


1. Brainstorm the ways the system might go wrong.


◦ Performance on subsets of data (e.g., ethnicity, gender).
◦ How common are certain errors (e.g., FP, FN).
◦ Performance on rare classes.
2. Establish metrics to assess performance against these issues on
appropriate slices of data.
3. Get business/product owner buy-in.

1. Brainstorm the ways the system might go wrong.


◦ Accuracy on different genders and ethnicities.
◦ Accuracy on different devices.
◦ Prevalence of rude mis-transcriptions.
2. Establish metrics to assess performance against these issues on
appropriate slices of data.
◦ Mean accuracy for different genders and major accents.
◦ Mean accuracy on different devices.
◦ Check for prevalence of offensive words in the output. (A sliced-metrics
sketch follows below.)
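
A minimal sketch of computing such sliced metrics with pandas; the slice columns and values are illustrative:

```python
import pandas as pd

df = pd.DataFrame({
    "correct": [1, 0, 1, 1, 0, 1, 1, 0],
    "accent":  ["US", "US", "UK", "UK", "IN", "IN", "US", "UK"],
    "device":  ["ios", "android", "ios", "android",
                "ios", "android", "ios", "ios"],
})

# Mean accuracy and sample count per slice; treat small counts with skepticism.
for col in ["accent", "device"]:
    print(df.groupby(col)["correct"].agg(["mean", "count"]), "\n")
```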

Data iteration

• Model-centric view: take the data you have, and develop a model that does
as well as possible on it.
◦ Hold the data fixed and iteratively improve the code/model.
• Data-centric view: the quality of the data is paramount. Use tools to
improve the data quality; this will allow multiple models to do well.
◦ Hold the code fixed and iteratively improve the data.

• Goal:
◦ Create realistic examples that (i) the algorithm does poorly on, but (ii)
humans (or other baselines) do well on.
• Checklist (a small audio-augmentation sketch follows below):
◦ Does it sound realistic?
◦ Is the x → y mapping clear? (e.g., can humans recognize the speech?)
◦ Is the algorithm currently doing poorly on it?
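
A minimal sketch of one such augmentation for speech data: mixing background noise into a clean clip at a controlled signal-to-noise ratio. The arrays stand in for real recordings and the SNR value is an illustrative choice:

```python
import numpy as np

def add_noise(speech: np.ndarray, noise: np.ndarray,
              snr_db: float = 10.0) -> np.ndarray:
    noise = noise[: len(speech)]
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2) + 1e-12
    # Scale the noise so the mixture has the requested SNR.
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise

rng = np.random.default_rng(0)
clean = rng.normal(size=16_000)       # 1 s of stand-in audio at 16 kHz
cafe_noise = rng.normal(size=16_000)  # stand-in background noise
augmented = add_noise(clean, cafe_noise, snr_db=10.0)
```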


For unstructured data problems, if:

• The model is large (low bias).


• The mapping x→y is clear (e.g., given only the input x, humans can make
accurate predictions).

Then, adding data rarely hurts accuracy.

• Restaurant recommendation example:


◦ Vegans are frequently recommended restaurants with only meat
options.
◦ Possible features to add?
▪ Is person vegan (based on past orders)?
▪ Does restaurant have vegan options (based on menu)?
• Other food delivery examples
◦ Only tea/coffee and only pizza
◦ What are the added features that can help make a decision?
◦ Product recommendation:

Collaborative filtering → Content-based filtering (cold-start)

• Error analysis can be harder if there is no good baseline (such as HLP) to
compare to.
• Error analysis, user feedback and benchmarking to competitors can all
provide inspiration for features to add.

1. What to track?
◦ Algorithm/code versioning
◦ Dataset used
◦ Hyperparameters
◦ Results
2. Tracking tools (a minimal file-based sketch follows this list)
◦ Text files
◦ Spreadsheets
◦ Experiment tracking systems
3. Desirable features
◦ Information needed to replicate results
◦ Experiment results, ideally with summary metrics/analysis
◦ Perhaps also: resource monitoring, visualization, model error analysis
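
A minimal sketch of the text-file end of this spectrum: append one JSON record per run with everything needed to replicate it. The fields shown are illustrative; dedicated tracking systems add search and visualization on top.

```python
import json
import subprocess
import time
from pathlib import Path

def log_experiment(dataset: str, hyperparams: dict, metrics: dict,
                   logfile: str = "experiments.jsonl") -> None:
    record = {
        "timestamp": time.strftime("%Y-%m-%dT%H:%M:%S"),
        # Code version: current git commit of the working tree.
        "git_commit": subprocess.run(
            ["git", "rev-parse", "HEAD"], capture_output=True, text=True
        ).stdout.strip(),
        "dataset": dataset,
        "hyperparams": hyperparams,
        "metrics": metrics,
    }
    with Path(logfile).open("a") as f:
        f.write(json.dumps(record) + "\n")

log_experiment("speech_v3", {"lr": 3e-4, "epochs": 10}, {"dev_wer": 0.182})
```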

Try to ensure consistently high-quality data in all phases of the ML project
lifecycle.

Good data:

• Covers important cases (good coverage of inputs x)
• Is defined consistently (definition of labels y is unambiguous)
• Has timely feedback from production data (distribution covers data drift
and concept drift)
• Is sized appropriately

Reading Material Week 2:

• Establishing a baseline
• Error analysis

• Experiment tracking

Week 3: Data Definition and Baseline

Define Data and Establish Baseline

• What is the input x?
◦ Lighting? Contrast? Resolution?
◦ What features need to be included?
• What is the target label y?
◦ How can we ensure labelers give consistent labels?

• Unstructured data
◦ May or may not have a huge collection of unlabeled examples x.
◦ Humans can label more data.
◦ Data augmentation is more likely to be helpful.
• Structured data
◦ May be more difficult to obtain more data.
◦ Human labeling may not be possible (with some exceptions).

• Small data
◦ Clean labels are critical.
◦ Can manually look through the dataset and fix labels.
◦ Can get all the labelers to talk to each other.
• Big data
◦ Emphasis on data processes.


Problems with a large dataset but where there’s a long tail or rare events in the
input will have small data challenges too.

• Web search
• Self-driving cars
• Product recommendation systems

• Have multiple labelers label the same example.
• When there is disagreement, have MLEs, subject matter experts (SMEs), and/
or labelers discuss the definition of y to reach agreement.
• If labelers believe that x doesn’t contain enough information, consider
changing x.
• Iterate until it is hard to significantly increase agreement.


• Small data
◦ Usually a small number of labelers.
◦ Can ask labelers to discuss specific labels.
• Big data
◦ Get to a consistent definition with a small group.
◦ Then send labeling instructions to labelers.
◦ Can consider having multiple labelers label every example and using
voting or consensus labels to increase accuracy (see the sketch below).
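
A minimal sketch of such consensus labeling: take the majority vote, and escalate examples where agreement is too low for the labelers to discuss (the agreement threshold is an illustrative choice):

```python
from collections import Counter

def consensus(labels: list, min_agreement: float = 2 / 3):
    # Most common label and its vote count.
    (label, votes), = Counter(labels).most_common(1)
    agreement = votes / len(labels)
    return label if agreement >= min_agreement else None  # None -> escalate

print(consensus(["scratch", "scratch", "dent"]))  # 'scratch' (2/3 agree)
print(consensus(["scratch", "dent", "none"]))     # None -> discuss definition of y
```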

Estimate Bayes error / irreducible error to help with error analysis and
prioritization.

• In academia, establish and beat a respectable benchmark to support


publication.
• Business or product owner asks for 99% accuracy. HLP helps establish a
more reasonable target.
• “Prove” the ML system is superior to humans doing the job and thus the
business or product owner should adopt it. (Use with caution)


When the ground truth label is externally defined, HLP gives an estimate of
Bayes error / irreducible error.

But often the ground truth is just another human label.

• When the label y comes from a human label, HLP << 100% may indicate
ambiguous labeling instructions.
• Improving label consistency will raise HLP (a simple agreement-based
estimate is sketched below).
• This makes it harder for ML to beat HLP. But the more consistent labels
will raise ML performance, which is ultimately likely to benefit the actual
application performance.
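
A minimal sketch of estimating HLP when ground truth is itself a human label: measure how often independent labelers agree (the labels here are made up):

```python
def estimate_hlp(labeler_a: list, labeler_b: list) -> float:
    # Fraction of examples on which the two labelers agree.
    matches = sum(a == b for a, b in zip(labeler_a, labeler_b))
    return matches / len(labeler_a)

a = ["defect", "ok", "defect", "ok", "defect"]
b = ["defect", "ok", "ok",     "ok", "defect"]
print(f"HLP estimate: {estimate_hlp(a, b):.0%}")  # 80% -> check label consistency
```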

Structured data problems are less likely to involve human labelers, thus HLP is
less frequently used.

Some exceptions:

• User ID merging: same person?
• Based on network traffic, is the computer hacked?
• Is the transaction fraudulent?
• Spam account? Bot?
• From GPS, what is the mode of transportation - on foot, bike, car, bus?

Label and Organize Data

• Get into this iteration loop as quickly as possible.
• Instead of asking “How long would it take to obtain m examples?”, ask “How
much data can we obtain in k days?”
• Exception: if you have worked on the problem before and from experience
you know you need m examples.

Brainstorm list of data sources


Other factors: data quality, privacy, regulatory constraints

• Options: in-house vs. outsourced vs. crowdsourced
• Having MLEs label data is expensive. But doing this for just a few days is
usually fine.
• Who is qualified to label?
◦ Speech recognition - any reasonably fluent speaker
◦ Factory inspection, medical image diagnosis - SME (subject matter
expert)
◦ Recommender systems - maybe impossible to label well
• Don’t increase data by more than 10x at a time.

• POC (proof of concept) phase:
◦ Goal is to decide if the application is workable and worth deploying.
◦ Focus on getting the prototype to work.
◦ It’s ok if data pre-processing is manual. But take extensive notes/
comments.
• Production phase:
◦ After project utility is established, use more sophisticated tools to make
sure the data pipeline is replicable.
◦ E.g., TensorFlow Transform, Apache Beam, Airflow, …


• Examples:
◦ Manufacturing visual inspection: time, factory, line #, camera
settings, phone model, inspector ID, …
◦ Speech recognition: device type, labeler ID, VAD (voice activity
detection) model ID, …
• Useful for:
◦ Error analysis. Spotting unexpected effects.
◦ Keeping track of data provenance.

Visual inspection example: 100 examples, 30 positive (defective)

• Train/dev/test: 60%/20%/20%
• Random split: positive examples: 21/2/7 (35%/10%/35%) → dev set is not
representative
• Want: 18/6/6 (30%/30%/30%) → balanced split (see the stratified-split
sketch below)
• No need to worry about this with large datasets - a random split will be
representative
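
A minimal sketch of such a balanced split using scikit-learn’s stratified splitting; sizes mirror the 60%/20%/20% example above:

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)
y = np.array([1] * 30 + [0] * 70)  # 30 positive (defective), 70 negative

# First carve off 60% for train, then split the rest evenly into dev/test.
X_train, X_rest, y_train, y_rest = train_test_split(
    X, y, test_size=0.4, stratify=y, random_state=0)
X_dev, X_test, y_dev, y_test = train_test_split(
    X_rest, y_rest, test_size=0.5, stratify=y_rest, random_state=0)

print(y_train.mean(), y_dev.mean(), y_test.mean())  # ~0.30 in every split
```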

Scoping


• Use external benchmark (literature, other company, competitor)

People are very good on unstructured data tasks

Criteria: can a human, given the same data, perform the task?

• Given past purchases, predict future purchases ✗
• Given weather, predict shopping mall foot traffic ✗
• Given DNA info, predict heart disease ✗
• Given social media chatter, predict demand for a clothing style ✗
• Given history of a stock’s price, predict future price of that stock ✗

Humans, given only the same data, cannot perform these structured-data tasks
well, so HLP is not a useful baseline here.


• Is this project creating net positive societal value?


• Is this project reasonably fair and free from bias?
• Have any ethical concerns been openly aired and debated?

Key speci�cations:

• ML metrics (accuracy, precision/recall, etc.)


• Software metrics (latency, throughput, etc. given compute resources)
• Business metrics (revenue, etc.)
• Resources needed (data, personnel, help from other teams)
• Timeline

If unsure, consider benchmarking to other projects, or building a POC (proof of
concept) first.

Reading Material Week 3:

• Label ambiguity: https://arxiv.org/pdf/1706.06969.pdf
• Data pipelines
• Data lineage
• MLOps

Overall resources:

Katsiapis, K., Karmarkar, A., Altay, A., Zaks, A., Polyzotis, N., … Li, Z.
(2020). Towards ML Engineering: A brief history of TensorFlow Extended (TFX).
http://arxiv.org/abs/2010.02013

Paleyes, A., Urma, R.-G., & Lawrence, N. D. (2020). Challenges in deploying
machine learning: A survey of case studies. http://arxiv.org/abs/2011.09926
