Platform for Complete Machine
Learning Lifecycle
Jules S. Damji
@2twitme
San Francisco | May 13, 2020: Part 2 of 3 Series
Outline – Introduction to MLflow: Understanding MLflow Projects and Models – Part 2
§ Review & Recap Part 1: MLflow Tracking
▪ https://youtu.be/x3cxvsUFVZA
§ MLflow Components
▪ MLflow Projects & Models
▪ Concepts and Motivations
▪ MLflow on Databricks Community Edition (DCE)
▪ Explore MLflow UI
▪ Tutorials
§ Q&A
https://dbricks.co/mlflow-part-2
https://github.com/dmatrix/mlflow-workshop-project-expamle-1
Machine Learning
Development is Complex
Traditional Software vs. Machine Learning

Traditional Software
§ Goal: Meet a functional specification
§ Quality depends only on code
§ Typically pick one software stack w/ fewer libraries and tools

Machine Learning
§ Goal: Optimize a metric (e.g., accuracy); constantly experiment to improve it
§ Quality depends on input data and tuning parameters
§ Compare + combine many libraries and models
Machine Learning Lifecycle

[Diagram: Delta / Raw Data → Data Prep → Training (tuning μ, λ, θ) → Model Exchange → Deploy, with Scale and Governance concerns at every stage]
MLflow Components

§ Tracking: Record and query experiments: code, data, config, and results
§ Projects: Package data science code in a format that enables reproducible runs on any platform
§ Models: Deploy machine learning models in diverse serving environments
§ Model Registry (new): Store, annotate and manage models in a central repository

mlflow.org | github.com/mlflow/mlflow | twitter.com/MLflow | databricks.com
Model Development with MLflow is Simple!

data = load_text(file)
ngrams = extract_ngrams(data, N=n)
model = train_model(ngrams, learning_rate=lr)
score = compute_accuracy(model)

with mlflow.start_run() as run:
    mlflow.log_param("data_file", file)
    mlflow.log_param("n", n)
    mlflow.log_param("learn_rate", lr)
    mlflow.log_metric("score", score)
    mlflow.sklearn.log_model(model, "model")

Track parameters, metrics, output files & code version.
Search using the UI or API:
$ mlflow ui
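Conceptually, each logging call above appends a key-value record to the active run, which the UI and API can later query. A minimal stand-in using only the standard library (the `ToyRun` class and the values below are illustrative, not part of MLflow) shows the shape of what the tracking API records:

```python
# Toy stand-in for MLflow's tracking API: a run collects params
# (immutable inputs) and metrics (measured outputs) for later querying.
class ToyRun:
    def __init__(self):
        self.params = {}    # e.g., data file, n, learning rate
        self.metrics = {}   # e.g., score, accuracy

    def log_param(self, key, value):
        self.params[key] = value

    def log_metric(self, key, value):
        self.metrics[key] = value

    def __enter__(self):
        return self

    def __exit__(self, *exc):
        return False  # a real tracking server would flush the run here

with ToyRun() as run:
    run.log_param("data_file", "reviews.txt")
    run.log_param("n", 3)
    run.log_metric("score", 0.91)
```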
MLflow Tracking

[Diagram: Notebooks, Local Apps, and Cloud Jobs log via the Python, Java, R, or REST API to a Tracking Server, which stores metadata (parameters, metrics) in a data source and artifacts (models, output files); runs are searchable from the UI or API]

$ export MLFLOW_TRACKING_URI=<URI>
mlflow.set_tracking_uri(URI)
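By default the client writes runs to a local ./mlruns file store; the environment variable or the set_tracking_uri() call points it at a remote server instead. A small stdlib sketch of that resolution order (the function is illustrative; only the default ./mlruns path and the precedence mirror MLflow's documented behavior):

```python
import os

def resolve_tracking_uri(explicit_uri=None):
    """Mimic the precedence MLflow uses: an explicit URI from
    set_tracking_uri() wins, then MLFLOW_TRACKING_URI, then the
    local ./mlruns file store."""
    if explicit_uri:
        return explicit_uri
    return os.environ.get("MLFLOW_TRACKING_URI", "file:./mlruns")

# Clear the env var so the default is visible in this sketch.
os.environ.pop("MLFLOW_TRACKING_URI", None)
print(resolve_tracking_uri())                        # local default
print(resolve_tracking_uri("http://tracking:5000"))  # explicit wins
```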
MLflow Projects Motivation

Challenge: with a diverse set of tools and a diverse set of environments, ML results are difficult to reproduce.

Projects: package data science code in a format that enables reproducible runs on any platform.
MLflow Projects

[Diagram: Project Spec (code, config, dependencies, data) → Local Execution or Remote Execution]
1. Example MLflow Project File

my_project/
├── MLproject
├── conda.yaml
├── main.py
└── model.py

MLproject:

conda_env: conda.yaml
entry_points:
  main:
    parameters:
      training_data: path
      lambda: {type: float, default: 0.1}
    command: python main.py {training_data} {lambda}

Run it locally or from GitHub:

$ mlflow run . -e main -P lambda=0.2
$ mlflow run git://<my_project>.git -P lambda=0.2

mlflow.run("git://<my_project>", parameters={...})
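The entry point's command is a template: mlflow run fills the {training_data} and {lambda} placeholders from -P arguments and the defaults declared in MLproject. A stdlib sketch of that substitution step (function and variable names are illustrative, not MLflow internals):

```python
# Build the shell command for an entry point by merging user-supplied
# -P parameters with the defaults declared in the MLproject file.
def render_command(template, declared, supplied):
    params = {name: spec.get("default") for name, spec in declared.items()}
    params.update(supplied)
    missing = [k for k, v in params.items() if v is None]
    if missing:
        raise ValueError(f"missing required parameters: {missing}")
    return template.format(**params)

declared = {
    "training_data": {"type": "path"},            # required: no default
    "lambda": {"type": "float", "default": 0.1},  # optional
}
cmd = render_command("python main.py {training_data} {lambda}",
                     declared,
                     {"training_data": "data.csv", "lambda": 0.2})
print(cmd)  # python main.py data.csv 0.2
```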
2. Example conda.yaml

name: mlflow-env
channels:
  - defaults
dependencies:
  - python=3.7.3
  - scikit-learn=0.20.3
  - pip:
    - mlflow
    - cloudpickle==0.8.0
MLflow Projects

Packaging format for reproducible ML runs
• Any code folder or GitHub repository
• MLproject file with project configuration

Defines dependencies for reproducibility
• Conda (+ R, Docker, …) dependencies can be specified in MLproject
• Reproducible in (almost) any environment

Execution API for running projects
§ CLI / Python / R / Java
§ Supports local and remote execution; the project URI is a local directory path or Git URI containing the MLproject file
▪ mlflow run --help (CLI)
▪ mlflow run https://github.com/dmatrix/jsd-mlflow-examples.git#keras/imdbclassifier (CLI)
▪ mlflow.run(<project_uri>, parameters={}) or mlflow.projects.run(<project_uri>, parameters={}) (API)
Anatomy of MLflow Project Execution

1. $ mlflow run https://github.com/mlflow-project-example-1
2. Fetch the GitHub project into a /var/folders/xxx directory
3. Create the conda env (mlflow-<run_id>) and activate it
4. Install packages & dependencies from conda.yaml
5. In the activated conda environment, execute the entry point:
   python train.py args, …, args
How to Build an MLflow Project

1. Create an MLproject file
   • Populate with entry points and default parameters and types
2. Create a conda.yaml file
   • Populate with dependencies, or copy from your MLflow UI: artifacts -> Model -> conda.yaml
3. Create a GitHub repository
   • Populate or upload MLproject, conda.yaml, data, src files, etc.
4. Test it
   • mlflow run git://URI -P arg .. -P args
   • mlflow.run(URI, parameters={})
   • Share it …
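Steps 1 and 2 amount to writing two small text files next to your source. A stdlib sketch that scaffolds them into a temporary directory (the contents mirror the earlier examples; the scaffold_project helper is illustrative, not an MLflow API):

```python
import pathlib
import tempfile
import textwrap

def scaffold_project(root):
    """Write the two files an MLflow project needs: MLproject and conda.yaml."""
    root = pathlib.Path(root)
    (root / "MLproject").write_text(textwrap.dedent("""\
        conda_env: conda.yaml
        entry_points:
          main:
            parameters:
              training_data: path
              lambda: {type: float, default: 0.1}
            command: python main.py {training_data} {lambda}
        """))
    (root / "conda.yaml").write_text(textwrap.dedent("""\
        name: mlflow-env
        channels:
          - defaults
        dependencies:
          - python=3.7.3
          - pip:
            - mlflow
        """))
    return sorted(p.name for p in root.iterdir())

with tempfile.TemporaryDirectory() as tmp:
    files = scaffold_project(tmp)
    print(files)  # ['MLproject', 'conda.yaml']
```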
MLflow Project: Create Multi-Step Workflow
https://github.com/mlflow/mlflow/tree/master/examples/multistep_workflow
MLflow Models Motivation

Inference code must bridge ML frameworks to serving tools, including batch & stream scoring. Without a standard model format, supporting N frameworks across M serving tools requires an N×M combination of custom integrations.
MLflow Models

A standard format for ML models: a model is saved once with multiple "flavors" (Flavor 1, Flavor 2, …), so inference code and serving tools (batch & stream scoring included) only need to understand the one format, instead of N×M framework/serving-tool combinations.
Example MLflow Model

mlflow.tensorflow.log_model(...)

my_model/
├── MLmodel
└── estimator/
    ├── saved_model.pb
    └── variables/
        ...

MLmodel:

run_id: 769915006efd4c4bbd662461
time_created: 2018-06-28T12:34
flavors:
  tensorflow:
    saved_model_dir: estimator
    signature_def_key: predict
  python_function:
    loader_module: mlflow.tensorflow

The tensorflow flavor is usable by tools that understand the TensorFlow model format; the python_function flavor is usable by any tool that can run Python (Docker, Spark, etc.!).
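A tool consuming this model reads the flavors mapping in the MLmodel file and picks a flavor it understands. A stdlib sketch of that lookup (the dict mirrors the MLmodel above; the pick_flavor helper is illustrative, and actual loading is elided):

```python
# The MLmodel file from the slide, parsed into a dict (YAML parsing elided).
mlmodel = {
    "run_id": "769915006efd4c4bbd662461",
    "flavors": {
        "tensorflow": {"saved_model_dir": "estimator",
                       "signature_def_key": "predict"},
        "python_function": {"loader_module": "mlflow.tensorflow"},
    },
}

def pick_flavor(mlmodel, preferences):
    """Return the first flavor the consuming tool understands."""
    for name in preferences:
        if name in mlmodel["flavors"]:
            return name, mlmodel["flavors"][name]
    raise ValueError("no supported flavor found")

# A TF-aware tool uses the tensorflow flavor; a generic tool would
# fall back to python_function.
name, cfg = pick_flavor(mlmodel, ["tensorflow", "python_function"])
print(name, cfg["saved_model_dir"])  # tensorflow estimator
```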
Model Keras Flavor Example

Train a model, then log it:
mlflow.keras.log_model(...)

Flavor 1: pyfunc
model = mlflow.pyfunc.load_model(...)
model.predict(input_dataframe)  # a pandas DataFrame

Flavor 2: Keras
model = mlflow.keras.load_model(...)
model.predict(keras.Input(...))
Model Flavors Example

model = mlflow.pyfunc.load_model(model_uri)
model.predict(input_dataframe)  # a pandas DataFrame
MLflow Models

Packaging format for ML models
• Any directory with an MLmodel file

Defines dependencies for reproducibility
• Conda environment can be specified in the MLmodel configuration

Model creation and loading utilities
• mlflow.<model_flavor>.save_model(...) or log_model(...)
• mlflow.<model_flavor>.load_model(...)

Deployment APIs
• CLI / Python / R / Java
• mlflow models [OPTIONS] COMMAND [ARGS]...
• mlflow models serve [OPTIONS] [ARGS]...
• mlflow models predict [OPTIONS] [ARGS]...
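Once `mlflow models serve` is running, it scores JSON payloads over HTTP on its /invocations endpoint; the pyfunc server accepts pandas "split"-orient JSON. A stdlib sketch that builds such a payload (the column names, values, host, and port are illustrative):

```python
import json

def build_payload(columns, rows):
    """Build a pandas split-orient JSON payload for the scoring server."""
    return json.dumps({"columns": columns, "data": rows})

payload = build_payload(["alcohol", "pH"], [[12.8, 3.2], [11.0, 3.5]])
print(payload)

# Sent with, e.g.:
#   curl -d "$payload" \
#        -H 'Content-Type: application/json; format=pandas-split' \
#        http://127.0.0.1:5000/invocations
```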
MLflow Project & Models
Tutorials
Tutorials: https://github.com/dmatrix/mlflow-workshop-part-2
MLflow Project Keras Example:
https://github.com/dmatrix/mlflow-workshop-project-expamle-1
Learning More About MLflow
§ pip install mlflow to get started
§ Find docs & examples at mlflow.org
§ Peruse code at MLflow Github
§ Join the Slack channel
§ More MLflow tutorials
Thank you!
Q&A
jules@databricks.com
@2twitme
https://www.linkedin.com/in/dmatrix/