Production and Beyond: Deploying and Managing Machine Learning Models

What happens after (initial) deployment

ML production life cycle

Evaluation

Monitoring

Deployment

Management

After deployment

Evaluate and track metrics over time.

React to feedback from deployed models.

Monitoring Management Evaluation

ML in production - 101Model

Historical Data

Predictions

LiveData

Feedback

Batch training

Real-time predictions

ML in production - 101Model

Historical Data

Real-time predictions

Batch training

PredictionsModel 2

LiveData

Key questions• When to update a model?• How to choose between existing models?• Answer: continuous evaluation and testing

What is evaluation?

Predictions Metric

+ Evaluation

What data?Which metric?

Evaluating a recommenderModel

Historical Data

Predictions

LiveData

Ranking loss

User engagement

Evaluating a recommenderModel

Historical Data

Predictions

LiveData

Ranking loss

User engagementOffline evaluation:

When to update modelOnline evaluation:Choosing between models

Updating ML modelsWhy update?• Trends and user tastes change over time• Model performance drops

When to update?• Track statistics of data over time• Monitor both offline & online metrics on live data• Update when offline metric diverges from online metrics

Choosing between ML models

Model 2

Model 1

2000 visits10% CTR

Group A

Everybody gets Model 2

2000 visits30% CTR

Group B

Strategy 1: A/B testing—select the best model and use it all the time

A statistician walks into a casino…

Pay-off $1:$1000 Pay-off $1:$200 Pay-off $1:$500Play this 85% of

the timePlay this 10% of

the timePlay this 5% of the

Multi-armed bandits

A statistician walks into an ML production environment

Pay-off $1:$1000 Pay-off $1:$200 Pay-off $1:$500

Use this 85% of the time

(Exploitation)

(Exploration)

Model 1

Model 2

Model 3

MAB vs. A/B testingWhy MAB?• Continuous optimization, “set and forget”• Maximize overall reward

Why A/B test?• Simple to understand• Single winner• Tricky to do right

Other production considerations• Versioning• Logging• Provenance• Dashboards• Reports

“Machine learning: The high interest rate credit card of technical debt,” D. Sculley et al, Google, 2014“Two big challenges in machine learning,” Leon Bottou, ICML 2015 invited talk

Conclusions

Evaluation

Monitoring

Deploymen

Management

Dato Distributed&

Dato Predictive Services

A/B testing,multi-armed bandits

& much more

Dato – one stop shop for all stages of the ML life cycleSimple, platform agnostic interface

@datoinc, #DataSmt

Production and Beyond: Deploying and Managing Machine Learning Models

Technology

Transcript of Production and Beyond: Deploying and Managing Machine Learning Models

Deploying and Managing High Availability Networks

A PROCESS FOR CREATING, MANAGING AND DEPLOYING MATERIALS ... · PDF fileA PROCESS FOR CREATING, MANAGING AND DEPLOYING ... Process workflow for managing and deploying materials in

Deploying and Managing Lync Voice

Deploying and managing IBM MQ in the Cloud

Windows Server 2012 - Deploying and Managing

9780764526114 Chapter 6 Managing and Deploying Applicatio

Production and Beyond: Deploying and Managing Machine Learning Models

Deploying and managing gluster using ovirt - fudcon2015

Deploying and Managing SP2013 Apps

Gluecon 2016 Keynote: Deploying and Managing Blockchain Applications

Module 4: Deploying and Managing BizTalk Applications

Deploying and managing new cell sites with FTTA

A Scalable Approach to Deploying and Managing Appliances

Module 1_ Deploying and Managing Windows Server 2012

Deploying and managing Solr at scale

Managing & Deploying Desktop Productivity Platformsdownload.microsoft.com/download/B/4/7/B472789E-1FA9... · White Paper Managing & Deploying Desktop Productivity Platforms Comparing

Module 4: Deploying and Managing BizTalk Applications.

Deploying & Managing distributed apps on YARN

Meeting challenges deploying and managing on-premise IT equipment

Deploying and Managing OpenStack with Heat