MLSEV. BigML Workshop II


This document discusses problems with client-side machine learning automation and proposes solutions using server-side workflows defined as RESTful resources and a domain-specific language (DSL). The DSL allows defining reusable ML workflows, executing workflows on a server, and easily parallelizing workflows for multiple resources through syntactic abstraction and language interoperability features.


Machine Learning School in Seville
1st edition
March 7–8, 2019

Client-side Machine Learning Automation
Problems of client-side solutions
Complex: Too fine-grained, leaky abstractions
Cumbersome: Error handling, network issues
Hard to reuse: Tied to a single programming language
Hard to scale: Parallelization again a problem
Hard to generalize: Declarative client tools hide complexity at the cost of flexibility
Hard to combine: Black-box tools cannot be easily integrated as parts of bigger client-side workflows
Hard to audit: Client-side development environments are complex and very hard to sandbox
Not enough automation
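
To make the "cumbersome" item concrete, the following is a minimal sketch of the plumbing a client ends up writing by hand when it drives a workflow step by step. `FlakyClient`, its `create` and `status` methods, and the failure pattern are hypothetical stand-ins invented for illustration; they are not part of any BigML API.

```python
import itertools


class TransientNetworkError(Exception):
    """Stand-in for a dropped connection or timeout."""


class FlakyClient:
    """Hypothetical remote API: every third call fails, resources finish on the second poll."""

    def __init__(self):
        self._calls = itertools.count(1)
        self._polls = {}

    def _maybe_fail(self):
        if next(self._calls) % 3 == 0:
            raise TransientNetworkError("connection reset")

    def create(self, kind, parent=None):
        self._maybe_fail()
        resource_id = f"{kind}/{len(self._polls) + 1}"
        self._polls[resource_id] = 0
        return resource_id

    def status(self, resource_id):
        self._maybe_fail()
        self._polls[resource_id] += 1
        return "finished" if self._polls[resource_id] >= 2 else "in-progress"


def with_retries(call, attempts=5):
    """Retry a remote call on transient errors; re-raise after the last attempt."""
    for attempt in range(1, attempts + 1):
        try:
            return call()
        except TransientNetworkError:
            if attempt == attempts:
                raise


def create_and_wait(client, kind, parent=None, max_polls=10):
    """Create one resource, then poll until the server reports it finished."""
    resource_id = with_retries(lambda: client.create(kind, parent))
    for _ in range(max_polls):
        if with_retries(lambda: client.status(resource_id)) == "finished":
            return resource_id
    raise TimeoutError(f"{resource_id} did not finish")


def client_side_workflow(client):
    """Every step of the workflow repeats the same retry-and-poll plumbing."""
    dataset = create_and_wait(client, "dataset")
    model = create_and_wait(client, "model", parent=dataset)
    return create_and_wait(client, "evaluation", parent=model)
```

Even in this toy version, most of the code deals with retries and polling rather than with datasets, models, and evaluations.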

Machine Learning Automation
Solution (scalability, reuse): Back to the server

Machine Learning Automation
Solution (complexity, reuse): Domain-specific languages


In a Nutshell
1. Workflows reified as server-side, RESTful resources
2. Domain-specific language for ML workflow automation

Workflows as RESTful Resources
Library: Reusable building block: a collection of WhizzML definitions that can be imported by other libraries or scripts.
Script: Executable code that describes an actual workflow.
• Imports: List of libraries with code used by the script.
• Inputs: List of input values that parameterize the workflow.
• Outputs: List of values computed by the script and returned to the user.
Execution: Given a script and a complete set of inputs, the workflow can be executed and its outputs generated.
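
The relationship between the three resource types can be sketched in a few lines of Python. This is only an illustrative model under stated assumptions: the `Library`, `Script`, and `execute` names, their fields, and the `mean` example are hypothetical and do not reflect how BigML stores or runs these resources.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class Library:
    """Reusable building block: named definitions other code can import."""
    definitions: Dict[str, Callable[..., Any]]


@dataclass
class Script:
    """Executable workflow with declared imports, inputs, and outputs."""
    body: Callable[[Dict[str, Callable[..., Any]], Dict[str, Any]], Dict[str, Any]]
    imports: List[Library] = field(default_factory=list)
    inputs: List[str] = field(default_factory=list)
    outputs: List[str] = field(default_factory=list)


def execute(script: Script, inputs: Dict[str, Any]) -> Dict[str, Any]:
    """Run a script once a complete set of inputs is supplied."""
    missing = [name for name in script.inputs if name not in inputs]
    if missing:
        raise ValueError(f"missing inputs: {missing}")
    # Merge the definitions of every imported library into one namespace.
    namespace: Dict[str, Callable[..., Any]] = {}
    for library in script.imports:
        namespace.update(library.definitions)
    results = script.body(namespace, inputs)
    # Only the declared outputs are returned to the user.
    return {name: results[name] for name in script.outputs}


# Usage: a one-definition library, a script that imports it, and one execution.
stats = Library({"mean": lambda xs: sum(xs) / len(xs)})
script = Script(
    body=lambda lib, args: {"average": lib["mean"](args["values"]), "scratch": 0},
    imports=[stats],
    inputs=["values"],
    outputs=["average"],
)
result = execute(script, {"values": [1, 2, 3]})
```

The point of the sketch is the separation of concerns: the script declares what it needs and what it returns, and an execution is nothing more than a script paired with concrete inputs.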

Ways to create WhizzML Scripts and Libraries
GitHub
Script editor
Gallery
Other scripts
Scriptify

Syntactic Abstraction in WhizzML: Simple workflow
;; ML artifacts are first-class citizens,
;; we only need to talk about our domain
(let ([train-id test-id] (create-dataset-split id 0.8)
      model-id (create-model train-id))
  (create-evaluation test-id model-id
                     {"name" "Evaluation 80/20"
                      "missing_strategy" 0}))

Language Interoperability in WhizzML
from bigml.api import BigML
api = BigML()
# choose workflow
script = 'script/567b4b5be3f2a123a690ff56'
# define parameters
inputs = {'source': 'source/5643d345f43a234ff2310a3e'}
# execute
api.ok(api.create_execution(script, inputs))

Metaprogramming in reflective DSLs: Scriptify
Resources that create
resources that create
resources that create
resources that create
resources that create
resources that create
. . .

Domain Specificity and Scalability: Trivial parallelization
;; Workflow for 1 resource
(let ([train-id test-id] (create-dataset-split id 0.8)
      model-id (create-model train-id))
  (create-evaluation test-id model-id))

Domain Specificity and Scalability: Trivial parallelization
;; Workflow for arbitrary number of resources
(let (splits (for (id input-datasets)
               (create-dataset-split id 0.8)))
  (for (s splits)
    (create-evaluation (s 1) (create-model (s 0)))))
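
For readers more at home in Python, here is a rough analogue of the step from one resource to many. It is a sketch under assumptions, not a description of how the server schedules work: the three `create_*` functions are hypothetical local stubs that only build strings, and the thread pool simply illustrates that the per-dataset iterations share no state and can therefore run concurrently. Unlike the WhizzML version, it handles each dataset end to end instead of splitting all datasets first.

```python
from concurrent.futures import ThreadPoolExecutor


def create_dataset_split(dataset_id, rate):
    """Hypothetical stub: return [train-id, test-id] for one dataset."""
    return [f"{dataset_id}-train-{rate}", f"{dataset_id}-test-{rate}"]


def create_model(train_id):
    """Hypothetical stub: return a model id for a training dataset."""
    return f"model({train_id})"


def create_evaluation(test_id, model_id):
    """Hypothetical stub: return an evaluation id."""
    return f"evaluation({test_id}, {model_id})"


def evaluate_one(dataset_id):
    """Workflow for one resource: split, train, evaluate."""
    train_id, test_id = create_dataset_split(dataset_id, 0.8)
    return create_evaluation(test_id, create_model(train_id))


def evaluate_all(input_datasets, workers=4):
    """Workflow for many resources: independent iterations run concurrently."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map returns results in the same order as the inputs.
        return list(pool.map(evaluate_one, input_datasets))
```

Calling `evaluate_all(["d1", "d2"])` returns one evaluation id per input dataset, in input order.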

Domain Specificity and Scalability: Trivial parallelization
from bigml.api import BigML
api = BigML()
# choose workflow
script = 'script/567b4b5be3f2a123a690ff56'
# define parameters
inputs = {'input-dataset': 'dataset/5643d345f43a234ff2310a30'}
# execute
api.ok(api.create_execution(script, inputs))

Domain Specificity and Scalability: Trivial parallelization
from bigml.api import BigML
api = BigML()
# choose workflow
script = 'script/567b4b5be3f2a123a690de1228'
# define parameters
inputs = {'input-datasets': ['dataset/5643d345f43a234ff2310a30',
                             'dataset/5643d345f43a234ff2310a31',
                             'dataset/5643d345f43a234ff2310a32',
                             ...]}
# execute
api.ok(api.create_execution(script, inputs))



2. Workshop jao - Mercè Martín

4. Client-side Machine Learning Automation: Not enough abstraction

5. Client-side Machine Learning Automation: Algorithmic complexity and computing-resource management problems, mostly washed away, are back!


17. Server-side Workflows: the bazaar

