We provide a recap of the MLflow R interface which was announced at Spark+AI Summit Europe and discuss recent developments. The session includes a live demo showcasing the intersection of big data (Spark) and deep learning (via TensorFlow) and how the end-to-end lifecycle from prototyping to deployment can be managed by MLflow.
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Streamlining AI Prototyping and Deployment with R and MLflow
1. Kevin Kuo @kevinykuo, RStudio
Streamlining AI
Prototyping and
Deployment with R and
MLflow
#UnifiedAnalytics #SparkAISummit
2. Daily specials
- Quick update on the R ecosystems for AI stuff
- Recap of MLflow
- Demo
- Discussion + Q&A
2#UnifiedAnalytics #SparkAISummit
3. Sparklyr Update
- Arrow integration to massively speed up UDFs
- XGBoost
- TFRecord read/write
- SparkNLP on the way
https://spark.rstudio.com/
3#UnifiedAnalytics #SparkAISummit
6. TensorFlow Update
He looks skeptical, as if you
were nothing
get it right.
It drives me crazy, I can do
this
reproachful look no longer
endure.
His name is Olaf.
6#UnifiedAnalytics #SparkAISummit
7. TensorFlow Update
- library(keras) defaults to tf.keras
- TensorFlow Probability for probabilistic modeling
- Eager execution
- Preparing for TF2.0 drop
https://tensorflow.rstudio.com/
https://blogs.rstudio.com/tensorflow/
7#UnifiedAnalytics #SparkAISummit
8. Quick recap of MLflow
Open source platform for
- Experiment instrumentation (Tracking)
- Reproducible runs (Projects)
- Model deployment (Models)
8#UnifiedAnalytics #SparkAISummit
13. Roadmap
How are package dependencies handled for R
projects?
Conda? Packrat?
What if your packages depend on Java/Python
libraries?
13#UnifiedAnalytics #SparkAISummit