Publicidad

Dmitry Spodarets: Modern MLOps toolchain 2023

Lviv Startup Club
1 de Apr de 2023
Publicidad

Más contenido relacionado

Similar a Dmitry Spodarets: Modern MLOps toolchain 2023(20)

Más de Lviv Startup Club(20)

Publicidad

Dmitry Spodarets: Modern MLOps toolchain 2023

  1. Modern MLOps toolchain 2023
  2. About Me Dmytro Spodarets ● DevOps Architect at Grid Dynamics ● Founder and chief editor of Data Phoenix AWS | Infrastructure | DevOps/MLOps
  3. Agenda ● What is MLOps? ● DevOps vs MLOps ● MLOps Stack ● Use cases: ○ From research to production ○ Versioning & retraining ○ IaaC & K8s ○ Using GitOps for Machine Learning
  4. CRISP-DM
  5. Hidden Technical Debt in Machine Learning Systems https://papers.nips.cc/paper/2015/file/86df7dcfd896fcaf2674f757a2463eba-Paper.pdf
  6. The goal of MLOps is to reduce technical friction to get the model from an idea into production in the shortest possible time with as little risk as possible.
  7. DevOps vs MLOps
  8. MLOps is not a single tool or platform
  9. MLOps is about agreeing to do ML the right way and then supporting it.
  10. A few shared principles will take you a long way… ML should be collaborative ML should be reproducible ML should be continuous ML should be tested & monitored
  11. A few shared principles will take you a long way… ML should be collaborative ML should be reproducible ML should be continuous ML should be tested & monitored Shared Infrastructure Versioning for Code, Data and Metadata Machine Learning Pipelines Model Deployment and Monitoring
  12. Continuous X MLOps is an ML engineering culture that includes the following practices: ● Continuous Integration (CI) extends the testing and validating code and components by adding testing and validating data and models. ● Continuous Delivery (CD) concerns with delivery of an ML training pipeline that automatically deploys another the ML model prediction service. ● Continuous Training (CT) is unique to ML systems property, which automatically retrains ML models for re-deployment. ● Continuous Monitoring (CM) concerns with monitoring production data and model performance metrics, which are bound to business metrics.
  13. MLOps levels ● Level 0: No MLOps ● Level 1: DevOps no MLOps ● Level 2: Automated Training ● Level 3: Automated Model Deployment ● Level 4: Full MLOps Automated Retraining https://learn.microsoft.com/en-us/azure/architecture/example-scenario/mlops/mlops-maturity-model
  14. MLOps Stack
  15. AWS ML Stack
  16. Use cases ○ From research to production ○ Versioning & retraining ○ IaaC & K8s ○ Using GitOps for Machine Learning
  17. From research to production
  18. From research to production
  19. Data versioning & retraining
  20. Data versioning & retraining
  21. IaaC & K8s
  22. Using GitOps for Machine Learning https://docs.dstack.ai/
  23. Large language models / Generative models
  24. 3D-parallelism https://aws.amazon.com/blogs/machine-learning/training-large-language-models-on-amazon-sagemaker-best-practices/
  25. Scalable HPC
  26. Questions? Dmytro Spodarets d.spodarets@dataphoenix.info https://dataphoenix.info
  27. https://www.eventbrite.com/o/data-phoenix-events-23453295848 https://dataphoenix.info/subscribe/
Publicidad