SlideShare una empresa de Scribd logo
1 de 29
INTRODUCING CLOUDERA MACHINE LEARNING
Cloud-Native Platform for Enterprise Scale Machine Learning
Tristan Zajonc, CTO for Machine Learning
2 © Cloudera, Inc. All rights reserved.
DISCLAIMER
The information in this document is proprietary to Cloudera. No part of this document may be reproduced
or disclosed without the express prior written permission of Cloudera.
This document is not binding upon Cloudera in any way (including with respect to Cloudera’s course of
business, product strategy, future releases, or development commitments), and is not a part of or subject to
any license agreement or other agreement you may have with Cloudera. Cloudera makes no commitments
about any future developments or functionality in any Cloudera product. The development, release, and
timing of release of any software features or functionality described in this document remains at the
discretion of Cloudera and you should not rely on any statements about development plans or anticipated
functionality in this document when making any purchasing decisions.
Cloudera assumes no responsibility for errors or omissions in this document. Cloudera does not warrant
the accuracy or completeness of the information, text, graphics, links or other items contained within this
material. This document is provided without a warranty of any kind, either express or implied, including but
not limited to the implied warranties of merchantability, fitness for a particular purpose or non-infringement.
Cloudera shall have no liability for damages of any kind including without limitation direct, special, indirect
or consequential damages that may result from the use of these materials.”
Confidential-Restricted – For Discussion Purposes
Only
3 © Cloudera, Inc. All rights reserved.
Let’s get started. Have you
• Used Spark?
• Used Cloudera Data Science Workbench?
• Used Kubernetes?
• Used GPUs for deep learning?
POLL QUESTION 1
4 © Cloudera, Inc. All rights reserved.
THE OPPORTUNITY: THE INDUSTRIALIZATION OF AI
PROTECT
business
CONNECT
products &
services (IoT)
DRIVE
customer insights
Thousands of use cases for ML and AI to unlock value within the enterprise
● Predictive maintenance
● Logistics optimization
● Self-driving cars
● Medical diagnostics
● Marketing effectiveness
● Next best action
● Insider threat
prevention
● Fraud prevention
● Payment integrity
5 © Cloudera, Inc. All rights reserved.
A TIPPING POINT OF DATA, COMPUTE, AND ALGORITHMS
Three major trends enable the potential of machine learning
COMPUTEDATA! ALGORITHMS
6 © Cloudera, Inc. All rights reserved.
7 © Cloudera, Inc. All rights reserved.
MACHINE LEARNING AT UBER, NETFLIX, AND FACEBOOK
Industrialized AI requires requires new supporting tools and platforms
Facebook
FBLearner
Uber
Michelangelo
Netflix
Recommendation
Platform
8 © Cloudera, Inc. All rights reserved.
UBER
Michelangelo: Uber’s Machine Learning Platform
Source: https://eng.uber.com/michelangelo/
DATA CENTER
PUBLIC CLOUD
9 © Cloudera, Inc. All rights reserved.
NETFLIX
Netflix recommendation platform
Source: https://medium.com/netflix-techblog/netflix-at-spark-ai-summit-2018-5304749ed7fa
PUBLIC CLOUD
10 © Cloudera, Inc. All rights reserved.
FACEBOOK
Facebook’s AI infrastructure
Source: https://www.slideshare.net/KarthikMurugesan2/facebook-machine-learning-infrastructure-2018-slides
DATA CENTER
11 © Cloudera, Inc. All rights reserved.
LESSON 1: ML AT SCALE REQUIRES A UNIFIED DATA STRATEGY
Streaming
Ingest
Batch Ingest
Machine
Learning Tools
BI Tools and
SQL Editors
Data Products
DATA, METADATA, SECURITY, GOVERNANCE, WORKLOAD MANAGEMENT
MACHINE
LEARNING
DATA
ENGINEERING
DATA
WAREHOUSE
OPERATIONAL
DATABASE
12 © Cloudera, Inc. All rights reserved.
LESSON 2: EMERGING CONSENSUS ON MODERN ML STACK
Big Data
Platform
ML/AI
Frameworks
Container
Infrastructure
Confidential-Restricted – For Discussion Purposes
Only
13 © Cloudera, Inc. All rights reserved.
LESSON 3: AGILITY REQUIRES SELF-SERVICE PLATFORM
Manage pipelines + models
Deploy models
Automate pipelines
Monitor performance
DEPLOYDEVELOP
Make teams more productive
Explore data
Develop reports, pipelines, models
Collaborate with peers
TRAIN
Scale resources efficiently
Train models
Tune parameters
Track performance
End-to-end machine learning infrastructure for teams building at scale
MANAGE
Run anywhere with a common architecture
Manage access and resources
Scale cost with usage
Confidential-Restricted – For Discussion Purposes
Only
14 © Cloudera, Inc. All rights reserved.
LESSON 4: LEVERAGE THE BEST INFRASTRUCTURE FOR TASK
Non-Cloud Private Cloud Public Cloud PaaS
I want to
maximize
• Cost-efficiency • Control, elasticity,
and convenience
• Control, elasticity,
and convenience
• Agility
I want to
minimize
• Dependence on
unproven technology
• Resource contention
between departments
• Dependence on data
center floor space
• Dependence on IT
and therefore need
as simple as possible
I want to
standardize
• On whatever
provides the best
ROI
• On a single
environment for the
entire data center
• On a single cloud
provider for all
infrastructure needs
• On whatever is
easiest to use
I want to
store my data
• On premises
because cheaper
and/or more secure
• On premises due to
company /
government mandate
• In the cloud because
easier
• In the cloud because
easier
15 © Cloudera, Inc. All rights reserved.
TODAY: CLOUDERA DATA SCIENCE WORKBENCH
Accelerate Machine Learning from Research to Production on CDH/HDP
For data scientists
• Experiment faster
Use R, Python, or Scala with on-
demand compute and secure
CDH data access
• Work together
Share reproducible research
with your whole team
• Deploy with confidence
Get to production repeatably
and without recoding
For IT professionals
• Bring data science to the data
Give your data science team
more freedom while reducing
the risk and cost of silos
• Secure by default
Leverage common security and
governance across workloads
• Run anywhere
On-premises or in the cloud
16 © Cloudera, Inc. All rights reserved.
TODAY: CURRENT CDSW ARCHITECTURE
Containerized gateway to CDH/HDP cluster for data scientists
• Benefits
• Easy to attach for self-service data science
• Seamless connectivity to existing clusters
• Isolated, reproducible user environments
• Tradeoffs
• Requires sizing, managing two clusters
• Requires dependency coordination
• Not optimized for distributed training
• Complex, inelastic setup in public cloud
CDH/HDP CDH/HDP
Cloudera Manager / Ambari
gateway node(s) CDH/HDP nodes
Hive, HDFS, ...
CDSW CDSW
...
Master
...
Engine
EngineEngine
EngineEngine
17 © Cloudera, Inc. All rights reserved.
INTRODUCING
18 © Cloudera, Inc. All rights reserved. 18
enable the end-to-end machine learning lifecycle for teams
building thousands of applications across petabytes of data
with an enterprise-ready platform built for the cloud
Rapid provisioning and
experimentation
DevOps mindset
From development to
production
Team
collaboration
and shared
context
Any
framework,
your data, your
models
Secure by default
CPUs, GPUS, TPUs,
and beyond
Massive, diverse
data
Automation Runs anywhere
19 © Cloudera, Inc. All rights reserved.
CLOUDERA MACHINE LEARNING (PRIVATE PREVIEW)
Cloud-native enterprise machine learning platform
DATA SCIENCE DATA ENGINEERING MODEL OPERATIONS
CLOUDERA ML RUNTIME
Python/R, Spark, TensorFlow, CPU/GPU-Optimized
Interactive Development Batch Pipelines Predictive APIs
Full capability of CDSW
Rapid cloud provisioning
and elastic autoscaling
Unified data engineering and
ML with seamless
dependency management
Multi-cloud portability
powered by Kubernetes
Connects to HDFS or
cloud object storage
and shared metadata
Accelerated deep learning
with distributed GPU training
* Initially targeted for cloud managed
K8s services, then OpenShift
KUBERNETES
EKS, AKS, GKE, OpenShift
DATA CATALOG
GOVERNANCESECURITY LIFECYCLEWORKLOAD XM
STORAGE Amazon
S3
Microsoft
ADLS
HDFS KUDU
20 © Cloudera, Inc. All rights reserved.
Requires and extends CDH/HDP, pushing
distributed compute to the cluster
CDH / HDP CDH / HDP
Cloudera Manager / Ambari
gateway node(s) CDH/HDP nodes
Hive, HDFS, ...
CDSW CDSW
...
Master
...
Engine
EngineEngine
EngineEngine
Cloud optimized, elastic distributed compute;
can optionally use CDH / HDP
Kubernetes
(EKS, GKE, AKS,
OpenShift)
CDH / HDP CDH / HDP
Hive, HDFS,
HMS, Sentry,
Navigator
CML CML
...
Master
...
Engine
EngineEngine
EngineEngine
Object
Store
Altus Director / CloudBreak
OPTIONAL
DATA SCIENCE WORKBENCH vs. CLOUDERA MACHINE LEARNING
Cloud ML
APIs
21 © Cloudera, Inc. All rights reserved.
• Lightweight form-factor optimized for cloud and private cloud
• Elastic compute (CPU / GPU) in the cloud
• Unified workflow for distributed data engineering and machine learning
• Simple dependency management, particularly for ML workloads
• python: pip install, R: install.packages, etc
WHY SPARK ON KUBERNETES?
22 © Cloudera, Inc. All rights reserved.
cldr ml run --cmd “python train.py”
23 © Cloudera, Inc. All rights reserved.
from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()
results = spark.sparkContext 
.parallelize(xrange(0, 1000)) 
.map(f) 
.collect()
24 © Cloudera, Inc. All rights reserved.
DEMO
Confidential-Restricted – For Discussion Purposes Only25 © Cloudera, Inc. All rights reserved.
OPERATIONAL EXPERIENCE
✓ Simple installation and management
✓ Lower costs with elastic compute
✓ Easier to support through self-service
✓ Cloud agility everywhere
RESEARCH EXPERIENCE
✓ Instant access to resources
✓ Faster model training and iteration
✓ Unified data engineering and ML
✓ Easier to onboard team members
DEPLOYMENT EXPERIENCE
✓ Easier to scale out compute
✓ Faster deployment of models
✓ Better reproducibility and isolation
✓ Flexible deployment targets
BENEFITS OF A CLOUD-NATIVE ML PLATFORM
Confidential-Restricted – For Discussion Purposes
Only
26 © Cloudera, Inc. All rights reserved.
What capability of Cloudera ML is most interesting for your workloads?
• Simple installation and management
• Autoscaling in the cloud
• Dependency management for Spark
• Distributed GPU training
• Cloud experience on-premise
• Potential multi-tenancy with other Kubernetes workloads
POLL QUESTION 2
© Cloudera, Inc. All rights reserved.
Request access to the private preview
tiny.cloudera.com/CML
QUESTIONS?
THANK YOU
29 © Cloudera, Inc. All rights reserved.
Kubernetes
CMLCML
Engine
Spark
Driver
Spark
Executor
Logging Logging
Project
Volume
Project
Volume
Spark
Executor
Project
Volume
Hive, HDFS,
HMS, Sentry,
Navigator
- Create user service accounts and namespace
for quota management & security
- Configure Spark driver / executor
configuration to connect to K8S API
- Populate kerberos related information for
accessing HDFS and other CDH services
- Populate dependency management related
configurations using pod template: image,
volumes, CPU/GPU/memory resources, etc
- Configure Spark logging and metric
information to push to external storage
HOW DOES SPARK IN CML WORK?

Más contenido relacionado

Más de Cloudera, Inc.

Más de Cloudera, Inc. (20)

Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18
 
Get started with Cloudera's cyber solution
Get started with Cloudera's cyber solutionGet started with Cloudera's cyber solution
Get started with Cloudera's cyber solution
 
Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18Spark and Deep Learning Frameworks at Scale 7.19.18
Spark and Deep Learning Frameworks at Scale 7.19.18
 
Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18Cloud Data Warehousing with Cloudera Altus 7.24.18
Cloud Data Warehousing with Cloudera Altus 7.24.18
 
How Cloudera SDX can aid GDPR compliance
How Cloudera SDX can aid GDPR complianceHow Cloudera SDX can aid GDPR compliance
How Cloudera SDX can aid GDPR compliance
 
When SAP alone is not enough
When SAP alone is not enoughWhen SAP alone is not enough
When SAP alone is not enough
 
Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Introducing Cloudera Machine Learning: Cloudera’s New Cloud-Native Platform for Enterprise Scale Machine Learning 1.15.19

  • 1. INTRODUCING CLOUDERA MACHINE LEARNING Cloud-Native Platform for Enterprise Scale Machine Learning Tristan Zajonc, CTO for Machine Learning
  • 2. 2 © Cloudera, Inc. All rights reserved. DISCLAIMER The information in this document is proprietary to Cloudera. No part of this document may be reproduced or disclosed without the express prior written permission of Cloudera. This document is not binding upon Cloudera in any way (including with respect to Cloudera’s course of business, product strategy, future releases, or development commitments), and is not a part of or subject to any license agreement or other agreement you may have with Cloudera. Cloudera makes no commitments about any future developments or functionality in any Cloudera product. The development, release, and timing of release of any software features or functionality described in this document remains at the discretion of Cloudera and you should not rely on any statements about development plans or anticipated functionality in this document when making any purchasing decisions. Cloudera assumes no responsibility for errors or omissions in this document. Cloudera does not warrant the accuracy or completeness of the information, text, graphics, links or other items contained within this material. This document is provided without a warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose or non-infringement. Cloudera shall have no liability for damages of any kind including without limitation direct, special, indirect or consequential damages that may result from the use of these materials.”
  • 3. Confidential-Restricted – For Discussion Purposes Only 3 © Cloudera, Inc. All rights reserved. Let’s get started. Have you • Used Spark? • Used Cloudera Data Science Workbench? • Used Kubernetes? • Used GPUs for deep learning? POLL QUESTION 1
  • 4. 4 © Cloudera, Inc. All rights reserved. THE OPPORTUNITY: THE INDUSTRIALIZATION OF AI PROTECT business CONNECT products & services (IoT) DRIVE customer insights Thousands of use cases for ML and AI to unlock value within the enterprise ● Predictive maintenance ● Logistics optimization ● Self-driving cars ● Medical diagnostics ● Marketing effectiveness ● Next best action ● Insider threat prevention ● Fraud prevention ● Payment integrity
  • 5. 5 © Cloudera, Inc. All rights reserved. A TIPPING POINT OF DATA, COMPUTE, AND ALGORITHMS Three major trends enable the potential of machine learning COMPUTEDATA! ALGORITHMS
  • 6. 6 © Cloudera, Inc. All rights reserved.
  • 7. 7 © Cloudera, Inc. All rights reserved. MACHINE LEARNING AT UBER, NETFLIX, AND FACEBOOK Industrialized AI requires requires new supporting tools and platforms Facebook FBLearner Uber Michelangelo Netflix Recommendation Platform
  • 8. 8 © Cloudera, Inc. All rights reserved. UBER Michelangelo: Uber’s Machine Learning Platform Source: https://eng.uber.com/michelangelo/ DATA CENTER PUBLIC CLOUD
  • 9. 9 © Cloudera, Inc. All rights reserved. NETFLIX Netflix recommendation platform Source: https://medium.com/netflix-techblog/netflix-at-spark-ai-summit-2018-5304749ed7fa PUBLIC CLOUD
  • 10. 10 © Cloudera, Inc. All rights reserved. FACEBOOK Facebook’s AI infrastructure Source: https://www.slideshare.net/KarthikMurugesan2/facebook-machine-learning-infrastructure-2018-slides DATA CENTER
  • 11. 11 © Cloudera, Inc. All rights reserved. LESSON 1: ML AT SCALE REQUIRES A UNIFIED DATA STRATEGY Streaming Ingest Batch Ingest Machine Learning Tools BI Tools and SQL Editors Data Products DATA, METADATA, SECURITY, GOVERNANCE, WORKLOAD MANAGEMENT MACHINE LEARNING DATA ENGINEERING DATA WAREHOUSE OPERATIONAL DATABASE
  • 12. 12 © Cloudera, Inc. All rights reserved. LESSON 2: EMERGING CONSENSUS ON MODERN ML STACK Big Data Platform ML/AI Frameworks Container Infrastructure
  • 13. Confidential-Restricted – For Discussion Purposes Only 13 © Cloudera, Inc. All rights reserved. LESSON 3: AGILITY REQUIRES SELF-SERVICE PLATFORM Manage pipelines + models Deploy models Automate pipelines Monitor performance DEPLOYDEVELOP Make teams more productive Explore data Develop reports, pipelines, models Collaborate with peers TRAIN Scale resources efficiently Train models Tune parameters Track performance End-to-end machine learning infrastructure for teams building at scale MANAGE Run anywhere with a common architecture Manage access and resources Scale cost with usage
  • 14. Confidential-Restricted – For Discussion Purposes Only 14 © Cloudera, Inc. All rights reserved. LESSON 4: LEVERAGE THE BEST INFRASTRUCTURE FOR TASK Non-Cloud Private Cloud Public Cloud PaaS I want to maximize • Cost-efficiency • Control, elasticity, and convenience • Control, elasticity, and convenience • Agility I want to minimize • Dependence on unproven technology • Resource contention between departments • Dependence on data center floor space • Dependence on IT and therefore need as simple as possible I want to standardize • On whatever provides the best ROI • On a single environment for the entire data center • On a single cloud provider for all infrastructure needs • On whatever is easiest to use I want to store my data • On premises because cheaper and/or more secure • On premises due to company / government mandate • In the cloud because easier • In the cloud because easier
  • 15. 15 © Cloudera, Inc. All rights reserved. TODAY: CLOUDERA DATA SCIENCE WORKBENCH Accelerate Machine Learning from Research to Production on CDH/HDP For data scientists • Experiment faster Use R, Python, or Scala with on- demand compute and secure CDH data access • Work together Share reproducible research with your whole team • Deploy with confidence Get to production repeatably and without recoding For IT professionals • Bring data science to the data Give your data science team more freedom while reducing the risk and cost of silos • Secure by default Leverage common security and governance across workloads • Run anywhere On-premises or in the cloud
  • 16. 16 © Cloudera, Inc. All rights reserved. TODAY: CURRENT CDSW ARCHITECTURE Containerized gateway to CDH/HDP cluster for data scientists • Benefits • Easy to attach for self-service data science • Seamless connectivity to existing clusters • Isolated, reproducible user environments • Tradeoffs • Requires sizing, managing two clusters • Requires dependency coordination • Not optimized for distributed training • Complex, inelastic setup in public cloud CDH/HDP CDH/HDP Cloudera Manager / Ambari gateway node(s) CDH/HDP nodes Hive, HDFS, ... CDSW CDSW ... Master ... Engine EngineEngine EngineEngine
  • 17. 17 © Cloudera, Inc. All rights reserved. INTRODUCING
  • 18. 18 © Cloudera, Inc. All rights reserved. 18 enable the end-to-end machine learning lifecycle for teams building thousands of applications across petabytes of data with an enterprise-ready platform built for the cloud Rapid provisioning and experimentation DevOps mindset From development to production Team collaboration and shared context Any framework, your data, your models Secure by default CPUs, GPUS, TPUs, and beyond Massive, diverse data Automation Runs anywhere
  • 19. 19 © Cloudera, Inc. All rights reserved. CLOUDERA MACHINE LEARNING (PRIVATE PREVIEW) Cloud-native enterprise machine learning platform DATA SCIENCE DATA ENGINEERING MODEL OPERATIONS CLOUDERA ML RUNTIME Python/R, Spark, TensorFlow, CPU/GPU-Optimized Interactive Development Batch Pipelines Predictive APIs Full capability of CDSW Rapid cloud provisioning and elastic autoscaling Unified data engineering and ML with seamless dependency management Multi-cloud portability powered by Kubernetes Connects to HDFS or cloud object storage and shared metadata Accelerated deep learning with distributed GPU training * Initially targeted for cloud managed K8s services, then OpenShift KUBERNETES EKS, AKS, GKE, OpenShift DATA CATALOG GOVERNANCESECURITY LIFECYCLEWORKLOAD XM STORAGE Amazon S3 Microsoft ADLS HDFS KUDU
  • 20. 20 © Cloudera, Inc. All rights reserved. Requires and extends CDH/HDP, pushing distributed compute to the cluster CDH / HDP CDH / HDP Cloudera Manager / Ambari gateway node(s) CDH/HDP nodes Hive, HDFS, ... CDSW CDSW ... Master ... Engine EngineEngine EngineEngine Cloud optimized, elastic distributed compute; can optionally use CDH / HDP Kubernetes (EKS, GKE, AKS, OpenShift) CDH / HDP CDH / HDP Hive, HDFS, HMS, Sentry, Navigator CML CML ... Master ... Engine EngineEngine EngineEngine Object Store Altus Director / CloudBreak OPTIONAL DATA SCIENCE WORKBENCH vs. CLOUDERA MACHINE LEARNING Cloud ML APIs
  • 21. 21 © Cloudera, Inc. All rights reserved. • Lightweight form-factor optimized for cloud and private cloud • Elastic compute (CPU / GPU) in the cloud • Unified workflow for distributed data engineering and machine learning • Simple dependency management, particularly for ML workloads • python: pip install, R: install.packages, etc WHY SPARK ON KUBERNETES?
  • 22. 22 © Cloudera, Inc. All rights reserved. cldr ml run --cmd “python train.py”
  • 23. 23 © Cloudera, Inc. All rights reserved. from pyspark.sql import SparkSession spark = SparkSession.builder.getOrCreate() results = spark.sparkContext .parallelize(xrange(0, 1000)) .map(f) .collect()
  • 24. 24 © Cloudera, Inc. All rights reserved. DEMO
  • 25. Confidential-Restricted – For Discussion Purposes Only25 © Cloudera, Inc. All rights reserved. OPERATIONAL EXPERIENCE ✓ Simple installation and management ✓ Lower costs with elastic compute ✓ Easier to support through self-service ✓ Cloud agility everywhere RESEARCH EXPERIENCE ✓ Instant access to resources ✓ Faster model training and iteration ✓ Unified data engineering and ML ✓ Easier to onboard team members DEPLOYMENT EXPERIENCE ✓ Easier to scale out compute ✓ Faster deployment of models ✓ Better reproducibility and isolation ✓ Flexible deployment targets BENEFITS OF A CLOUD-NATIVE ML PLATFORM
  • 26. Confidential-Restricted – For Discussion Purposes Only 26 © Cloudera, Inc. All rights reserved. What capability of Cloudera ML is most interesting for your workloads? • Simple installation and management • Autoscaling in the cloud • Dependency management for Spark • Distributed GPU training • Cloud experience on-premise • Potential multi-tenancy with other Kubernetes workloads POLL QUESTION 2
  • 27. © Cloudera, Inc. All rights reserved. Request access to the private preview tiny.cloudera.com/CML QUESTIONS?
  • 29. 29 © Cloudera, Inc. All rights reserved. Kubernetes CMLCML Engine Spark Driver Spark Executor Logging Logging Project Volume Project Volume Spark Executor Project Volume Hive, HDFS, HMS, Sentry, Navigator - Create user service accounts and namespace for quota management & security - Configure Spark driver / executor configuration to connect to K8S API - Populate kerberos related information for accessing HDFS and other CDH services - Populate dependency management related configurations using pod template: image, volumes, CPU/GPU/memory resources, etc - Configure Spark logging and metric information to push to external storage HOW DOES SPARK IN CML WORK?