© Cloudera, Inc. All rights reserved.
Apache Hadoop YARN Containerized Services:
Fading The Lines Between On-Prem And Cloud
Billie Rinaldi
AGENDA
Emergence of Containers
Journey to a Container Cloud
Building Blocks of a Container Cloud
YARN Service APIs
YARN Service Examples
Enabling Hybrid Deployments
CONTAINERIZATION IS GAINING MOMENTUM
• Industry adoption continues
  • “Number of containerized applications will rise by 80% in the next two years” [1]
  • Multi-cloud and hybrid strategies
  • Adoption of microservices
• Exponential ecosystem growth
  • Dozens of container orchestrators
  • Thousands of plugins
• Market moves
1. http://i.dell.com/sites/doccontent/business/solutions/whitepapers/en/Documents/Containers_Real_Adoption_2017_Dell_EMC_Forrester_Paper.pdf
WHY ARE CONTAINERS GAINING POPULARITY?
• Improved hardware utilization through increased density
  • No virtual machine operating system overhead
  • Image layer reuse limits data duplication on disk
• Strong resource isolation
  • Namespaces and cgroups
• Better software packaging
  • Package applications and dependencies together
  • Improved reuse vs. VM images
  • Distribution mechanism
• Improved developer self-service
  • More control over the execution environment
• Promise of portability
  • On-premises and across multiple clouds
CONTAINER ARCHITECTURE PATTERNS
• Mix of services
  • Long-lived services and ephemeral/batch jobs
• Decoupled compute and storage
  • Scale independently
• Hybrid deployments
  • Desire for consistency between cloud and on-premises
ON PREM VS. CLOUD: VERY DIFFERENT MODELS
Cloud
• Multiple clusters
• Decoupled compute and storage
• Infrastructure as a Service
• Improved agility and self-service
On Prem
• Large, multi-tenant clusters
• Co-located compute and storage
• Shared security and governance
• Less agile due to physical hardware
[Figure: an on-prem data center with co-located compute and storage running EDW, stream processing, data science, and operations under shared security and governance, contrasted with multiple public cloud clusters (EDW, stream processing, data science) on decoupled public cloud compute and storage]
WHAT IS NEEDED TO BRIDGE THE GAP?
Across clusters
• Consistent deployment, security, and governance
Within clusters
• Decoupled compute and storage
• Eliminate physical hardware as a barrier to agility
How does Apache Hadoop YARN help enable portability?
JOURNEY TO A CONTAINER CLOUD
• Started off with on-prem hardware
• Quickly exceeded capacity, moved to public cloud
  • Costs were higher than we wanted
  • Bigger concern was the rate of the expense growth
• Then back to on-prem
  • VM-based infrastructure
  • CloudStack followed by OpenStack
• Challenges before container cloud
  • Low density
  • Significant overhead per test
  • Many images with minimal differences, limited composition
• More and more tests and products on-boarding
  • The existing environment could no longer keep up with the testing demands
ASSESSING THE CHALLENGES
• How is the industry addressing these same challenges?
• Can we leverage our existing investment in hardware?
• How to reduce overhead, improve density and hardware utilization?
• What about improving reuse of packaging and automation?
SOLUTION: ON-PREM CONTAINER CLOUD BUILT ON YARN
• Containers (think Docker)
  • Containers eliminate the bulk of the virtualization overhead
  • Containers help improve reuse of images through composition
  • Container startup time is fast, with no real boot sequence
• Apache Hadoop YARN
  • Good technical fit
  • Good strategic fit
WHY YARN? (Container Model)
• YARN is Apache Hadoop’s resource management framework
• At its core, YARN is responsible for orchestrating “containers” across a collection of servers
• What is a YARN container?
  • Linux process
  • Local resources (scripts, jars, security tokens)
  • Resource constraints (CPU, memory, IO)
• Aligns well with container technologies such as Docker
WHY YARN? (Strategic Advantages)
• YARN is widely deployed
• YARN is a superior scheduler
  • Hardened by customer feedback
• Leverage our existing expertise
  • “Use what we ship and ship what we use”
• No big leap to containerization
  • Existing “Hadoop native” frameworks run unchanged on the same infrastructure
DOGFOODING: CONTAINER CLOUD FOR RELEASE TESTING
Testing HDP and HDF releases in container clusters (soon CDH)
[Figure: Jenkins, Worker (Docker) containers, and HDP (Docker) clusters connected by SubmitTest and LaunchTest steps, running on shared services: Resource Management (YARN), Management and Monitoring (Ambari), Storage (HDFS), Service Discovery and REST API (YARN Services), Security and Governance (Ranger and Atlas)]
BUILDING BLOCKS FOR A CONTAINER CLOUD ON YARN
• YARN Container Runtimes – Enables support for Docker containers to make it easier to onboard new applications and services on YARN.
• YARN Services Framework – Provides an ApplicationMaster (AM) implementation, a REST API, and various improvements to enable long-running services on YARN.
• YARN Service Discovery – Allows services running on YARN to discover one another.
NEW ABSTRACTION: YARN CONTAINER RUNTIMES
Choose the Container Runtime at app submission time!
• DefaultLinuxContainerRuntime: existing Linux process based execution
• DockerLinuxContainerRuntime: using Docker to run and monitor the containers
DISTRIBUTED SHELL AND MAPREDUCE EXAMPLES
Only difference is setting environment variables!
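As a rough illustration of the point above, the sketch below renders the Docker runtime selection as environment variables for the two example applications. The variable names YARN_CONTAINER_RUNTIME_TYPE and YARN_CONTAINER_RUNTIME_DOCKER_IMAGE, the distributed shell -shell_env flag, and the mapreduce.map.env property come from the Apache Hadoop documentation rather than from these slides, so treat them as assumptions to check against your Hadoop version. The helper function names are hypothetical.

```python
def docker_runtime_env(image):
    """Environment variables that ask YARN to launch a container with Docker.

    Assumption: variable names follow the Apache Hadoop Docker runtime docs.
    """
    return {
        "YARN_CONTAINER_RUNTIME_TYPE": "docker",
        "YARN_CONTAINER_RUNTIME_DOCKER_IMAGE": image,
    }


def distributed_shell_args(env):
    """Render the variables as distributed shell '-shell_env KEY=VALUE' pairs."""
    args = []
    for key, value in env.items():
        args += ["-shell_env", f"{key}={value}"]
    return args


def mapreduce_env_option(env):
    """Render the variables as a single 'mapreduce.map.env' job property."""
    pairs = ",".join(f"{key}={value}" for key, value in env.items())
    return f"-Dmapreduce.map.env={pairs}"


if __name__ == "__main__":
    env = docker_runtime_env("library/centos:7")
    print(" ".join(distributed_shell_args(env)))
    print(mapreduce_env_option(env))
```

Leaving the variables out would keep the default Linux process based runtime, which is why the rest of the submission command can stay the same.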
DOCKER CONTAINER SUPPORT EVOLVING
• Recent Efforts
  • Container security
    • ACLs for privileged containers
    • Improved out-of-the-box security for untrusted images
  • Entrypoint support (systemd as PID-1 fixes)
  • Exec to container support
• Ongoing Efforts
  • Improving image management and lifecycle (YARN-9228)
  • runc/squashfs (YARN-9014)
  • CSI support (YARN-8811)
YARN SERVICES FRAMEWORK OVERVIEW
• Long Running
  • Simplify the deployment and management of long running apps on YARN
• Easy Onboarding
  • Remove tedious process of bringing new services to YARN
• Declarative Configuration
  • JSON specification describing the desired state for the service to be managed
• Standard Interfaces
  • REST API that lives in the Resource Manager, CLI tools for clients
DEFINING SERVICES THROUGH THE JSON SPEC
$ curl -H "Content-Type: application/json" -X POST http://RM_HOST:8088/app/v1/services -d @sleeper.json
• This spec creates two component instances, sleeper-0 and sleeper-1
• Optional features include readiness checks, placement policies, and creating / mounting resources such as config files
$ yarn app -launch serviceName sleeper.json
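The spec itself is not reproduced in this transcript. The sketch below shows what a sleeper.json with two instances might contain, built as a Python dictionary. The field names (name, version, components, number_of_containers, launch_command, resource) follow the quick start in the Apache Hadoop YARN services documentation and are assumptions here, as are the launch command and resource values. The helper names are hypothetical.

```python
import json


def sleeper_spec(instances=2):
    """Build a minimal service spec with one component named 'sleeper'.

    Assumption: field names follow the Apache Hadoop YARN services docs;
    the launch command and resource sizes are illustrative only.
    """
    return {
        "name": "sleeper-service",
        "version": "1.0.0",
        "components": [
            {
                "name": "sleeper",
                "number_of_containers": instances,
                "launch_command": "sleep 900000",
                "resource": {"cpus": 1, "memory": "256"},
            }
        ],
    }


def instance_names(spec):
    """List the component instance names the spec implies (name-0, name-1, ...)."""
    names = []
    for component in spec["components"]:
        for index in range(component["number_of_containers"]):
            names.append(f"{component['name']}-{index}")
    return names


if __name__ == "__main__":
    spec = sleeper_spec()
    # Redirect this output to sleeper.json to use it with the commands above.
    print(json.dumps(spec, indent=2))
    print(instance_names(spec))
```

Because the spec is declarative, changing the instance count is a one-field edit rather than a change to any launch logic.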
SIMPLIFIED SERVICE DISCOVERY VIA DNS
Existing YARN Service Registry
• Allows apps to register themselves
• Stores entries in Apache ZooKeeper
• Provides native Java, REST, and CLI clients to enable service discovery
YARN Registry DNS Server
• Watches the YARN Service Registry (ZK) for new application and container records
• Creates user-friendly DNS records based on the records
• Supports zone transfers, zone forwarding, upstream querying, and DNSSEC
Examples:
componentInstanceName.serviceName.user.domain
sleeper-0.sleeper-service.billie.domain
ctr-e138-1518143905142-215498-01-000007.domain
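The naming pattern in the examples above can be sketched as two small functions. The first follows the componentInstanceName.serviceName.user.domain pattern shown on the slide. The second is an inference from the example format only: it assumes a YARN container ID of the form container_e138_... maps to a ctr-e138-... host name, which the slides do not state. Both function names are hypothetical.

```python
def component_dns_name(instance, service, user, domain):
    """Join the parts in the order componentInstanceName.serviceName.user.domain."""
    return ".".join([instance, service, user, domain])


def container_dns_name(container_id, domain):
    """Derive a per-container host name from a YARN container ID.

    Assumption (inferred from the slide example, not confirmed by it):
    the 'container_' prefix becomes 'ctr-' and underscores become hyphens.
    """
    host = container_id.replace("container_", "ctr-", 1).replace("_", "-")
    return f"{host}.{domain}"


if __name__ == "__main__":
    print(component_dns_name("sleeper-0", "sleeper-service", "billie", "domain"))
    print(container_dns_name("container_e138_1518143905142_215498_01_000007", "domain"))
```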
YARN SERVICE REST API
Create a service
POST URL - http://RM_HOST:8088/app/v1/services
Get service status
GET URL - http://RM_HOST:8088/app/v1/services/tensorflow
Update service
PUT URL - http://RM_HOST:8088/app/v1/services/tensorflow
• Extend lifetime
• STOP service
• START service
• Flex UP/DOWN the # of containers of one or more components
• DELETE (destroy) service
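The operations above can be sketched as request builders that construct, but do not send, HTTP requests with the Python standard library. The base URL uses the RM_HOST placeholder from the slide. The request bodies (a "state" field for stop and start, and a per-component endpoint taking "number_of_containers" for flexing) follow the Apache Hadoop YARN services API documentation and are assumptions relative to this deck. The component name "worker" and all function names are hypothetical.

```python
import json
from urllib.request import Request

RM_BASE = "http://RM_HOST:8088/app/v1/services"  # placeholder host from the slide


def _json_request(method, url, payload=None):
    """Build (but do not send) a JSON request for the YARN Service REST API."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    return Request(url, data=data, method=method,
                   headers={"Content-Type": "application/json"})


def create_service(spec):
    return _json_request("POST", RM_BASE, spec)


def get_status(name):
    return _json_request("GET", f"{RM_BASE}/{name}")


def set_state(name, state):
    # Assumption: stop/start are expressed as {"state": "STOPPED" | "STARTED"}.
    return _json_request("PUT", f"{RM_BASE}/{name}", {"state": state})


def flex_component(name, component, count):
    # Assumption: flexing targets a per-component URL with number_of_containers.
    url = f"{RM_BASE}/{name}/components/{component}"
    return _json_request("PUT", url, {"number_of_containers": count})


def destroy_service(name):
    return _json_request("DELETE", f"{RM_BASE}/{name}")


if __name__ == "__main__":
    for req in (get_status("tensorflow"),
                set_state("tensorflow", "STOPPED"),
                flex_component("tensorflow", "worker", 3),
                destroy_service("tensorflow")):
        print(req.get_method(), req.full_url, req.data)
```

Passing any of these objects to urllib.request.urlopen would send it; a secured cluster would likely also need authentication, which this sketch leaves out.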
YARN APP CLI
Usage: yarn app
-launch serviceName jsonfile
-flex serviceName -component componentName count
-save serviceName jsonfile
-start serviceName
-status serviceName
-stop serviceName
-destroy serviceName
DOCKER EXAMPLE
To convert the sleeper example into a Docker example, add an artifact:
"artifact": {
  "id": "library/centos:7",
  "type": "DOCKER"
}
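A minimal sketch of that conversion, assuming the artifact is attached to each component of the spec (the slide does not show where in the JSON it goes). The baseline spec and its field names other than "artifact", "id", and "type" are assumptions drawn from the Apache Hadoop YARN services documentation, and the function name is hypothetical.

```python
import copy

# Hypothetical baseline mirroring the sleeper example; only the "artifact"
# block added below is taken from the slide.
SLEEPER_SPEC = {
    "name": "sleeper-service",
    "version": "1.0.0",
    "components": [
        {
            "name": "sleeper",
            "number_of_containers": 2,
            "launch_command": "sleep 900000",
            "resource": {"cpus": 1, "memory": "256"},
        }
    ],
}


def with_docker_artifact(spec, image_id):
    """Return a copy of the spec whose components run from a Docker image."""
    converted = copy.deepcopy(spec)  # leave the caller's spec untouched
    for component in converted["components"]:
        component["artifact"] = {"id": image_id, "type": "DOCKER"}
    return converted


if __name__ == "__main__":
    docker_spec = with_docker_artifact(SLEEPER_SPEC, "library/centos:7")
    print(docker_spec["components"][0]["artifact"])
```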
APACHE HBASE TARBALL EXAMPLE
HBase tarball service
● TARBALL artifact type
● ENV variables
● Config files
APACHE HBASE DOCKER EXAMPLE
● Replace TARBALL artifact with DOCKER artifact
● Remove unneeded env vars and add Docker mounts
● Optionally use absolute paths for generated config files
● Remove unneeded config files that already exist in the image
● Adjust launch command based on location in image
REALITY: MULTI-CLOUD AND ON-PREM
[Figure: map of deployment locations, including Canada East (GCP)]
ON PREM VS. CLOUD: BRIDGING THE GAP
Cloud
• Shared Sec/Gov Services, Multi-Cluster, Multi-Cloud
On Prem
• Shared Sec/Gov Services, Multi-Cluster, Containerized
[Figure: public cloud compute and storage alongside an Apache Hadoop YARN container cloud on data center storage; each side runs data science, stream processing, and EDW clusters under shared security, governance, and operations]
CLOUDERA DATA PLATFORM
• Public, private & hybrid cloud
• Shared data experience
• Powered by open source
• Analytics from the Edge to AI
• Unified data control plane
[Figure: platform layers]
• Infrastructure: Private Cloud, Hybrid Cloud, Public Multi-Cloud, Edge
• Data management (DSX): Catalog | Schema | Migration | Security | Governance
• Analytic experiences: Data Flow & Streaming, Data Engineering, Data Warehouse, Operational Database, Machine Learning
• Unified control plane (Altus DataPlane): Identity | Orchestration | Management | Operations
THANK YOU

Más contenido relacionado

La actualidad más candente

Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...DataWorks Summit
 
Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhereDocker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhereDataWorks Summit
 
What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014InMobi Technology
 
One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)DataWorks Summit
 
20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 Andrey Vykhodtsev
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on DockerRakesh Saha
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive DataWorks Summit
 
Building Effective Apache Geode Applications with Spring Data GemFire
Building Effective Apache Geode Applications with Spring Data GemFireBuilding Effective Apache Geode Applications with Spring Data GemFire
Building Effective Apache Geode Applications with Spring Data GemFireJohn Blum
 
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test Results
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test ResultsUncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test Results
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test ResultsDataWorks Summit
 
Migrating Analytics to the Cloud at Fannie Mae
Migrating Analytics to the Cloud at Fannie MaeMigrating Analytics to the Cloud at Fannie Mae
Migrating Analytics to the Cloud at Fannie MaeDataWorks Summit
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?DataWorks Summit
 
Sharing metadata across the data lake and streams
Sharing metadata across the data lake and streamsSharing metadata across the data lake and streams
Sharing metadata across the data lake and streamsDataWorks Summit
 
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase DataWorks Summit
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)Anthony Baker
 
Hive 3 - a new horizon
Hive 3 - a new horizonHive 3 - a new horizon
Hive 3 - a new horizonThejas Nair
 
Leveraging docker for hadoop build automation and big data stack provisioning
Leveraging docker for hadoop build automation and big data stack provisioningLeveraging docker for hadoop build automation and big data stack provisioning
Leveraging docker for hadoop build automation and big data stack provisioningEvans Ye
 
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...DataWorks Summit
 

La actualidad más candente (20)

Empower Hive with Spark
Empower Hive with SparkEmpower Hive with Spark
Empower Hive with Spark
 
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
Hive LLAP: A High Performance, Cost-effective Alternative to Traditional MPP ...
 
Intro to Apache Spark
Intro to Apache SparkIntro to Apache Spark
Intro to Apache Spark
 
Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhereDocker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhere
 
What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014What's new in Hadoop Yarn- Dec 2014
What's new in Hadoop Yarn- Dec 2014
 
One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)
 
20150716 introduction to apache spark v3
20150716 introduction to apache spark v3 20150716 introduction to apache spark v3
20150716 introduction to apache spark v3
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on Docker
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Building Effective Apache Geode Applications with Spring Data GemFire
Building Effective Apache Geode Applications with Spring Data GemFireBuilding Effective Apache Geode Applications with Spring Data GemFire
Building Effective Apache Geode Applications with Spring Data GemFire
 
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test Results
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test ResultsUncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test Results
Uncovering an Apache Spark 2 Benchmark - Configuration, Tuning and Test Results
 
Migrating Analytics to the Cloud at Fannie Mae
Migrating Analytics to the Cloud at Fannie MaeMigrating Analytics to the Cloud at Fannie Mae
Migrating Analytics to the Cloud at Fannie Mae
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?
 
Sharing metadata across the data lake and streams
Sharing metadata across the data lake and streamsSharing metadata across the data lake and streams
Sharing metadata across the data lake and streams
 
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase
PayPal merchant ecosystem using Apache Spark, Hive, Druid, and HBase
 
Kafka Security
Kafka SecurityKafka Security
Kafka Security
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)
 
Hive 3 - a new horizon
Hive 3 - a new horizonHive 3 - a new horizon
Hive 3 - a new horizon
 
Leveraging docker for hadoop build automation and big data stack provisioning
Leveraging docker for hadoop build automation and big data stack provisioningLeveraging docker for hadoop build automation and big data stack provisioning
Leveraging docker for hadoop build automation and big data stack provisioning
 
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
 

Similar a YARN Containerized Services: Fading The Lines Between On-Prem And Cloud

Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNBillie Rinaldi
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNDataWorks Summit
 
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...Yahoo Developer Network
 
Running a container cloud on YARN
Running a container cloud on YARNRunning a container cloud on YARN
Running a container cloud on YARNDataWorks Summit
 
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...VMworld
 
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...IndicThreads
 
20191201 kubernetes managed weblogic revival - part 1
20191201 kubernetes managed weblogic revival - part 120191201 kubernetes managed weblogic revival - part 1
20191201 kubernetes managed weblogic revival - part 1makker_nl
 
The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)Simon Haslam
 
Running Kubernetes Workloads on Oracle Cloud Infrastructure
Running Kubernetes Workloads on Oracle Cloud InfrastructureRunning Kubernetes Workloads on Oracle Cloud Infrastructure
Running Kubernetes Workloads on Oracle Cloud InfrastructureOracle Developers
 
컨테이너 기술 소개 - Warden, Garden, Docker
컨테이너 기술 소개 - Warden, Garden, Docker컨테이너 기술 소개 - Warden, Garden, Docker
컨테이너 기술 소개 - Warden, Garden, Dockerseungdon Choi
 
Rightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-cloudsRightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-cloudsRightScale
 
Docker Containers Deep Dive
Docker Containers Deep DiveDocker Containers Deep Dive
Docker Containers Deep DiveWill Kinard
 
DevConf.cz - Introduction to Kubernetes Operators for Databases
DevConf.cz - Introduction to Kubernetes Operators for DatabasesDevConf.cz - Introduction to Kubernetes Operators for Databases
DevConf.cz - Introduction to Kubernetes Operators for DatabasesJuarez Junior
 
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdf
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdfGrow Your Business with Oracle Linux, Virtualization- BL v6.pdf
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdfsyamsulsakbar
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015WaveMaker, Inc.
 
oci-container-engine-oke-100.pdf
oci-container-engine-oke-100.pdfoci-container-engine-oke-100.pdf
oci-container-engine-oke-100.pdfNandiniSinghal16
 
Docker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup SlidesDocker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup SlidesDocker, Inc.
 
Oracle IaaS including OCM and Ravello
Oracle IaaS including OCM and RavelloOracle IaaS including OCM and Ravello
Oracle IaaS including OCM and RavelloAndrey Akulov
 
Using MySQL in the Cloud
Using MySQL in the CloudUsing MySQL in the Cloud
Using MySQL in the CloudMatt Lord
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realistsKarthik Gaekwad
 

Similar a YARN Containerized Services: Fading The Lines Between On-Prem And Cloud (20)

Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
 
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
Containerized Services on Apache Hadoop YARN: Past, Present, and Future, Shan...
 
Running a container cloud on YARN
Running a container cloud on YARNRunning a container cloud on YARN
Running a container cloud on YARN
 
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
 
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...
Current State of Affairs – Cloud Computing - Indicthreads Cloud Computing Con...
 
20191201 kubernetes managed weblogic revival - part 1
20191201 kubernetes managed weblogic revival - part 120191201 kubernetes managed weblogic revival - part 1
20191201 kubernetes managed weblogic revival - part 1
 
The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)The Kubernetes WebLogic revival (part 1)
The Kubernetes WebLogic revival (part 1)
 
Running Kubernetes Workloads on Oracle Cloud Infrastructure
Running Kubernetes Workloads on Oracle Cloud InfrastructureRunning Kubernetes Workloads on Oracle Cloud Infrastructure
Running Kubernetes Workloads on Oracle Cloud Infrastructure
 
컨테이너 기술 소개 - Warden, Garden, Docker
컨테이너 기술 소개 - Warden, Garden, Docker컨테이너 기술 소개 - Warden, Garden, Docker
컨테이너 기술 소개 - Warden, Garden, Docker
 
Rightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-cloudsRightscale webinar-key-design-considerations-private-hybrid-clouds
Rightscale webinar-key-design-considerations-private-hybrid-clouds
 
Docker Containers Deep Dive
Docker Containers Deep DiveDocker Containers Deep Dive
Docker Containers Deep Dive
 
DevConf.cz - Introduction to Kubernetes Operators for Databases
DevConf.cz - Introduction to Kubernetes Operators for DatabasesDevConf.cz - Introduction to Kubernetes Operators for Databases
DevConf.cz - Introduction to Kubernetes Operators for Databases
 
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdf
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdfGrow Your Business with Oracle Linux, Virtualization- BL v6.pdf
Grow Your Business with Oracle Linux, Virtualization- BL v6.pdf
 
Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015Docker & aPaaS: Enterprise Innovation and Trends for 2015
Docker & aPaaS: Enterprise Innovation and Trends for 2015
 
oci-container-engine-oke-100.pdf
oci-container-engine-oke-100.pdfoci-container-engine-oke-100.pdf
oci-container-engine-oke-100.pdf
 
Docker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup SlidesDocker Datacenter Overview and Production Setup Slides
Docker Datacenter Overview and Production Setup Slides
 
Oracle IaaS including OCM and Ravello
Oracle IaaS including OCM and RavelloOracle IaaS including OCM and Ravello
Oracle IaaS including OCM and Ravello
 
Using MySQL in the Cloud
Using MySQL in the CloudUsing MySQL in the Cloud
Using MySQL in the Cloud
 
Containers, microservices and serverless for realists
Containers, microservices and serverless for realistsContainers, microservices and serverless for realists
Containers, microservices and serverless for realists
 

YARN Containerized Services: Fading The Lines Between On-Prem And Cloud

  • 1. © Cloudera, Inc. All rights reserved. Apache Hadoop YARN Containerized Services: Fading The Lines Between On-Prem And Cloud Billie Rinaldi
  • 2. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 3. CONTAINERIZATION IS GAINING MOMENTUM • Industry adoption continues • “Number of containerized applications will rise by 80% in the next two years” [1] • Multi-cloud and hybrid strategies • Adoption of microservices • Exponential ecosystem growth • Dozens of container orchestrators • Thousands of plugins • Market moves • [1] http://i.dell.com/sites/doccontent/business/solutions/whitepapers/en/Documents/Containers_Real_Adoption_2017_Dell_EMC_Forrester_Paper.pdf
  • 4. WHY ARE CONTAINERS GAINING POPULARITY? • Improved hardware utilization through increased density • No virtual machine operating system overhead • Image layer reuse limits data duplication on disk • Strong resource isolation • Namespaces and cgroups • Better software packaging • Package applications and dependencies together • Improved reuse vs VM images • Distribution mechanism • Improved developer self service • More control over the execution environment • Promise of portability • On-premises and across multiple clouds
  • 5. CONTAINER ARCHITECTURE PATTERNS • Mix of services • Long lived services and ephemeral/batch jobs • Decoupled compute and storage • Scale independently • Hybrid deployments • Desire for consistency between cloud and on-premises
  • 6. ON PREM VS. CLOUD: VERY DIFFERENT MODELS • Cloud: multiple clusters; decoupled compute and storage; Infrastructure as a Service; improved agility and self-service • On Prem: large, multi-tenant clusters; co-located compute and storage; shared security and governance; less agile due to physical hardware • [Diagram labels: Public Cloud Compute, Public Cloud Storage, Compute, Data Center Storage, Security & Governance, Operations; workloads: EDW, Stream Processing, Data Science]
  • 7. WHAT IS NEEDED TO BRIDGE THE GAP? • Across clusters: consistent deployment, security, and governance • Within clusters: decoupled compute and storage; eliminate physical hardware as a barrier to agility • How does Apache Hadoop YARN help enable portability?
  • 8. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 9. JOURNEY TO A CONTAINER CLOUD • Started off with on-prem hardware • Quickly exceeded capacity, moved to public cloud • Costs were higher than we wanted • Bigger concern was the rate of the expense growth • Then back to on-prem • VM based infrastructure • CloudStack followed by OpenStack • Challenges before container cloud • Low density • Significant overhead per test • Many images with minimal differences, limited composition • More and more tests and products on-boarding • The existing environment could no longer keep up with the testing demands
  • 10. ASSESSING THE CHALLENGES • How is the industry addressing these same challenges? • Can we leverage our existing investment in hardware? • How to reduce overhead, improve density and hardware utilization? • What about improving reuse of packaging and automation?
  • 11. SOLUTION: ON-PREM CONTAINER CLOUD BUILT ON YARN • Containers (think Docker) • Containers eliminate the bulk of the virtualization overhead • Containers help improve reuse of images through composition • Container startup time is fast, no real boot sequence • Apache Hadoop YARN • Good technical fit • Good strategic fit
  • 12. WHY YARN? Container Model • YARN is Apache Hadoop’s resource management framework • At its core, YARN is responsible for orchestrating “containers” across a collection of servers • What is a YARN container? • Linux process • Local resources (scripts, jars, security tokens) • Resource constraints (CPU, memory, IO) • Aligns well with container technologies such as Docker
  • 13. WHY YARN? Strategic Advantages • YARN is widely deployed • YARN is a superior scheduler • Hardened by customer feedback • Leverage our existing expertise • “Use what we ship and ship what we use” • No big leap to containerization • Existing “Hadoop native” frameworks run unchanged on the same infrastructure
  • 14. DOGFOODING: CONTAINER CLOUD FOR RELEASE TESTING • Testing HDP and HDF releases in container clusters (soon CDH) • [Diagram labels: Jenkins; Submit Test; Launch Test; Worker (Docker); HDP (Docker); Shared Services: Resource Management (YARN), Management and Monitoring (Ambari), Storage (HDFS), Service Discovery and REST API (YARN Services), Security and Governance (Ranger and Atlas)]
  • 15. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 16. BUILDING BLOCKS FOR A CONTAINER CLOUD ON YARN • YARN Container Runtimes – Enables support for Docker containers to make it easier to onboard new applications and services on YARN. • YARN Services Framework – Provides AM implementation, REST API, and various improvements to enable long running services on YARN. • YARN Service Discovery – Allows services running on YARN to discover one another.
  • 17. BUILDING BLOCKS FOR A CONTAINER CLOUD ON YARN • YARN Container Runtimes – Enables support for Docker containers to make it easier to onboard new applications and services on YARN. • YARN Services Framework – Provides AM implementation, REST API, and various improvements to enable long running services on YARN. • YARN Service Discovery – Allows services running on YARN to discover one another.
  • 18. NEW ABSTRACTION: YARN CONTAINER RUNTIMES • Choose the container runtime at app submission time! • DefaultLinuxContainerRuntime: existing Linux process based execution • DockerLinuxContainerRuntime: uses Docker to run and monitor the containers
  • 19. DISTRIBUTED SHELL AND MAPREDUCE EXAMPLES • Only difference is setting environment variables!
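The slide's point, that switching to the Docker runtime is only a matter of environment variables, can be sketched as follows. This is a minimal illustration, not the deck's own example (the slide's screenshots are not preserved in the transcript). The variable names `YARN_CONTAINER_RUNTIME_TYPE` and `YARN_CONTAINER_RUNTIME_DOCKER_IMAGE`, the distributed shell `-shell_env` flag, and the three MapReduce `*.env` properties are taken from the Hadoop Docker-on-YARN documentation as best recalled and should be checked against the Hadoop version in use; the image name is a placeholder.

```python
import shlex


def docker_runtime_env(image):
    """Variables that request the Docker runtime for a YARN container (assumed names)."""
    return {
        "YARN_CONTAINER_RUNTIME_TYPE": "docker",
        "YARN_CONTAINER_RUNTIME_DOCKER_IMAGE": image,
    }


def distributed_shell_args(env):
    """Distributed shell: one '-shell_env KEY=VALUE' pair per variable."""
    args = []
    for key, value in env.items():
        args += ["-shell_env", f"{key}={value}"]
    return args


def mapreduce_args(env):
    """MapReduce: the same variables, comma-joined, for the AM, map, and reduce tasks."""
    joined = ",".join(f"{key}={value}" for key, value in env.items())
    props = ["yarn.app.mapreduce.am.env", "mapreduce.map.env", "mapreduce.reduce.env"]
    return [f"-D{prop}={joined}" for prop in props]


env = docker_runtime_env("library/centos:7")  # placeholder image
print(shlex.join(distributed_shell_args(env)))
print(shlex.join(mapreduce_args(env)))
```

The rest of each submission command (jar paths, job arguments) is left out on purpose, since it stays the same whichever runtime is chosen.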
  • 20. DOCKER CONTAINER SUPPORT EVOLVING • Recent Efforts • Container security • ACLs for privileged containers • Improved out-of-the-box security for untrusted images • Entrypoint support (systemd as PID 1 fixes) • Exec to container support • Ongoing Efforts • Improving image management and lifecycle (YARN-9228) • runc/squashfs (YARN-9014) • CSI support (YARN-8811)
  • 21. BUILDING BLOCKS FOR A CONTAINER CLOUD ON YARN • YARN Container Runtimes – Enables support for Docker containers to make it easier to onboard new applications and services on YARN. • YARN Services Framework – Provides AM implementation, REST API, and various improvements to enable long running services on YARN. • YARN Service Discovery – Allows services running on YARN to discover one another.
  • 22. YARN SERVICES FRAMEWORK OVERVIEW • Long Running • Simplify the deployment and management of long running apps on YARN • Easy Onboarding • Remove tedious process of bringing new services to YARN • Declarative Configuration • JSON specification describing the desired state for the service to be managed • Standard Interfaces • REST API that lives in the Resource Manager, CLI tools for clients
  • 23. DEFINING SERVICES THROUGH THE JSON SPEC • REST: $ curl -H "Content-Type: application/json" -X POST http://RM_HOST:8088/app/v1/services -d @sleeper.json • CLI: $ yarn app -launch serviceName sleeper.json • This spec creates two component instances, sleeper-0 and sleeper-1 • Optional features include readiness checks, placement policies, and creating / mounting resources such as config files
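The JSON spec shown on this slide did not survive in the transcript. The sketch below reconstructs a plausible `sleeper.json` from the sleeper example in the Apache Hadoop YARN services documentation; the field names (`number_of_containers`, `launch_command`, `resource`) and values are therefore assumptions about what the slide displayed, not a copy of it. The helper `instance_names` is hypothetical and only illustrates why a component named `sleeper` with two containers yields `sleeper-0` and `sleeper-1`.

```python
import json


def sleeper_spec(instances=2):
    """Desired state for a service with a single 'sleeper' component (assumed fields)."""
    return {
        "name": "sleeper-service",
        "version": "1.0.0",
        "components": [
            {
                "name": "sleeper",
                "number_of_containers": instances,
                "launch_command": "sleep 900000",
                "resource": {"cpus": 1, "memory": "256"},
            }
        ],
    }


def instance_names(spec):
    """Component instances are named <component>-<index>, starting at 0."""
    names = []
    for component in spec["components"]:
        for index in range(component["number_of_containers"]):
            names.append(f"{component['name']}-{index}")
    return names


spec = sleeper_spec()
print(json.dumps(spec, indent=2))  # content one might save as sleeper.json
print(instance_names(spec))
```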
  • 24. BUILDING BLOCKS FOR A CONTAINER CLOUD ON YARN • YARN Container Runtimes – Enables support for Docker containers to make it easier to onboard new applications and services on YARN. • YARN Services Framework – Provides AM implementation, REST API, and various improvements to enable long running services on YARN. • YARN Service Discovery – Allows services running on YARN to discover one another.
  • 25. SIMPLIFIED SERVICE DISCOVERY VIA DNS • Existing YARN Service Registry • Allows apps to register themselves • Stores entries in Apache ZooKeeper • Provides native Java, REST, and CLI clients to enable service discovery • YARN Registry DNS Server • Watches the YARN Service Registry (ZK) for new application and container records • Creates user friendly DNS records based on the records • Supports zone transfers, zone forwarding, upstream querying, and DNSSEC • Examples: componentInstanceName.serviceName.user.domain • sleeper-0.sleeper-service.billie.domain • ctr-e138-1518143905142-215498-01-000007.domain
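The naming pattern in the examples above can be sketched in a few lines. Both helper functions are hypothetical. The component form follows the slide's `componentInstanceName.serviceName.user.domain` pattern directly; the container form assumes, based only on the slide's `ctr-...` example, that a YARN container ID of the form `container_e138_...` is mapped by swapping the prefix and replacing underscores with hyphens.

```python
def component_dns_name(instance, service, user, domain):
    """Build componentInstanceName.serviceName.user.domain, as on the slide."""
    return ".".join([instance, service, user, domain])


def container_dns_name(container_id, domain):
    """Map a container ID to a DNS-safe label (assumed rule, inferred from the example)."""
    label = container_id.replace("container_", "ctr-", 1).replace("_", "-")
    return f"{label}.{domain}"


print(component_dns_name("sleeper-0", "sleeper-service", "billie", "domain"))
print(container_dns_name("container_e138_1518143905142_215498_01_000007", "domain"))
```

With records like these, a client can reach a component instance through ordinary DNS resolution instead of querying the ZooKeeper-backed registry itself.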
  • 26. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 27. YARN SERVICE REST API • Create a service: POST URL - http://RM_HOST:8088/app/v1/services • Get service status: GET URL - http://RM_HOST:8088/app/v1/services/tensorflow • Update service: PUT URL - http://RM_HOST:8088/app/v1/services/tensorflow • Extend lifetime • STOP service • START service • Flex UP/DOWN the number of containers of one or more components • DELETE (destroy) service
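The operations listed on this slide can be sketched as request builders. The URLs come from the slide; the request bodies (`{"state": "STOPPED"}`, `{"state": "STARTED"}`, `{"lifetime": ...}`, and the per-component flex path with `number_of_containers`) follow the YARN services REST API documentation as best recalled and are assumptions here, as is the component name `worker`. Nothing is sent: the code only constructs `urllib.request.Request` objects, with `RM_HOST` left as the slide's placeholder.

```python
import json
from urllib.request import Request

BASE_URL = "http://RM_HOST:8088/app/v1/services"  # RM_HOST is a placeholder


def service_request(method, path="", body=None):
    """Build (but do not send) a request against the services endpoint."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Content-Type": "application/json"} if data is not None else {}
    return Request(BASE_URL + path, data=data, headers=headers, method=method)


def create(spec):
    return service_request("POST", body=spec)


def status(name):
    return service_request("GET", f"/{name}")


def stop(name):
    return service_request("PUT", f"/{name}", {"state": "STOPPED"})


def start(name):
    return service_request("PUT", f"/{name}", {"state": "STARTED"})


def extend_lifetime(name, seconds):
    return service_request("PUT", f"/{name}", {"lifetime": seconds})


def flex(name, component, count):
    return service_request(
        "PUT", f"/{name}/components/{component}", {"number_of_containers": count}
    )


def destroy(name):
    return service_request("DELETE", f"/{name}")


req = flex("tensorflow", "worker", 3)  # "worker" is a hypothetical component name
print(req.get_method(), req.full_url, req.data.decode("utf-8"))
```

Passing one of these objects to `urllib.request.urlopen` would issue the call; a secured cluster would additionally need authentication, which this sketch does not cover.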
  • 28. YARN APP CLI • Usage: yarn app • -launch serviceName jsonfile • -flex serviceName -component componentName count • -save serviceName jsonfile • -start serviceName • -status serviceName • -stop serviceName • -destroy serviceName
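As a rough illustration of how these subcommands combine over a service's life, the sketch below assembles the command lines for one possible sequence: launch, check status, flex, stop, destroy. The helper `yarn_app` is hypothetical, the service and file names are placeholders, and the flags are exactly those in the usage text above. The commands are printed rather than executed.

```python
import shlex


def yarn_app(action, service, *extra):
    """Assemble 'yarn app -<action> <service> [extra...]' as an argument list."""
    return ["yarn", "app", f"-{action}", service, *extra]


lifecycle = [
    yarn_app("launch", "sleeper-service", "sleeper.json"),
    yarn_app("status", "sleeper-service"),
    yarn_app("flex", "sleeper-service", "-component", "sleeper", "3"),
    yarn_app("stop", "sleeper-service"),
    yarn_app("destroy", "sleeper-service"),
]

for command in lifecycle:
    print(shlex.join(command))
```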
  • 29. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 30. DEFINING SERVICES THROUGH THE JSON SPEC • REST: $ curl -H "Content-Type: application/json" -X POST http://RM_HOST:8088/app/v1/services -d @sleeper.json • CLI: $ yarn app -launch serviceName sleeper.json • This spec creates two component instances, sleeper-0 and sleeper-1 • Optional features include readiness checks, placement policies, and creating / mounting resources such as config files
  • 31. DOCKER EXAMPLE • To convert the sleeper example into a Docker example, add an artifact: "artifact": { "id": "library/centos:7", "type": "DOCKER" }
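The conversion described on this slide amounts to adding one `artifact` entry. A minimal sketch follows, assuming the artifact is attached to each component of the spec (the slide does not say where in the JSON it goes) and using a trimmed-down, assumed sleeper spec as input. The function `with_docker_artifact` is hypothetical.

```python
import copy
import json


def with_docker_artifact(spec, image):
    """Return a copy of the spec whose components all run from a Docker image."""
    converted = copy.deepcopy(spec)
    for component in converted["components"]:
        component["artifact"] = {"id": image, "type": "DOCKER"}
    return converted


# Assumed minimal sleeper spec; field names are not taken from the slide.
sleeper = {
    "name": "sleeper-service",
    "components": [
        {"name": "sleeper", "number_of_containers": 2, "launch_command": "sleep 900000"}
    ],
}

docker_sleeper = with_docker_artifact(sleeper, "library/centos:7")
print(json.dumps(docker_sleeper["components"][0]["artifact"]))
```

Working on a deep copy leaves the process-based spec untouched, so both variants can be submitted side by side for comparison.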
  • 32. APACHE HBASE TARBALL EXAMPLE • HBase tarball service • TARBALL artifact type • ENV variables • Config files
  • 33. APACHE HBASE DOCKER EXAMPLE • Replace TARBALL artifact with DOCKER artifact • Remove unneeded env vars and add Docker mounts • Optionally use absolute paths for generated config files • Remove unneeded config files that already exist in the image • Adjust launch command based on location in image
  • 34. AGENDA • Emergence of Containers • Journey to a Container Cloud • Building Blocks of a Container Cloud • YARN Service APIs • YARN Service Examples • Enabling Hybrid Deployments
  • 35. REALITY: MULTI-CLOUD AND ON-PREM • [Map; visible label: Canada East (GCP)]
  • 36. ON PREM VS. CLOUD: VERY DIFFERENT MODELS • Cloud: multiple clusters; decoupled compute and storage; Infrastructure as a Service; improved agility and self-service • On Prem: large, multi-tenant clusters; co-located compute and storage; shared security and governance; less agile due to physical hardware • [Diagram labels: Public Cloud Compute, Public Cloud Storage, Compute, Data Center Storage, Security & Governance, Operations; workloads: EDW, Stream Processing, Data Science]
  • 37. ON PREM VS. CLOUD: BRIDGING THE GAP • Cloud: shared Sec/Gov services, multi-cluster, multi-cloud • On Prem: shared Sec/Gov services, multi-cluster, containerized • [Diagram labels: Public Cloud Compute, Public Cloud Storage, Apache Hadoop YARN Container Cloud, Data Center Storage, Security, Governance, Operations; workloads: Data Science, Stream Processing, EDW]
  • 38. CLOUDERA DATA PLATFORM • Public, private & hybrid cloud • Shared data experience • Powered by open source • Analytics from the Edge to AI • Unified data control plane • [Diagram labels: Infrastructure: Private Cloud, Hybrid Cloud, Public Multi-Cloud, Edge; Data management (DSX): Catalog | Schema | Migration | Security | Governance; Analytic experiences: Data Flow & Streaming, Data Engineering, Data Warehouse, Operational Database, Machine Learning; Unified control plane (Altus DataPlane): Identity | Orchestration | Management | Operations]
  • 39. THANK YOU