Apache Mesos as an SDK for Building Distributed Frameworks

Strata SC, 2014-02-13	

Paco Nathan 

http://liber118.com/pxn/

@pacoid
A Big Idea

Have you heard about “data democratization”???
making data available throughout more of the organization

In other words, how to remove silos…

Then how would you handle “cluster democratization”???
making data+resources available throughout more of the organization
Lessons from Google
Datacenter Computing	

Google has been doing datacenter computing for years, to address the complexities of large-scale data workflows:

• leveraging the modern kernel: isolation in lieu of VMs
• “most (>80%) jobs are batch jobs, but the majority of resources (55–80%) are allocated to service jobs”
• mixed workloads, multi-tenancy
• relatively high utilization rates
• because JVM? not so much…
• reality: scheduling batch is simple; scheduling services is hard/expensive
The Modern Kernel: Top Linux Contributors…	


arstechnica.com/information-technology/2013/09/...
“Return of the Borg”

Return of the Borg: How Twitter Rebuilt Google’s Secret Weapon
Cade Metz
wired.com/wiredenterprise/2013/03/google-borg-twittermesos

The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines
Luiz André Barroso, Urs Hölzle
research.google.com/pubs/pub35290.html

2011 GAFS Omega
John Wilkes, et al.
youtu.be/0ZFMlO98Jkc
Google describes the technology…

Omega: flexible, scalable schedulers for large compute clusters
Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes
eurosys2013.tudos.org/wp-content/uploads/2013/paper/Schwarzkopf.pdf
Google describes the business case…

Taming Latency Variability
Jeff Dean
plus.google.com/u/0/+ResearchatGoogle/posts/C1dPhQhcDRv
Commercial OS Cluster Schedulers

• IBM Platform Symphony
• Microsoft Autopilot

Arguably, some grid controllers are quite notable in-category:

• Univa Grid Engine (formerly SGE)
• Condor
• etc.
Emerging at Berkeley
Beyond Hadoop

Hadoop – an open source solution for fault-tolerant parallel processing of batch jobs at scale, based on commodity hardware… however, other priorities have emerged for the analytics lifecycle:

• apps require integration beyond Hadoop
• multiple topologies, mixed workloads, multi-tenancy
• significant disruptions in h/w cost/performance curves
• higher utilization
• lower latency
• highly-available, long running services
• more than “Just JVM” – e.g., Python growth
Beyond Hadoop

keep in mind priorities for interdisciplinary efforts, to break down silos – extending beyond a de facto “priesthood” of data engineering
Mesos – open source datacenter computing

a common substrate for cluster computing
mesos.apache.org

heterogeneous assets in your datacenter or cloud made available as a homogeneous set of resources

• top-level Apache project
• scalability to 10,000s of nodes
• obviates the need for virtual machines
• isolation (pluggable) for CPU, RAM, I/O, FS, etc.
• fault-tolerant leader election based on ZooKeeper
• APIs in C++, Java, Python, Go
• web UI for inspecting cluster state
• available for Linux, OpenSolaris, Mac OS X
What are the costs of Virtualization?

benchmark type      OpenVZ improvement
mixed workloads     210%-300%
LAMP (related)      38%-200%
I/O throughput      200%-500%
response time       order of magnitude

more pronounced at higher loads
What are the costs of Single Tenancy?

[Figure: three panels plotting CPU load (0% to 100%) over time t for Memcached, Rails, and Hadoop, and a fourth panel plotting the combined CPU load (Rails, Memcached, Hadoop) on the same scale]
Arguments for Datacenter Computing

rather than running several specialized clusters, each at relatively low utilization rates, instead run many mixed workloads

obvious benefits are realized in terms of:

• scalability, elasticity, fault tolerance, performance, utilization
• reduced equipment capex, Ops overhead, etc.
• reduced licensing, eliminating need for VMs or potential vendor lock-in

subtle benefits – arguably, more important for Enterprise IT:

• reduced time for engineers to ramp up new services at scale
• reduced latency between batch and services, enabling new high ROI use cases
• enables Dev/Test apps to run safely on a Production cluster
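
The utilization argument on this slide can be illustrated with a little arithmetic. What follows is a minimal sketch, not something from the talk: the per-slot demand numbers are made up purely for illustration, and `capacity_needed` and `mean_utilization` are hypothetical helpers. It compares provisioning each workload for its own peak (several specialized clusters) against provisioning one shared pool for the peak of the combined load.

```python
# Illustrative only: hypothetical CPU demand (in cores) per time slot
# for three workloads whose peaks fall at different times.
loads = {
    "rails":     [40, 80, 60, 20],
    "memcached": [30, 50, 40, 20],
    "hadoop":    [10, 10, 30, 90],
}


def capacity_needed(traces):
    """Return (sum of per-workload peaks, peak of the combined load)."""
    dedicated = sum(max(trace) for trace in traces.values())
    combined = [sum(slot) for slot in zip(*traces.values())]
    return dedicated, max(combined)


def mean_utilization(traces, capacity):
    """Average fraction of `capacity` in use across all time slots."""
    combined = [sum(slot) for slot in zip(*traces.values())]
    return sum(combined) / (len(combined) * capacity)


dedicated, shared = capacity_needed(loads)
print(f"dedicated clusters: {dedicated} cores, "
      f"mean utilization {mean_utilization(loads, dedicated):.0%}")
print(f"shared pool:        {shared} cores, "
      f"mean utilization {mean_utilization(loads, shared):.0%}")
```

With these made-up traces the dedicated clusters need 220 cores in total while the shared pool needs 140, because the peaks do not coincide. Real savings depend entirely on how correlated the actual workloads are.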
Analogies and Architecture
Prior Practice: Dedicated Servers

[Diagram: DATACENTER]

• low utilization rates
• longer time to ramp up new services
Prior Practice: Virtualization

[Diagram: DATACENTER, PROVISIONED VMS]

• even more machines to manage
• VM licensing costs
• substantial performance decrease due to virtualization
Prior Practice: Static Partitioning

[Diagram: DATACENTER, STATIC PARTITIONING]

• even more machines to manage
• VM licensing costs
• substantial performance decrease due to virtualization
• static partitioning limits elasticity
Mesos: One Large Pool of Resources

[Diagram: DATACENTER, MESOS]

“We wanted people to be able to program for the datacenter just like they program for their laptop.”
Ben Hindman
Frameworks Integrated with Mesos

Continuous Integration: Jenkins, GitLab
Big Data: Hadoop, Spark, Storm, Kafka, Cassandra, Hypertable, MPI
Python workloads: DPark, Exelixi
Meta-Frameworks / HA Services: Aurora, Marathon
Distributed Cron: Chronos
Containers: Docker

Fault-tolerant distributed systems…

…written in 100-300 lines of C++, Java/Scala, Python, Go, etc.

…building blocks, if you will

Q: required lines of network code?
A: probably none
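
To make the "no network code" point concrete, here is a minimal sketch of what a framework author might write: callbacks only. Everything in it is an assumption for illustration. `RetryingScheduler`, `FakeDriver`, the `resource_offers` / `status_update` method names, and the `"FINISHED"` / `"LOST"` states are hypothetical and simplified; they echo the callback style of a Mesos framework but do not reproduce the real Mesos bindings. The stand-in driver runs in-process and loses some tasks at random, so the retry path is exercised.

```python
import random


class RetryingScheduler:
    """Hypothetical framework scheduler: only callbacks, no sockets."""

    def __init__(self, commands):
        self.queue = list(commands)   # work not yet running
        self.running = {}             # task id -> command
        self.finished = []
        self.next_id = 0

    def resource_offers(self, driver, free_slots):
        # Launch as many queued commands as the offer has room for.
        while self.queue and free_slots > 0:
            command = self.queue.pop(0)
            task_id = self.next_id
            self.next_id += 1
            self.running[task_id] = command
            driver.launch(task_id, command)
            free_slots -= 1

    def status_update(self, driver, task_id, state):
        command = self.running.pop(task_id)
        if state == "FINISHED":
            self.finished.append(command)
        else:
            # Fault tolerance: a lost task is simply queued again.
            self.queue.append(command)


class FakeDriver:
    """In-process stand-in for the cluster; loses some tasks at random."""

    def __init__(self, scheduler, slots, loss_rate, seed):
        self.scheduler = scheduler
        self.slots = slots
        self.loss_rate = loss_rate
        self.rng = random.Random(seed)
        self.launched = []

    def launch(self, task_id, command):
        self.launched.append(task_id)

    def run(self):
        # Keep offering slots until nothing is queued or running.
        while self.scheduler.queue or self.scheduler.running:
            self.scheduler.resource_offers(self, self.slots)
            batch, self.launched = self.launched, []
            for task_id in batch:
                lost = self.rng.random() < self.loss_rate
                self.scheduler.status_update(
                    self, task_id, "LOST" if lost else "FINISHED")


scheduler = RetryingScheduler(["job-a", "job-b", "job-c", "job-d"])
FakeDriver(scheduler, slots=2, loss_rate=0.3, seed=7).run()
print(sorted(scheduler.finished))
```

The scheduler never opens a connection or serializes a message; in this sketch the driver object owns all of that, which is the division of labor the slide is pointing at.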
Mesos – architecture

[Diagram: a layered stack, top to bottom]
Workloads: services, batch
Apps: Scalding, MPI, Impala, Hadoop, Shark, Spark, MySQL, Kafka, JBoss, Django, Chronos, Storm, Rails
Frameworks: Python, C++, JVM, Marathon
Kernel: distributed resources (CPU, RAM, I/O, FS, rack locality, etc.)
DFS: distributed file system
Cluster
Mesos – architecture

apps: HA services, web apps, batch jobs, scripts, etc.
frameworks: Spark, Storm, MPI, Jenkins, etc.
task schedulers: Chronos, etc.
meta-frameworks: Aurora, Marathon
APIs: C++, JVM, Py, Go
Mesos, distrib kernel
HDFS, distrib file system
Linux: libcgroup, libprocess, libev, etc.
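Mesos – dynamics

[Diagram: Mesos components and their single-host analogues]
• Chronos: distrib cron
• Marathon: distrib init.d
• Mesos: distrib kernel
• workload labels: scheduled apps, HA services, distrib frameworks
• host labels: GNU/Linux, Linux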
Mesos – dynamics

[Diagram: a distributed framework (Scheduler, Executors), Mesos slaves, and Mesos masters forming the distributed kernel; labels: resource offers, available resources]
Example: Resource Offer in a Two-Level Scheduler

mesos.apache.org/documentation/latest/mesos-architecture/
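
The two levels in a resource offer can be sketched in a few lines. This is a toy, in-process simulation under stated assumptions, not the Mesos API: `Offer`, `Task`, `ToyMaster`, and `Framework1Scheduler` are hypothetical names, and the figures (a slave with 4 CPUs and 4 GB, two tasks of 2 CPUs / 1 GB and 1 CPU / 2 GB) are illustrative. Level one is the master deciding which framework is offered a slave's free resources; level two is the framework deciding which of its own tasks to place on that offer.

```python
from dataclasses import dataclass


@dataclass
class Offer:
    slave: str
    cpus: float
    mem_gb: float


@dataclass
class Task:
    name: str
    cpus: float
    mem_gb: float


class Framework1Scheduler:
    """Level two: the framework picks which of its tasks fit the offer."""

    def __init__(self):
        self.pending = [Task("task1", 2, 1), Task("task2", 1, 2)]

    def resource_offer(self, offer):
        accepted, cpus, mem = [], offer.cpus, offer.mem_gb
        for task in self.pending:
            if task.cpus <= cpus and task.mem_gb <= mem:
                accepted.append(task)
                cpus -= task.cpus
                mem -= task.mem_gb
        self.pending = [t for t in self.pending if t not in accepted]
        return accepted


class ToyMaster:
    """Level one: the master decides which framework is offered what."""

    def __init__(self):
        self.free = {}  # slave name -> (cpus, mem_gb) still unallocated

    def report(self, slave, cpus, mem_gb):
        self.free[slave] = (cpus, mem_gb)

    def offer_to(self, scheduler, slave):
        cpus, mem = self.free[slave]
        tasks = scheduler.resource_offer(Offer(slave, cpus, mem))
        used_cpus = sum(t.cpus for t in tasks)
        used_mem = sum(t.mem_gb for t in tasks)
        # Whatever the framework declined stays free for the next offer.
        self.free[slave] = (cpus - used_cpus, mem - used_mem)
        return tasks


master = ToyMaster()
master.report("slave1", cpus=4, mem_gb=4)
launched = master.offer_to(Framework1Scheduler(), "slave1")
print([t.name for t in launched])   # ['task1', 'task2']
print(master.free["slave1"])        # (1, 1)
```

The leftover 1 CPU and 1 GB remain with the master, which could then offer them to a second framework. The master never needs to know what the tasks are, only how much of the offer was accepted.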
Because…

Use Cases
Production Deployments (public)
Opposite Ends of the Spectrum, One Common Substrate

[Diagram labels: Solaris Zones; Built-in / bare metal; Linux CGroups; Hypervisors]
Opposite Ends of the Spectrum, One Common Substrate

[Diagram labels: Request / Response; Batch]
Case Study: Twitter (bare metal / on premise)

“Mesos is the cornerstone of our elastic compute infrastructure – it’s how we build all our new services and is critical for Twitter’s continued success at scale. It’s one of the primary keys to our data center efficiency.”
Chris Fry, SVP Engineering

blog.twitter.com/2013/mesos-graduates-from-apache-incubation
wired.com/gadgetlab/2013/11/qa-with-chris-fry/

• key services run in production: analytics, typeahead, ads
• Twitter engineers rely on Mesos to build all new services
• instead of thinking about static machines, engineers think about resources like CPU, memory and disk
• allows services to scale and leverage a shared pool of servers across datacenters efficiently
• reduces the time between prototyping and launching
Case Study: Airbnb (fungible cloud infrastructure)

“We think we might be pushing data science in the field of travel more so than anyone has ever done before… a smaller number of engineers can have higher impact through automation on Mesos.”
Mike Curtis, VP Engineering

gigaom.com/2013/07/29/airbnb-is-engineering-itself-into-a-data...

• improves resource management and efficiency
• helps advance engineering strategy of building small teams that can move fast
• key to letting engineers make the most of AWS-based infrastructure beyond just Hadoop
• allowed company to migrate off Elastic MapReduce
• enables use of Hadoop along with Chronos, Spark, Storm, etc.
DIY

http://elastic.mesosphere.io
http://mesosphere.io/learn
Elastic Mesos

[Diagram: three masters (Master 1, Master 2, Master 3), each labeled NN and ZK, and fifteen workers, each labeled DN, shown in groups of five]
Save the Date:	

Apr 3, 2014	

Mesos Summit	

(or something)
Resources

Apache Mesos Project: mesos.apache.org
Twitter: @ApacheMesos
Mesosphere: mesosphere.io
Tutorials: mesosphere.io/learn
Documentation: mesos.apache.org/documentation
2011 USENIX Research Paper: usenix.org/legacy/event/nsdi11/tech/full_papers/Hindman_new.pdf
Collected Notes/Archives: goo.gl/jPtTP
Former Airbnb engineers simplify Mesos to manage data jobs in the cloud

Jordan Novet

VentureBeat (2013-11-12)

venturebeat.com/2013/11/12/former-airbnb-engineers-simplify...	

Mesosphere Adds Docker Support To Its Mesos-Based Operating System For The Data Center

Frederic Lardinois

TechCrunch (2013-09-26)

techcrunch.com/2013/09/26/mesosphere...	

Play Framework Grid Deployment with Mesos

James Ward, Flo Leibert, et al.

Typesafe blog (2013-09-19)

typesafe.com/blog/play-framework-grid...	

Mesosphere Launches Marathon Framework

Adrian Bridgwater

Dr. Dobbs (2013-09-18)

drdobbs.com/open-source/mesosphere...	

New open source tech Marathon wants to make your data center run like Google’s

Derrick Harris

GigaOM (2013-09-04)

gigaom.com/2013/09/04/new-open-source...	

Running batch and long-running, highly available service jobs on the same cluster

Ben Lorica

O’Reilly (2013-09-01)

strata.oreilly.com/2013/09/running-batch...

Enterprise Data Workflows with Cascading
O’Reilly, 2013
shop.oreilly.com/product/0636920028536.do

monthly newsletter for updates, events, conference summaries, etc.:
liber118.com/pxn/

Más contenido relacionado

La actualidad más candente

Apache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosApache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosJoe Stein
 
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...DataStax
 
Elastic HBase on Mesos - HBaseCon 2015
Elastic HBase on Mesos - HBaseCon 2015Elastic HBase on Mesos - HBaseCon 2015
Elastic HBase on Mesos - HBaseCon 2015Cosmin Lehene
 
Developing Frameworks for Apache Mesos
Developing Frameworks  for Apache MesosDeveloping Frameworks  for Apache Mesos
Developing Frameworks for Apache MesosJoe Stein
 
Cassandra Introduction & Features
Cassandra Introduction & FeaturesCassandra Introduction & Features
Cassandra Introduction & FeaturesPhil Peace
 
Terraform Modules Restructured
Terraform Modules RestructuredTerraform Modules Restructured
Terraform Modules RestructuredDoiT International
 
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hair
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hairRENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hair
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hairJohn Constable
 
Introduction to hazelcast
Introduction to hazelcastIntroduction to hazelcast
Introduction to hazelcastEmin Demirci
 
Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Dave Holland
 
High Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudHigh Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudAccubits Technologies
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraDataStax Academy
 
Scaling Big Data with Hadoop and Mesos
Scaling Big Data with Hadoop and MesosScaling Big Data with Hadoop and Mesos
Scaling Big Data with Hadoop and MesosDiscover Pinterest
 
Hazelcast Essentials
Hazelcast EssentialsHazelcast Essentials
Hazelcast EssentialsRahul Gupta
 
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsMulti-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsDataWorks Summit
 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopDataWorks Summit
 
HBaseCon 2013: Apache HBase Operations at Pinterest
HBaseCon 2013: Apache HBase Operations at PinterestHBaseCon 2013: Apache HBase Operations at Pinterest
HBaseCon 2013: Apache HBase Operations at PinterestCloudera, Inc.
 

La actualidad más candente (20)

Apache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on MesosApache Kafka, HDFS, Accumulo and more on Mesos
Apache Kafka, HDFS, Accumulo and more on Mesos
 
Hadoop on-mesos
Hadoop on-mesosHadoop on-mesos
Hadoop on-mesos
 
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
Cassandra on Mesos Across Multiple Datacenters at Uber (Abhishek Verma) | C* ...
 
Elastic HBase on Mesos - HBaseCon 2015
Elastic HBase on Mesos - HBaseCon 2015Elastic HBase on Mesos - HBaseCon 2015
Elastic HBase on Mesos - HBaseCon 2015
 
Developing Frameworks for Apache Mesos
Developing Frameworks  for Apache MesosDeveloping Frameworks  for Apache Mesos
Developing Frameworks for Apache Mesos
 
Cassandra Introduction & Features
Cassandra Introduction & FeaturesCassandra Introduction & Features
Cassandra Introduction & Features
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
 
Terraform Modules Restructured
Terraform Modules RestructuredTerraform Modules Restructured
Terraform Modules Restructured
 
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hair
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hairRENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hair
RENCI User Group Meeting 2017 - I Upgraded iRODS and I still have all my hair
 
Introduction to hazelcast
Introduction to hazelcastIntroduction to hazelcast
Introduction to hazelcast
 
Nov 2011 HUG: Blur - Lucene on Hadoop
Nov 2011 HUG: Blur - Lucene on HadoopNov 2011 HUG: Blur - Lucene on Hadoop
Nov 2011 HUG: Blur - Lucene on Hadoop
 
Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017
 
High Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloudHigh Performance Computing (HPC) in cloud
High Performance Computing (HPC) in cloud
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run Cassandra
 
Scaling Big Data with Hadoop and Mesos
Scaling Big Data with Hadoop and MesosScaling Big Data with Hadoop and Mesos
Scaling Big Data with Hadoop and Mesos
 
Hazelcast Essentials
Hazelcast EssentialsHazelcast Essentials
Hazelcast Essentials
 
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase DeploymentsMulti-tenant, Multi-cluster and Multi-container Apache HBase Deployments
Multi-tenant, Multi-cluster and Multi-container Apache HBase Deployments
 
Best Practices for Virtualizing Hadoop
Best Practices for Virtualizing HadoopBest Practices for Virtualizing Hadoop
Best Practices for Virtualizing Hadoop
 
ZooKeeper (and other things)
ZooKeeper (and other things)ZooKeeper (and other things)
ZooKeeper (and other things)
 
HBaseCon 2013: Apache HBase Operations at Pinterest
HBaseCon 2013: Apache HBase Operations at PinterestHBaseCon 2013: Apache HBase Operations at Pinterest
HBaseCon 2013: Apache HBase Operations at Pinterest
 

Destacado

Creating a Mesos python framework
Creating a Mesos python frameworkCreating a Mesos python framework
Creating a Mesos python frameworkOlivier Sallou
 
Building and Deploying Application to Apache Mesos
Building and Deploying Application to Apache MesosBuilding and Deploying Application to Apache Mesos
Building and Deploying Application to Apache MesosJoe Stein
 
100+ Beautiful Slides from #CannesLions '11 from @jessedee
100+ Beautiful Slides from #CannesLions '11 from @jessedee100+ Beautiful Slides from #CannesLions '11 from @jessedee
100+ Beautiful Slides from #CannesLions '11 from @jessedeeJesse Desjardins - @jessedee
 
Musings on Mesos: Docker, Kubernetes, and Beyond.
Musings on Mesos: Docker, Kubernetes, and Beyond.Musings on Mesos: Docker, Kubernetes, and Beyond.
Musings on Mesos: Docker, Kubernetes, and Beyond.Timothy St. Clair
 
Dynamic Scheduling - Federated clusters in mesos
Dynamic Scheduling - Federated clusters in mesosDynamic Scheduling - Federated clusters in mesos
Dynamic Scheduling - Federated clusters in mesosAaron Carey
 
50 New Features of Java EE 7 in 50 minutes
50 New Features of Java EE 7 in 50 minutes50 New Features of Java EE 7 in 50 minutes
50 New Features of Java EE 7 in 50 minutesArun Gupta
 
The thing with Fuel SAS helps the KLM Fuel & Emission DWH
The thing with Fuel SAS helps the KLM Fuel & Emission DWHThe thing with Fuel SAS helps the KLM Fuel & Emission DWH
The thing with Fuel SAS helps the KLM Fuel & Emission DWHBusiness_Analytics
 
Getting Started Hacking OpenNebula - Fosdem-2013
Getting Started Hacking OpenNebula - Fosdem-2013Getting Started Hacking OpenNebula - Fosdem-2013
Getting Started Hacking OpenNebula - Fosdem-2013OpenNebula Project
 
Presenting the BetaCodex (BetaCodex06)
Presenting the BetaCodex (BetaCodex06)Presenting the BetaCodex (BetaCodex06)
Presenting the BetaCodex (BetaCodex06)Niels Pflaeging
 
Agile Recruiting White Paper
Agile Recruiting White PaperAgile Recruiting White Paper
Agile Recruiting White PaperAmber Grewal
 
Mesos vs kubernetes comparison
Mesos vs kubernetes comparisonMesos vs kubernetes comparison
Mesos vs kubernetes comparisonKrishna-Kumar
 
Hyperscale Computing, Enterprise Agility with Mesosphere
Hyperscale Computing, Enterprise Agility with MesosphereHyperscale Computing, Enterprise Agility with Mesosphere
Hyperscale Computing, Enterprise Agility with MesosphereMarkus Eisele
 
MesosCon EU - HTTP API Framework
MesosCon EU - HTTP API FrameworkMesosCon EU - HTTP API Framework
MesosCon EU - HTTP API FrameworkMarco Massenzio
 
Mesos and Kubernetes ecosystem overview
Mesos and Kubernetes ecosystem overviewMesos and Kubernetes ecosystem overview
Mesos and Kubernetes ecosystem overviewKrishna-Kumar
 
Metodos de programacion no-lineal
Metodos de programacion no-linealMetodos de programacion no-lineal
Metodos de programacion no-linealRobert Rivero
 

Destacado (20)

Creating a Mesos python framework
Creating a Mesos python frameworkCreating a Mesos python framework
Creating a Mesos python framework
 
Building and Deploying Application to Apache Mesos
Building and Deploying Application to Apache MesosBuilding and Deploying Application to Apache Mesos
Building and Deploying Application to Apache Mesos
 
100+ Beautiful Slides from #CannesLions '11 from @jessedee
100+ Beautiful Slides from #CannesLions '11 from @jessedee100+ Beautiful Slides from #CannesLions '11 from @jessedee
100+ Beautiful Slides from #CannesLions '11 from @jessedee
 
Musings on Mesos: Docker, Kubernetes, and Beyond.
Musings on Mesos: Docker, Kubernetes, and Beyond.Musings on Mesos: Docker, Kubernetes, and Beyond.
Musings on Mesos: Docker, Kubernetes, and Beyond.
 
Mesos introduction
Mesos introductionMesos introduction
Mesos introduction
 
GoDocker presentation
GoDocker presentationGoDocker presentation
GoDocker presentation
 
Interaction17 | Designing to Combat Misinformation
Interaction17 | Designing to Combat Misinformation Interaction17 | Designing to Combat Misinformation
Interaction17 | Designing to Combat Misinformation
 
Dynamic Scheduling - Federated clusters in mesos
Dynamic Scheduling - Federated clusters in mesosDynamic Scheduling - Federated clusters in mesos
Dynamic Scheduling - Federated clusters in mesos
 
50 New Features of Java EE 7 in 50 minutes
50 New Features of Java EE 7 in 50 minutes50 New Features of Java EE 7 in 50 minutes
50 New Features of Java EE 7 in 50 minutes
 
The thing with Fuel SAS helps the KLM Fuel & Emission DWH
The thing with Fuel SAS helps the KLM Fuel & Emission DWHThe thing with Fuel SAS helps the KLM Fuel & Emission DWH
The thing with Fuel SAS helps the KLM Fuel & Emission DWH
 
Getting Started Hacking OpenNebula - Fosdem-2013
Getting Started Hacking OpenNebula - Fosdem-2013Getting Started Hacking OpenNebula - Fosdem-2013
Getting Started Hacking OpenNebula - Fosdem-2013
 
Rpscala2011 0601
Rpscala2011 0601Rpscala2011 0601
Rpscala2011 0601
 
Presenting the BetaCodex (BetaCodex06)
Presenting the BetaCodex (BetaCodex06)Presenting the BetaCodex (BetaCodex06)
Presenting the BetaCodex (BetaCodex06)
 
Agile Recruiting White Paper
Agile Recruiting White PaperAgile Recruiting White Paper
Agile Recruiting White Paper
 
Mesos vs kubernetes comparison
Mesos vs kubernetes comparisonMesos vs kubernetes comparison
Mesos vs kubernetes comparison
 
Hyperscale Computing, Enterprise Agility with Mesosphere
Hyperscale Computing, Enterprise Agility with MesosphereHyperscale Computing, Enterprise Agility with Mesosphere
Hyperscale Computing, Enterprise Agility with Mesosphere
 
MesosCon EU - HTTP API Framework
MesosCon EU - HTTP API FrameworkMesosCon EU - HTTP API Framework
MesosCon EU - HTTP API Framework
 
Mesos and Kubernetes ecosystem overview
Mesos and Kubernetes ecosystem overviewMesos and Kubernetes ecosystem overview
Mesos and Kubernetes ecosystem overview
 
Actividad 3 BD
Actividad 3 BDActividad 3 BD
Actividad 3 BD
 
Metodos de programacion no-lineal
Metodos de programacion no-linealMetodos de programacion no-lineal
Metodos de programacion no-lineal
 

Similar a Strata SC 2014: Apache Mesos as an SDK for Building Distributed Frameworks

Apache Mesos Overview and Integration
Apache Mesos Overview and IntegrationApache Mesos Overview and Integration
Apache Mesos Overview and IntegrationAlex Baretto
 
How leading financial services organisations are winning with tech
How leading financial services organisations are winning with techHow leading financial services organisations are winning with tech
How leading financial services organisations are winning with techMongoDB
 
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...Ludovic Piot
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Dean Delamont
 
Declare Victory with Big Data
Declare Victory with Big DataDeclare Victory with Big Data
Declare Victory with Big DataJ On The Beach
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSSteve Wong
 
Building FoundationDB
Building FoundationDBBuilding FoundationDB
Building FoundationDBFoundationDB
 
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...Lightbend
 
Stay productive while slicing up the monolith
Stay productive while slicing up the monolithStay productive while slicing up the monolith
Stay productive while slicing up the monolithMarkus Eisele
 
Stay productive while slicing up the monolith
Stay productive while slicing up the monolithStay productive while slicing up the monolith
Stay productive while slicing up the monolithMarkus Eisele
 
Easy Docker Deployments with Mesosphere DCOS on Azure
Easy Docker Deployments with Mesosphere DCOS on AzureEasy Docker Deployments with Mesosphere DCOS on Azure
Easy Docker Deployments with Mesosphere DCOS on AzureMesosphere Inc.
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservicesBigstep
 
Kubernetes solutions
Kubernetes solutionsKubernetes solutions
Kubernetes solutionsEric Cattoir
 
Dori Exterman, Considerations for choosing the parallel computing strategy th...
Dori Exterman, Considerations for choosing the parallel computing strategy th...Dori Exterman, Considerations for choosing the parallel computing strategy th...
Dori Exterman, Considerations for choosing the parallel computing strategy th...Sergey Platonov
 
Java Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the CloudJava Agile ALM: OTAP and DevOps in the Cloud
Java Agile ALM: OTAP and DevOps in the CloudMongoDB
 
Tackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providersTackling complexity in giant systems: approaches from several cloud providers
Tackling complexity in giant systems: approaches from several cloud providersPatrick Chanezon
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterpriseBert Poller
 
FLUX - Crash Course in Cloud 2.0
FLUX - Crash Course in Cloud 2.0 FLUX - Crash Course in Cloud 2.0
FLUX - Crash Course in Cloud 2.0 Mark Hinkle
 
Accelerate DevOps/Microservices and Kubernetes
Accelerate DevOps/Microservices and KubernetesAccelerate DevOps/Microservices and Kubernetes
Accelerate DevOps/Microservices and KubernetesRick Hightower
 
DevOps in Age of Kubernetes
DevOps in Age of KubernetesDevOps in Age of Kubernetes
DevOps in Age of KubernetesMesosphere Inc.
 

Similar a Strata SC 2014: Apache Mesos as an SDK for Building Distributed Frameworks (20)

Apache Mesos Overview and Integration
Apache Mesos Overview and IntegrationApache Mesos Overview and Integration
Apache Mesos Overview and Integration
 
How leading financial services organisations are winning with tech
How leading financial services organisations are winning with techHow leading financial services organisations are winning with tech
How leading financial services organisations are winning with tech
 
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
 
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
Achieving Cost & Resource Effeciencies through Trove Database As-A-Service (D...
 
Declare Victory with Big Data
Declare Victory with Big DataDeclare Victory with Big Data
Declare Victory with Big Data
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Building FoundationDB
Building FoundationDBBuilding FoundationDB
Building FoundationDB
 
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Strata SC 2014: Apache Mesos as an SDK for Building Distributed Frameworks

  • 1. Apache Mesos as an SDK for Building Distributed Frameworks 
 
 Strata SC, 2014-02-13 Paco Nathan 
 http://liber118.com/pxn/
 @pacoid
  • 5. Have you heard about “data democratization”???
       making data available throughout more of the organization
     In other words, how to remove silos…
     Then how would you handle “cluster democratization”???
       making data+resources available throughout more of the organization
  • 7. Datacenter Computing
     Google has been doing datacenter computing for years, to address the complexities of large-scale data workflows:
      • leveraging the modern kernel: isolation in lieu of VMs
      • “most (>80%) jobs are batch jobs, but the majority of resources (55–80%) are allocated to service jobs”
      • mixed workloads, multi-tenancy
      • relatively high utilization rates
      • because JVM? not so much…
      • reality: scheduling batch is simple; scheduling services is hard/expensive
  • 8. The Modern Kernel: Top Linux Contributors… arstechnica.com/information-technology/2013/09/...
  • 9. “Return of the Borg”
      • Return of the Borg: How Twitter Rebuilt Google’s Secret Weapon
        Cade Metz
        wired.com/wiredenterprise/2013/03/google-borg-twittermesos
      • The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines
        Luiz André Barroso, Urs Hölzle
        research.google.com/pubs/pub35290.html
      • 2011 GAFS Omega
        John Wilkes, et al.
        youtu.be/0ZFMlO98Jkc
  • 10. Google describes the technology…
     Omega: flexible, scalable schedulers for large compute clusters
     Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, John Wilkes
     eurosys2013.tudos.org/wp-content/uploads/2013/paper/Schwarzkopf.pdf
  • 11. Google describes the business case… Taming Latency Variability
 Jeff Dean
 plus.google.com/u/0/+ResearchatGoogle/posts/C1dPhQhcDRv
  • 12. Commercial OS Cluster Schedulers
      • IBM Platform Symphony
      • Microsoft Autopilot
     Arguably, some grid controllers are quite notable in-category:
      • Univa Grid Engine (formerly SGE)
      • Condor
      • etc.
  • 14. Beyond Hadoop
     Hadoop – an open source solution for fault-tolerant parallel processing of batch jobs at scale, based on commodity hardware… however, other priorities have emerged for the analytics lifecycle:
      • apps require integration beyond Hadoop
      • multiple topologies, mixed workloads, multi-tenancy
      • significant disruptions in h/w cost/performance curves
      • higher utilization
      • lower latency
      • highly-available, long running services
      • more than “Just JVM” – e.g., Python growth
  • 15. Beyond Hadoop
     keep in mind priorities for interdisciplinary efforts, to break down silos – extending beyond a de facto “priesthood” of data engineering
  • 16.
  • 17. Mesos – open source datacenter computing
     a common substrate for cluster computing
     mesos.apache.org
     heterogeneous assets in your datacenter or cloud made available as a homogeneous set of resources
      • top-level Apache project
      • scalability to 10,000s of nodes
      • obviates the need for virtual machines
      • isolation (pluggable) for CPU, RAM, I/O, FS, etc.
      • fault-tolerant leader election based on ZooKeeper
      • APIs in C++, Java, Python, Go
      • web UI for inspecting cluster state
      • available for Linux, OpenSolaris, Mac OSX
  • 18. What are the costs of Virtualization?
     benchmark type      OpenVZ improvement
     mixed workloads     210%-300%
     LAMP (related)      38%-200%
     I/O throughput      200%-500%
     response time       order of magnitude
     more pronounced at higher loads
  • 19. What are the costs of Single Tenancy?
     [Figure: CPU load (0%–100%) over time t in four panels: Memcached CPU load, Rails CPU load, Hadoop CPU load, and combined CPU load (Rails, Memcached, Hadoop)]
  • 20. Arguments for Datacenter Computing
     rather than running several specialized clusters, each at relatively low utilization rates, instead run many mixed workloads
     obvious benefits are realized in terms of:
      • scalability, elasticity, fault tolerance, performance, utilization
      • reduced equipment capex, Ops overhead, etc.
      • reduced licensing, eliminating need for VMs or potential vendor lock-in
     subtle benefits – arguably, more important for Enterprise IT:
      • reduced time for engineers to ramp up new services at scale
      • enables Dev/Test apps to run safely on a Production cluster
      • reduced latency between batch and services, enabling new high ROI use cases
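The utilization argument on slides 19 and 20 can be checked with a little arithmetic. The sketch below is illustrative only: the load figures are invented, and the function names are hypothetical, not taken from the talk. It compares the capacity needed when each workload has its own cluster sized for its own peak against one shared pool sized for the combined peak.

```python
# Hypothetical CPU load samples (fraction of one cluster's capacity) for
# three workloads over six time steps; the numbers are invented.
memcached = [0.20, 0.25, 0.60, 0.30, 0.20, 0.15]
rails = [0.10, 0.50, 0.30, 0.20, 0.10, 0.10]
hadoop = [0.70, 0.20, 0.10, 0.30, 0.60, 0.75]


def mean(xs):
    """Arithmetic mean of a list of numbers."""
    return sum(xs) / len(xs)


def dedicated_capacity(loads):
    """Capacity needed if each workload runs on its own cluster,
    each sized for that workload's own peak."""
    return sum(max(load) for load in loads)


def shared_capacity(loads):
    """Capacity needed if all workloads share one pool,
    sized for the peak of the combined load."""
    combined = [sum(step) for step in zip(*loads)]
    return max(combined)


def utilization(loads, capacity):
    """Average work done divided by the capacity provisioned."""
    return sum(mean(load) for load in loads) / capacity


loads = [memcached, rails, hadoop]
print("dedicated capacity:", dedicated_capacity(loads))
print("shared capacity:   ", shared_capacity(loads))
print("dedicated utilization:", utilization(loads, dedicated_capacity(loads)))
print("shared utilization:   ", utilization(loads, shared_capacity(loads)))
```

Because the peak of a sum never exceeds the sum of the peaks, the shared pool never needs more capacity than the dedicated clusters, and it needs less whenever the workloads peak at different times.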
  • 22. Prior Practice: Dedicated Servers
     DATACENTER
      • low utilization rates
      • longer time to ramp up new services
  • 23. Prior Practice: Virtualization
     DATACENTER, PROVISIONED VMS
      • even more machines to manage
      • substantial performance decrease due to virtualization
      • VM licensing costs
  • 24. Prior Practice: Static Partitioning
     DATACENTER, STATIC PARTITIONING
      • even more machines to manage
      • substantial performance decrease due to virtualization
      • VM licensing costs
      • static partitioning limits elasticity
  • 25. Mesos: One Large Pool of Resources
     DATACENTER, MESOS
     “We wanted people to be able to program for the datacenter just like they program for their laptop.”
     Ben Hindman
  • 26. Frameworks Integrated with Mesos
      • Continuous Integration: Jenkins, GitLab
      • Big Data: Hadoop, Spark, Storm, Kafka, Cassandra, Hypertable, MPI
      • Python workloads: DPark, Exelixi
      • Meta-Frameworks / HA Services: Aurora, Marathon
      • Distributed Cron: Chronos
      • Containers: Docker
  • 27. Fault-tolerant distributed systems…
     …written in 100-300 lines of C++, Java/Scala, Python, Go, etc.
     …building blocks, if you will
     Q: required lines of network code?
     A: probably none
  • 29. Mesos – architecture
      • apps: HA services, web apps, batch jobs, scripts, etc.
      • frameworks: Spark, Storm, MPI, Jenkins, etc.
      • task schedulers: Chronos, etc.
      • meta-frameworks: Aurora, Marathon
      • APIs: C++, JVM, Py, Go
      • Mesos, distrib kernel
      • HDFS, distrib file system
      • Linux: libcgroup, libprocess, libev, etc.
  • 30. Mesos – dynamics
     [Diagram labels: scheduled apps; HA services; distrib frameworks; Chronos, distrib cron; Marathon, distrib init.d; Mesos, distrib kernel; GNU/Linux; Linux]
  • 32. Example: Resource Offer in a Two-Level Scheduler mesos.apache.org/documentation/latest/mesos-architecture/
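A two-level scheduler splits the decision in two: the master decides which framework is offered which free resources, and each framework decides which of its own tasks to run on an offer. The sketch below is a toy, in-memory simulation of that idea, not the Mesos API; every class, method, and task name in it is hypothetical, and the "offer each worker to each framework in turn" policy is an assumption made for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class Offer:
    worker: str
    cpus: float
    mem_mb: int


@dataclass
class Task:
    name: str
    cpus: float
    mem_mb: int


@dataclass
class Framework:
    """Second level: picks which of its pending tasks fit an offer."""
    name: str
    pending: list = field(default_factory=list)

    def resource_offer(self, offer):
        """Return tasks to launch on this offer; an empty list declines it."""
        launched, cpus, mem = [], offer.cpus, offer.mem_mb
        for task in list(self.pending):
            if task.cpus <= cpus and task.mem_mb <= mem:
                launched.append(task)
                self.pending.remove(task)
                cpus -= task.cpus
                mem -= task.mem_mb
        return launched


class Master:
    """First level: decides which framework is offered which free resources."""

    def __init__(self, workers):
        self.free = dict(workers)  # worker name -> (cpus, mem_mb)
        self.placements = []       # (framework name, task name, worker name)

    def offer_round(self, frameworks):
        # Assumed policy: offer every worker's free resources to each
        # framework in list order; unused resources go to the next one.
        for fw in frameworks:
            for worker, (cpus, mem) in list(self.free.items()):
                if cpus <= 0 or mem <= 0:
                    continue  # nothing left to offer on this worker
                for task in fw.resource_offer(Offer(worker, cpus, mem)):
                    cpus -= task.cpus
                    mem -= task.mem_mb
                    self.placements.append((fw.name, task.name, worker))
                self.free[worker] = (cpus, mem)


master = Master({"worker-1": (4.0, 4096), "worker-2": (2.0, 2048)})
batch = Framework("batch", [Task("job-a", 2.0, 1024), Task("job-b", 2.0, 1024)])
service = Framework("service", [Task("web", 1.0, 512), Task("cache", 1.0, 1024)])
master.offer_round([batch, service])
for placement in master.placements:
    print(placement)
```

The master never inspects a framework's task queue, and a framework never sees resources it was not offered; that separation is what lets different schedulers coexist on one pool.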
  • 35. Opposite Ends of the Spectrum, One Common Substrate Solaris Zones Built-in /
 bare metal Linux CGroups Hypervisors
  • 36. Opposite Ends of the Spectrum, One Common Substrate Request /
 Response Batch
  • 37. Case Study: Twitter (bare metal / on premise)
     “Mesos is the cornerstone of our elastic compute infrastructure – it’s how we build all our new services and is critical for Twitter’s continued success at scale. It’s one of the primary keys to our data center efficiency.”
     Chris Fry, SVP Engineering
     blog.twitter.com/2013/mesos-graduates-from-apache-incubation
     wired.com/gadgetlab/2013/11/qa-with-chris-fry/
      • key services run in production: analytics, typeahead, ads
      • Twitter engineers rely on Mesos to build all new services
      • instead of thinking about static machines, engineers think about resources like CPU, memory and disk
      • allows services to scale and leverage a shared pool of servers across datacenters efficiently
      • reduces the time between prototyping and launching
  • 38. Case Study: Airbnb (fungible cloud infrastructure)
     “We think we might be pushing data science in the field of travel more so than anyone has ever done before… a smaller number of engineers can have higher impact through automation on Mesos.”
     Mike Curtis, VP Engineering
     gigaom.com/2013/07/29/airbnb-is-engineering-itself-into-a-data...
      • improves resource management and efficiency
      • helps advance engineering strategy of building small teams that can move fast
      • key to letting engineers make the most of AWS-based infrastructure beyond just Hadoop
      • allowed company to migrate off Elastic MapReduce
      • enables use of Hadoop along with Chronos, Spark, Storm, etc.
  • 39. DIY
  • 41.
  • 42.
  • 43. Elastic Mesos
     [Diagram: Master 1, Master 2, Master 3, each labeled NN and ZK, each listed with five Worker nodes labeled DN]
  • 44.
  • 45. Save the Date: Apr 3, 2014 Mesos Summit (or something)
  • 46. Resources
      • Apache Mesos Project: mesos.apache.org
      • Twitter: @ApacheMesos
      • Mesosphere: mesosphere.io
      • Tutorials: mesosphere.io/learn
      • Documentation: mesos.apache.org/documentation
      • 2011 USENIX Research Paper: usenix.org/legacy/event/nsdi11/tech/full_papers/Hindman_new.pdf
      • Collected Notes/Archives: goo.gl/jPtTP
  • 47.
  • 48.
      • Former Airbnb engineers simplify Mesos to manage data jobs in the cloud
        Jordan Novet
        VentureBeat (2013-11-12)
        venturebeat.com/2013/11/12/former-airbnb-engineers-simplify...
      • Mesosphere Adds Docker Support To Its Mesos-Based Operating System For The Data Center
        Frederic Lardinois
        TechCrunch (2013-09-26)
        techcrunch.com/2013/09/26/mesosphere...
      • Play Framework Grid Deployment with Mesos
        James Ward, Flo Leibert, et al.
        Typesafe blog (2013-09-19)
        typesafe.com/blog/play-framework-grid...
      • Mesosphere Launches Marathon Framework
        Adrian Bridgwater
        Dr. Dobbs (2013-09-18)
        drdobbs.com/open-source/mesosphere...
      • New open source tech Marathon wants to make your data center run like Google’s
        Derrick Harris
        GigaOM (2013-09-04)
        gigaom.com/2013/09/04/new-open-source...
      • Running batch and long-running, highly available service jobs on the same cluster
        Ben Lorica
        O’Reilly (2013-09-01)
        strata.oreilly.com/2013/09/running-batch...

  • 49. Enterprise Data Workflows with Cascading
     O’Reilly, 2013
     shop.oreilly.com/product/0636920028536.do
     monthly newsletter for updates, events, conference summaries, etc.:
     liber118.com/pxn/