SlideShare una empresa de Scribd logo
1 de 95
InfoQ.com: News & Community Site
• Over 1,000,000 software developers, architects and CTOs read the site world-
wide every month
• 250,000 senior developers subscribe to our weekly newsletter
• Published in 4 languages (English, Chinese, Japanese and Brazilian
Portuguese)
• Post content from our QCon conferences
• 2 dedicated podcast channels: The InfoQ Podcast, with a focus on
Architecture and The Engineering Culture Podcast, with a focus on building
• 96 deep dives on innovative topics packed as downloadable emags and
minibooks
• Over 40 new content items per week
Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
datadog-cloud
Purpose of QCon
- to empower software development by facilitating the spread of
knowledge and innovation
Strategy
- practitioner-driven conference designed for YOU: influencers of
change and innovation in your teams
- speakers and topics driving the evolution and innovation
- connecting and catalyzing the influencers and innovators
Highlights
- attended by more than 12,000 delegates since 2007
- held in 9 cities worldwide
Presented at QCon San Francisco
www.qconsf.com
The Evolution of a Data Project
The Evolution of a Data Project
Python
script
The Evolution of a Data Project
Python
script
SQL on 

live DB
The Evolution of a Data Project
Python
script
SQL on
reporting DB
SQL on 

live DB
The Evolution of a Data Project
Python
script
SQL on
reporting DB
SQL on 

live DB
Terrible
confusion
The Evolution of a Data Project
Python
script
SQL on
reporting DB
SQL on 

live DB
Terrible
confusion
Hadoop / Spark
cluster
What needs fixing
image: Pexels
What needs fixing
image: Pexels
• One cluster: data lock-in.
What needs fixing
image: Pexels
• One cluster: data lock-in.
• Want cluster time? You have to wait.
What needs fixing
image: Pexels
• One cluster: data lock-in.
• Want cluster time? You have to wait.
• Clusters are underutilized and EXPENSIVE
Elastic Big Data
Platform @ Datadog
Doug Daniels
Director, Engineering
WHOM
What’s our big data platform do?
Data Engineers
Data Scientists
WHOM
What’s our big data platform do?
Data Engineers
Data Scientists
do
WHAT
App features
Statistical Analysis/ML
Ad-hoc investigation
WHOM
What’s our big data platform do?
Data Engineers
Data Scientists
do
WHAT
App features
Statistical Analysis/ML
Ad-hoc investigation
WITH
Spark
Hadoop (Pig)
Python (Luigi)
with
Exploring the platform
COPIOUS

TOOLING
CLOUD
STORAGE
ELASTIC
COMPUTE
CLOUD STORAGE
What do we store?
150 Integrations
…and more
What’s time series data?
timestamp 1447020511
metric system.cpu.idle
value 98.16687
tags host:i-xyz,
role:cassandra, …
We collect
over a trillion
of these per day
…and growing!
Where to put the petabytes?
Amazon S3.
Amazon S3
How data gets to S3
116
- Buffer
- Sort + Dedupe
- Upload
GO
- Partition + Sort
- Write Parquet
- Update Metastore
LUIGI/SPARK/PIG
HIVE METASTORE
Internal Format
AMAZON S3
Parquet Metadata
Kafka
Isn’t this a job for HDFS?
What we don’t love about HDFS
What we don’t love about HDFS
• Causes the “one cluster” problem
What we don’t love about HDFS
• Causes the “one cluster” problem
• Come for the storage, get stuck with the servers
What we don’t love about HDFS
• Causes the “one cluster” problem
• Come for the storage, get stuck with the servers
• No Java? No data!
S3 is flexible!
• Read data from as many clusters as you want
S3 is flexible!
• Read data from as many clusters as you want
• Store unlimited stuff(*) with no management
* Accepting laws of physics and your credit card limit
S3 is flexible!
• Read data from as many clusters as you want
• Store unlimited stuff(*) with no management
• Rock solid: durability (99.999999999), availability (99.99)
* Accepting laws of physics and your credit card limit
S3 is flexible!
• Read data from as many clusters as you want
• Store unlimited stuff(*) with no management
• Rock solid: durability (99.999999999), availability (99.99)
• Access from any programming language
* Accepting laws of physics and your credit card limit
Decouple data and compute
(BREAK THE RULES!)
Breaking the rules is fine.
In benchmarks: S3 is ~2X slower than HDFS
Breaking the rules is fine.
In benchmarks: S3 is ~2X slower than HDFS
It’s not all roses
Listing is slooooow
(A CAUTIONARY TALE)
How to fix slow listing
Bigger filesParallelize it
HDFS
No way to quickly move data
Task
Intermediate Final
write atomic move
HDFS
No way to quickly move data
Task
Intermediate Final
write atomic move
S3 Task
write
No way to quickly move data
• Say goodbye to speculative execution
No way to quickly move data
• Say goodbye to speculative execution
• Say hello to better task timeouts
But really: We 💜S3
This is a great system.
✓ Data accessible from many clusters
✓ Storage is easy to manage
✓ It’s a multi-language paradise up in here
ELASTIC
COMPUTE
CLOUD
STORAGE
One cluster to
compute it all
TRADITIONALLY
Instead, we run many, many clusters
• New cluster for every
automated job
• 10–20 clusters at a time
• Median lifetime: 2hrs
Why so many clusters?
Total isolation
We know what’s happening and why
No more waiting on loaded clusters
• Tailor each cluster to the work you want to do
• Scale up when you need results faster
• Data scientists and data engineers don’t have to wait
🕐🕓🕥
Pick the best hardware for each job
for CPU-bound jobs
r3
if you don’t care (cheap!)
== ~30% savings over general purpose hardware
c3
for memory-bound jobs
m1.xlarge
100% spot-instance
clusters, all the time.*
* (ok, most of the time)
100% spot-instance
clusters, all the time.*
* (ok, most of the time)
Ridiculous
savings!
Disappearing
clusters!
How we do spot clusters
• Bid the on-demand price,
pay the spot price
In the big data platform
How we do spot clusters
• Bid the on-demand price,
pay the spot price
• Fallback to on-demand
instances if you can’t get spot
In the big data platform
How we do spot clusters
• Bid the on-demand price,
pay the spot price
• Fallback to on-demand
instances if you can’t get spot
• Monitor everything: jobs,
clusters, spot market
In the big data platform
How we do spot clusters
• Bid the on-demand price,
pay the spot price
• Fallback to on-demand
instances if you can’t get spot
• Monitor everything: jobs,
clusters, spot market
• ☞ Save up to 80% off the 

on-demand price
In the big data platform
Switch hardware when the market gets volatile
Monitor the spot price
We like this strategy a lot!
Cluster is oversubscribed; everyone waiting in line to do their work
Lots of expensive hardware sits idle when everyone’s gone
✓ No waiting for the cluster you need
✓ No waste from hardware sitting idle
✓ Spot clusters are affordable enough to use everywhere
What’s challenging,
though?
Many things that disappear.
ELASTIC
COMPUTE
CLOUD
STORAGE
COPIOUS

TOOLING
Web and APIs
Platform as a service
CLI
Jobs, Clusters, Schedules, Users, Code, Monitoring, Logs, and more
Big Data Platform Architecture
DATA Amazon S3
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
WORKER Pig Workers Spark Workers Luigi Workers
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
WORKER Pig Workers Spark Workers Luigi Workers
STORAGE Metadata DB Queueing Logs
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
WEB Web API
WORKER Pig Workers Spark Workers Luigi Workers
STORAGE Metadata DB Queueing Logs
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
WEB Web API
WORKER Pig Workers Spark Workers Luigi Workers
USER CLI API Clients Job Scheduler
STORAGE Metadata DB Queueing Logs
Big Data Platform Architecture
DATA Amazon S3
CLUSTER EMR
WEB Web API
WORKER Pig Workers Spark Workers Luigi Workers
USER CLI API Clients Job Scheduler
STORAGE Metadata DB Queueing Logs
Datadog
Monitoring
How to find the right cluster
when they disappear?
Cluster tagging 

for discovery
#anomaly
-detection
#monitor-report
How to monitor many
disappearing clusters?
Dashboards Monitors
Dynamic Monitoring on Tags
anomaly-detection
cluster_tags: anomaly-detection
How to debug problems
when the cluster’s gone?
Debugging In a Post-Cluster World
Debugging In a Post-Cluster World
Send all logs to S3
• HDFS
• YARN
• Pig
• Spark
Debugging In a Post-Cluster World
Visualize the pipeline
• Lipstick for Pig
• Spark History Server
• Luigi task flow
Send all logs to S3
• HDFS
• YARN
• Pig
• Spark
Debugging In a Post-Cluster World
Visualize the pipeline
• Lipstick for Pig
• Spark History Server
• Luigi task flow
Preserve historical
monitoring data
Keep history, by tag, after
the cluster disappears
Send all logs to S3
• HDFS
• YARN
• Pig
• Spark
How to handle
certain cluster failure
in your jobs?
Automatic cleanup and restart
Luigi: design for failure.
A B
Automatic cleanup and restart
Luigi: design for failure.
B
Automatic cleanup and restart
Luigi: design for failure.
❌
Automatic cleanup and restart
Luigi: design for failure.
ELASTIC
COMPUTE
CLOUD
STORAGE
COPIOUS

TOOLING
Recommendations 

for Cloud Big Data
Recommendations 

for Cloud Big Data
• Use S3 for permanent data, not HDFS
Recommendations 

for Cloud Big Data
• Use S3 for permanent data, not HDFS
• Start from EMR if building yourself
Recommendations 

for Cloud Big Data
• Use S3 for permanent data, not HDFS
• Start from EMR if building yourself
• Look into a PaaS: Netflix Genie, Qubole, Databricks
Recommendations 

for Cloud Big Data
• Use S3 for permanent data, not HDFS
• Start from EMR if building yourself
• Look into a PaaS: Netflix Genie, Qubole, Databricks
• Tag your clusters for dynamic monitoring
Recommendations 

for Cloud Big Data
• Use S3 for permanent data, not HDFS
• Start from EMR if building yourself
• Look into a PaaS: Netflix Genie, Qubole, Databricks
• Tag your clusters for dynamic monitoring
• Design for failure with a workflow tool (Luigi, Airflow)
Thanks!
Want to work with us on Spark, Hadoop,
Kafka, Parquet, and more?
jobs.datadoghq.com
DM me @ddaniels888 or doug@datadoghq.com
Watch the video with slide
synchronization on InfoQ.com!
https://www.infoq.com/presentations/
datadog-cloud

Más contenido relacionado

La actualidad más candente

Lessons Learned - Monitoring the Data Pipeline at Hulu
Lessons Learned - Monitoring the Data Pipeline at HuluLessons Learned - Monitoring the Data Pipeline at Hulu
Lessons Learned - Monitoring the Data Pipeline at HuluDataWorks Summit
 
Monitoring and scaling postgres at datadog
Monitoring and scaling postgres at datadogMonitoring and scaling postgres at datadog
Monitoring and scaling postgres at datadogSeth Rosenblum
 
Provisioning Datadog with Terraform
Provisioning Datadog with TerraformProvisioning Datadog with Terraform
Provisioning Datadog with TerraformMatt Spurlin
 
The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)Eva Tse
 
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick EvansRealtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick EvansSpark Summit
 
netflix-real-time-data-strata-talk
netflix-real-time-data-strata-talknetflix-real-time-data-strata-talk
netflix-real-time-data-strata-talkDanny Yuan
 
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017Monal Daxini
 
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Codemotion
 
British Gas Connected Homes: Data Engineering
British Gas Connected Homes: Data EngineeringBritish Gas Connected Homes: Data Engineering
British Gas Connected Homes: Data EngineeringDataStax Academy
 
Chronografand dashboarding
Chronografand dashboardingChronografand dashboarding
Chronografand dashboardingInfluxData
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...DataStax Academy
 
Building highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisBuilding highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisParis Data Engineers !
 
Netflix Keystone—Cloud scale event processing pipeline
Netflix Keystone—Cloud scale event processing pipelineNetflix Keystone—Cloud scale event processing pipeline
Netflix Keystone—Cloud scale event processing pipelineMonal Daxini
 
Ml sprint16 thesis_intro
Ml sprint16 thesis_introMl sprint16 thesis_intro
Ml sprint16 thesis_introThanhNguyen3805
 
Taboola Road To Scale With Apache Spark
Taboola Road To Scale With Apache SparkTaboola Road To Scale With Apache Spark
Taboola Road To Scale With Apache Sparktsliwowicz
 
Scylla Summit 2022: Stream Processing with ScyllaDB
Scylla Summit 2022: Stream Processing with ScyllaDBScylla Summit 2022: Stream Processing with ScyllaDB
Scylla Summit 2022: Stream Processing with ScyllaDBScyllaDB
 
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...Spark Summit
 
Scalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestScalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestKrishna Gade
 
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...Spark Summit
 
Managing Cassandra Databases with OpenStack Trove
Managing Cassandra Databases with OpenStack TroveManaging Cassandra Databases with OpenStack Trove
Managing Cassandra Databases with OpenStack TroveTesora
 

La actualidad más candente (20)

Lessons Learned - Monitoring the Data Pipeline at Hulu
Lessons Learned - Monitoring the Data Pipeline at HuluLessons Learned - Monitoring the Data Pipeline at Hulu
Lessons Learned - Monitoring the Data Pipeline at Hulu
 
Monitoring and scaling postgres at datadog
Monitoring and scaling postgres at datadogMonitoring and scaling postgres at datadog
Monitoring and scaling postgres at datadog
 
Provisioning Datadog with Terraform
Provisioning Datadog with TerraformProvisioning Datadog with Terraform
Provisioning Datadog with Terraform
 
The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)The evolution of the big data platform @ Netflix (OSCON 2015)
The evolution of the big data platform @ Netflix (OSCON 2015)
 
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick EvansRealtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
 
netflix-real-time-data-strata-talk
netflix-real-time-data-strata-talknetflix-real-time-data-strata-talk
netflix-real-time-data-strata-talk
 
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
AWS Re-Invent 2017 Netflix Keystone SPaaS - Monal Daxini - Abd320 2017
 
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
Monitoring Big Data Systems Done "The Simple Way" - Demi Ben-Ari - Codemotion...
 
British Gas Connected Homes: Data Engineering
British Gas Connected Homes: Data EngineeringBritish Gas Connected Homes: Data Engineering
British Gas Connected Homes: Data Engineering
 
Chronografand dashboarding
Chronografand dashboardingChronografand dashboarding
Chronografand dashboarding
 
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
Cassandra Day SV 2014: Scaling Hulu’s Video Progress Tracking Service with Ap...
 
Building highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisBuilding highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin François
 
Netflix Keystone—Cloud scale event processing pipeline
Netflix Keystone—Cloud scale event processing pipelineNetflix Keystone—Cloud scale event processing pipeline
Netflix Keystone—Cloud scale event processing pipeline
 
Ml sprint16 thesis_intro
Ml sprint16 thesis_introMl sprint16 thesis_intro
Ml sprint16 thesis_intro
 
Taboola Road To Scale With Apache Spark
Taboola Road To Scale With Apache SparkTaboola Road To Scale With Apache Spark
Taboola Road To Scale With Apache Spark
 
Scylla Summit 2022: Stream Processing with ScyllaDB
Scylla Summit 2022: Stream Processing with ScyllaDBScylla Summit 2022: Stream Processing with ScyllaDB
Scylla Summit 2022: Stream Processing with ScyllaDB
 
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...
Accelerating Spark Genome Sequencing in Cloud—A Data Driven Approach, Case St...
 
Scalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at PinterestScalable and Reliable Logging at Pinterest
Scalable and Reliable Logging at Pinterest
 
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...
Using SparkML to Power a DSaaS (Data Science as a Service): Spark Summit East...
 
Managing Cassandra Databases with OpenStack Trove
Managing Cassandra Databases with OpenStack TroveManaging Cassandra Databases with OpenStack Trove
Managing Cassandra Databases with OpenStack Trove
 

Destacado

Application Monitoring using Datadog
Application Monitoring using DatadogApplication Monitoring using Datadog
Application Monitoring using DatadogMukta Aphale
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixC4Media
 
водопад виктория кантария
водопад виктория кантарияводопад виктория кантария
водопад виктория кантарияnatasimkina
 
Presentation1
Presentation1Presentation1
Presentation1Zak_
 
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013Amazon Web Services
 
Lifting the Blinds: Monitoring Windows Server 2012
Lifting the Blinds: Monitoring Windows Server 2012Lifting the Blinds: Monitoring Windows Server 2012
Lifting the Blinds: Monitoring Windows Server 2012Datadog
 
20161108 datadog and_sushi
20161108 datadog and_sushi20161108 datadog and_sushi
20161108 datadog and_sushiMasahiro Hattori
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleXavier Amatriain
 
Scaling Dropbox
Scaling DropboxScaling Dropbox
Scaling DropboxC4Media
 
Recommendation at Netflix Scale
Recommendation at Netflix ScaleRecommendation at Netflix Scale
Recommendation at Netflix ScaleJustin Basilico
 
Netflix Global Cloud Architecture
Netflix Global Cloud ArchitectureNetflix Global Cloud Architecture
Netflix Global Cloud ArchitectureAdrian Cockcroft
 
Netflix viewing data architecture evolution - QCon 2014
Netflix viewing data architecture evolution - QCon 2014Netflix viewing data architecture evolution - QCon 2014
Netflix viewing data architecture evolution - QCon 2014Philip Fisher-Ogden
 
Netflix Big Data Paris 2017
Netflix Big Data Paris 2017Netflix Big Data Paris 2017
Netflix Big Data Paris 2017Jason Flittner
 
NETFLIX- swot pest porter v3
NETFLIX- swot pest porter v3NETFLIX- swot pest porter v3
NETFLIX- swot pest porter v3Kyle Robinson
 
Netflix Business Model & Strategy
Netflix Business Model & StrategyNetflix Business Model & Strategy
Netflix Business Model & StrategyEvgenii Gvozdev
 
The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017LinkedIn
 

Destacado (16)

Application Monitoring using Datadog
Application Monitoring using DatadogApplication Monitoring using Datadog
Application Monitoring using Datadog
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFix
 
водопад виктория кантария
водопад виктория кантарияводопад виктория кантария
водопад виктория кантария
 
Presentation1
Presentation1Presentation1
Presentation1
 
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013
Maximizing Amazon S3 Performance (STG304) | AWS re:Invent 2013
 
Lifting the Blinds: Monitoring Windows Server 2012
Lifting the Blinds: Monitoring Windows Server 2012Lifting the Blinds: Monitoring Windows Server 2012
Lifting the Blinds: Monitoring Windows Server 2012
 
20161108 datadog and_sushi
20161108 datadog and_sushi20161108 datadog and_sushi
20161108 datadog and_sushi
 
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix ScaleQcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
Qcon SF 2013 - Machine Learning & Recommender Systems @ Netflix Scale
 
Scaling Dropbox
Scaling DropboxScaling Dropbox
Scaling Dropbox
 
Recommendation at Netflix Scale
Recommendation at Netflix ScaleRecommendation at Netflix Scale
Recommendation at Netflix Scale
 
Netflix Global Cloud Architecture
Netflix Global Cloud ArchitectureNetflix Global Cloud Architecture
Netflix Global Cloud Architecture
 
Netflix viewing data architecture evolution - QCon 2014
Netflix viewing data architecture evolution - QCon 2014Netflix viewing data architecture evolution - QCon 2014
Netflix viewing data architecture evolution - QCon 2014
 
Netflix Big Data Paris 2017
Netflix Big Data Paris 2017Netflix Big Data Paris 2017
Netflix Big Data Paris 2017
 
NETFLIX- swot pest porter v3
NETFLIX- swot pest porter v3NETFLIX- swot pest porter v3
NETFLIX- swot pest porter v3
 
Netflix Business Model & Strategy
Netflix Business Model & StrategyNetflix Business Model & Strategy
Netflix Business Model & Strategy
 
The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017The Top Skills That Can Get You Hired in 2017
The Top Skills That Can Get You Hired in 2017
 

Similar a Elastic Data Analytics Platform @Datadog

TenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingTenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingChen-en Lu
 
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & DataductAmazon Web Services
 
Headaches and Breakthroughs in Building Continuous Applications
Headaches and Breakthroughs in Building Continuous ApplicationsHeadaches and Breakthroughs in Building Continuous Applications
Headaches and Breakthroughs in Building Continuous ApplicationsDatabricks
 
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...Landon Robinson
 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdbjixuan1989
 
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...Amazon Web Services
 
Cloud Big Data Architectures
Cloud Big Data ArchitecturesCloud Big Data Architectures
Cloud Big Data ArchitecturesLynn Langit
 
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...Gary Arora
 
Lambda Architectures in Practice
Lambda Architectures in PracticeLambda Architectures in Practice
Lambda Architectures in PracticeC4Media
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Precisely
 
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...Dirk Petersen
 
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...Landon Robinson
 
T1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsT1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsAmazon Web Services
 
Managing your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed LuxembourgManaging your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed LuxembourgDavid Pilato
 
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with NotesYow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with NotesAdrian Cockcroft
 
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAccelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAlluxio, Inc.
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDBDenny Lee
 
AWS Summit 2013 | Singapore - Understanding AWS Storage Options
AWS Summit 2013 | Singapore - Understanding AWS Storage OptionsAWS Summit 2013 | Singapore - Understanding AWS Storage Options
AWS Summit 2013 | Singapore - Understanding AWS Storage OptionsAmazon Web Services
 

Similar a Elastic Data Analytics Platform @Datadog (20)

TenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience SharingTenMax Data Pipeline Experience Sharing
TenMax Data Pipeline Experience Sharing
 
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
(BDT404) Large-Scale ETL Data Flows w/AWS Data Pipeline & Dataduct
 
Headaches and Breakthroughs in Building Continuous Applications
Headaches and Breakthroughs in Building Continuous ApplicationsHeadaches and Breakthroughs in Building Continuous Applications
Headaches and Breakthroughs in Building Continuous Applications
 
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...
Spark + AI Summit 2019: Headaches and Breakthroughs in Building Continuous Ap...
 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdb
 
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...
Amazon RDS for MySQL – Diagnostics, Security, and Data Migration (DAT302) | A...
 
Cloud Big Data Architectures
Cloud Big Data ArchitecturesCloud Big Data Architectures
Cloud Big Data Architectures
 
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
Leapfrog into Serverless - a Deloitte-Amtrak Case Study | Serverless Confere...
 
Lambda Architectures in Practice
Lambda Architectures in PracticeLambda Architectures in Practice
Lambda Architectures in Practice
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
 
Data Science
Data ScienceData Science
Data Science
 
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
BIO IT 15 - Are Your Researchers Paying Too Much for Their Cloud-Based Data B...
 
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...
Spark + AI Summit 2019: Apache Spark Listeners: A Crash Course in Fast, Easy ...
 
T1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsT1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on aws
 
Managing your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed LuxembourgManaging your black friday logs Voxxed Luxembourg
Managing your black friday logs Voxxed Luxembourg
 
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with NotesYow Conference Dec 2013 Netflix Workshop Slides with Notes
Yow Conference Dec 2013 Netflix Workshop Slides with Notes
 
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAccelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDB
 
AWS Summit 2013 | Singapore - Understanding AWS Storage Options
AWS Summit 2013 | Singapore - Understanding AWS Storage OptionsAWS Summit 2013 | Singapore - Understanding AWS Storage Options
AWS Summit 2013 | Singapore - Understanding AWS Storage Options
 

Más de C4Media

Streaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoStreaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoC4Media
 
Next Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileNext Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileC4Media
 
Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020C4Media
 
Understand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsUnderstand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsC4Media
 
Kafka Needs No Keeper
Kafka Needs No KeeperKafka Needs No Keeper
Kafka Needs No KeeperC4Media
 
High Performing Teams Act Like Owners
High Performing Teams Act Like OwnersHigh Performing Teams Act Like Owners
High Performing Teams Act Like OwnersC4Media
 
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaDoes Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaC4Media
 
Service Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideService Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideC4Media
 
Shifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDShifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDC4Media
 
CI/CD for Machine Learning
CI/CD for Machine LearningCI/CD for Machine Learning
CI/CD for Machine LearningC4Media
 
Fault Tolerance at Speed
Fault Tolerance at SpeedFault Tolerance at Speed
Fault Tolerance at SpeedC4Media
 
Architectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsArchitectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsC4Media
 
ML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsC4Media
 
Build Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerBuild Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerC4Media
 
User & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleUser & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleC4Media
 
Scaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeScaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeC4Media
 
Make Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereMake Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereC4Media
 
The Talk You've Been Await-ing For
The Talk You've Been Await-ing ForThe Talk You've Been Await-ing For
The Talk You've Been Await-ing ForC4Media
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data EngineeringC4Media
 
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreAutomated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreC4Media
 

Más de C4Media (20)

Streaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live VideoStreaming a Million Likes/Second: Real-Time Interactions on Live Video
Streaming a Million Likes/Second: Real-Time Interactions on Live Video
 
Next Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy MobileNext Generation Client APIs in Envoy Mobile
Next Generation Client APIs in Envoy Mobile
 
Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020Software Teams and Teamwork Trends Report Q1 2020
Software Teams and Teamwork Trends Report Q1 2020
 
Understand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java ApplicationsUnderstand the Trade-offs Using Compilers for Java Applications
Understand the Trade-offs Using Compilers for Java Applications
 
Kafka Needs No Keeper
Kafka Needs No KeeperKafka Needs No Keeper
Kafka Needs No Keeper
 
High Performing Teams Act Like Owners
High Performing Teams Act Like OwnersHigh Performing Teams Act Like Owners
High Performing Teams Act Like Owners
 
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to JavaDoes Java Need Inline Types? What Project Valhalla Can Bring to Java
Does Java Need Inline Types? What Project Valhalla Can Bring to Java
 
Service Meshes- The Ultimate Guide
Service Meshes- The Ultimate GuideService Meshes- The Ultimate Guide
Service Meshes- The Ultimate Guide
 
Shifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CDShifting Left with Cloud Native CI/CD
Shifting Left with Cloud Native CI/CD
 
CI/CD for Machine Learning
CI/CD for Machine LearningCI/CD for Machine Learning
CI/CD for Machine Learning
 
Fault Tolerance at Speed
Fault Tolerance at SpeedFault Tolerance at Speed
Fault Tolerance at Speed
 
Architectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep SystemsArchitectures That Scale Deep - Regaining Control in Deep Systems
Architectures That Scale Deep - Regaining Control in Deep Systems
 
ML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.jsML in the Browser: Interactive Experiences with Tensorflow.js
ML in the Browser: Interactive Experiences with Tensorflow.js
 
Build Your Own WebAssembly Compiler
Build Your Own WebAssembly CompilerBuild Your Own WebAssembly Compiler
Build Your Own WebAssembly Compiler
 
User & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix ScaleUser & Device Identity for Microservices @ Netflix Scale
User & Device Identity for Microservices @ Netflix Scale
 
Scaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's EdgeScaling Patterns for Netflix's Edge
Scaling Patterns for Netflix's Edge
 
Make Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home EverywhereMake Your Electron App Feel at Home Everywhere
Make Your Electron App Feel at Home Everywhere
 
The Talk You've Been Await-ing For
The Talk You've Been Await-ing ForThe Talk You've Been Await-ing For
The Talk You've Been Await-ing For
 
Future of Data Engineering
Future of Data EngineeringFuture of Data Engineering
Future of Data Engineering
 
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreAutomated Testing for Terraform, Docker, Packer, Kubernetes, and More
Automated Testing for Terraform, Docker, Packer, Kubernetes, and More
 

Último

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 

Último (20)

Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 

Elastic Data Analytics Platform @Datadog

  • 1.
  • 2. InfoQ.com: News & Community Site • Over 1,000,000 software developers, architects and CTOs read the site world- wide every month • 250,000 senior developers subscribe to our weekly newsletter • Published in 4 languages (English, Chinese, Japanese and Brazilian Portuguese) • Post content from our QCon conferences • 2 dedicated podcast channels: The InfoQ Podcast, with a focus on Architecture and The Engineering Culture Podcast, with a focus on building • 96 deep dives on innovative topics packed as downloadable emags and minibooks • Over 40 new content items per week Watch the video with slide synchronization on InfoQ.com! https://www.infoq.com/presentations/ datadog-cloud
  • 3. Purpose of QCon - to empower software development by facilitating the spread of knowledge and innovation Strategy - practitioner-driven conference designed for YOU: influencers of change and innovation in your teams - speakers and topics driving the evolution and innovation - connecting and catalyzing the influencers and innovators Highlights - attended by more than 12,000 delegates since 2007 - held in 9 cities worldwide Presented at QCon San Francisco www.qconsf.com
  • 4. The Evolution of a Data Project
  • 5. The Evolution of a Data Project Python script
  • 6. The Evolution of a Data Project Python script SQL on 
 live DB
  • 7. The Evolution of a Data Project Python script SQL on reporting DB SQL on 
 live DB
  • 8. The Evolution of a Data Project Python script SQL on reporting DB SQL on 
 live DB Terrible confusion
  • 9. The Evolution of a Data Project Python script SQL on reporting DB SQL on 
 live DB Terrible confusion Hadoop / Spark cluster
  • 11. What needs fixing image: Pexels • One cluster: data lock-in.
  • 12. What needs fixing image: Pexels • One cluster: data lock-in. • Want cluster time? You have to wait.
  • 13. What needs fixing image: Pexels • One cluster: data lock-in. • Want cluster time? You have to wait. • Clusters are underutilized and EXPENSIVE
  • 14. Elastic Big Data Platform @ Datadog Doug Daniels Director, Engineering
  • 15. WHOM What’s our big data platform do? Data Engineers Data Scientists
  • 16. WHOM What’s our big data platform do? Data Engineers Data Scientists do WHAT App features Statistical Analysis/ML Ad-hoc investigation
  • 17. WHOM What’s our big data platform do? Data Engineers Data Scientists do WHAT App features Statistical Analysis/ML Ad-hoc investigation WITH Spark Hadoop (Pig) Python (Luigi) with
  • 19.
  • 21. What do we store?
  • 23. What’s time series data? timestamp 1447020511 metric system.cpu.idle value 98.16687 tags host:i-xyz, role:cassandra, …
  • 24. We collect over a trillion of these per day …and growing!
  • 25. Where to put the petabytes? Amazon S3. Amazon S3
  • 26. How data gets to S3 116 - Buffer - Sort + Dedupe - Upload GO - Partition + Sort - Write Parquet - Update Metastore LUIGI/SPARK/PIG HIVE METASTORE Internal Format AMAZON S3 Parquet Metadata Kafka
  • 27. Isn’t this a job for HDFS?
  • 28. What we don’t love about HDFS
  • 29. What we don’t love about HDFS • Causes the “one cluster” problem
  • 30. What we don’t love about HDFS • Causes the “one cluster” problem • Come for the storage, get stuck with the servers
  • 31. What we don’t love about HDFS • Causes the “one cluster” problem • Come for the storage, get stuck with the servers • No Java? No data!
  • 32. S3 is flexible! • Read data from as many clusters as you want
  • 33. S3 is flexible! • Read data from as many clusters as you want • Store unlimited stuff(*) with no management * Accepting laws of physics and your credit card limit
  • 34. S3 is flexible! • Read data from as many clusters as you want • Store unlimited stuff(*) with no management • Rock solid: durability (99.999999999), availability (99.99) * Accepting laws of physics and your credit card limit
  • 35. S3 is flexible! • Read data from as many clusters as you want • Store unlimited stuff(*) with no management • Rock solid: durability (99.999999999), availability (99.99) • Access from any programming language * Accepting laws of physics and your credit card limit
  • 36. Decouple data and compute (BREAK THE RULES!)
  • 37. Breaking the rules is fine. In benchmarks: S3 is ~2X slower than HDFS
  • 38. Breaking the rules is fine. In benchmarks: S3 is ~2X slower than HDFS
  • 39. It’s not all roses
  • 40. Listing is slooooow (A CAUTIONARY TALE)
  • 41. How to fix slow listing Bigger filesParallelize it
  • 42. HDFS No way to quickly move data Task Intermediate Final write atomic move
  • 43. HDFS No way to quickly move data Task Intermediate Final write atomic move S3 Task write
  • 44. No way to quickly move data • Say goodbye to speculative execution
  • 45. No way to quickly move data • Say goodbye to speculative execution • Say hello to better task timeouts
  • 46. But really: We 💜S3 This is a great system. ✓ Data accessible from many clusters ✓ Storage is easy to manage ✓ It’s a multi-language paradise up in here
  • 48. One cluster to compute it all TRADITIONALLY
  • 49. Instead, we run many, many clusters • New cluster for every automated job • 10–20 clusters at a time • Median lifetime: 2hrs
  • 50. Why so many clusters?
  • 51. Total isolation We know what’s happening and why
  • 52. No more waiting on loaded clusters • Tailor each cluster to the work you want to do • Scale up when you need results faster • Data scientists and data engineers don’t have to wait 🕐🕓🕥
  • 53. Pick the best hardware for each job for CPU-bound jobs r3 if you don’t care (cheap!) == ~30% savings over general purpose hardware c3 for memory-bound jobs m1.xlarge
  • 54. 100% spot-instance clusters, all the time.* * (ok, most of the time)
  • 55. 100% spot-instance clusters, all the time.* * (ok, most of the time) Ridiculous savings! Disappearing clusters!
  • 56. How we do spot clusters • Bid the on-demand price, pay the spot price In the big data platform
  • 57. How we do spot clusters • Bid the on-demand price, pay the spot price • Fallback to on-demand instances if you can’t get spot In the big data platform
  • 58. How we do spot clusters • Bid the on-demand price, pay the spot price • Fallback to on-demand instances if you can’t get spot • Monitor everything: jobs, clusters, spot market In the big data platform
  • 59. How we do spot clusters • Bid the on-demand price, pay the spot price • Fallback to on-demand instances if you can’t get spot • Monitor everything: jobs, clusters, spot market • ☞ Save up to 80% off the 
 on-demand price In the big data platform
  • 60. Switch hardware when the market gets volatile Monitor the spot price
  • 61. We like this strategy a lot! Cluster is oversubscribed; everyone waiting in line to do their work Lots of expensive hardware sits idle when everyone’s gone ✓ No waiting for the cluster you need ✓ No waste from hardware sitting idle ✓ Spot clusters are affordable enough to use everywhere
  • 63. Many things that disappear.
  • 65. Web and APIs Platform as a service CLI Jobs, Clusters, Schedules, Users, Code, Monitoring, Logs, and more
  • 66. Big Data Platform Architecture DATA Amazon S3
  • 67. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR
  • 68. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR WORKER Pig Workers Spark Workers Luigi Workers
  • 69. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR WORKER Pig Workers Spark Workers Luigi Workers STORAGE Metadata DB Queueing Logs
  • 70. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR WEB Web API WORKER Pig Workers Spark Workers Luigi Workers STORAGE Metadata DB Queueing Logs
  • 71. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR WEB Web API WORKER Pig Workers Spark Workers Luigi Workers USER CLI API Clients Job Scheduler STORAGE Metadata DB Queueing Logs
  • 72. Big Data Platform Architecture DATA Amazon S3 CLUSTER EMR WEB Web API WORKER Pig Workers Spark Workers Luigi Workers USER CLI API Clients Job Scheduler STORAGE Metadata DB Queueing Logs Datadog Monitoring
  • 73. How to find the right cluster when they disappear?
  • 74. Cluster tagging 
 for discovery #anomaly -detection #monitor-report
  • 75. How to monitor many disappearing clusters?
  • 76. Dashboards Monitors Dynamic Monitoring on Tags anomaly-detection cluster_tags: anomaly-detection
  • 77. How to debug problems when the cluster’s gone?
  • 78. Debugging In a Post-Cluster World
  • 79. Debugging In a Post-Cluster World Send all logs to S3 • HDFS • YARN • Pig • Spark
  • 80. Debugging In a Post-Cluster World Visualize the pipeline • Lipstick for Pig • Spark History Server • Luigi task flow Send all logs to S3 • HDFS • YARN • Pig • Spark
  • 81. Debugging In a Post-Cluster World Visualize the pipeline • Lipstick for Pig • Spark History Server • Luigi task flow Preserve historical monitoring data Keep history, by tag, after the cluster disappears Send all logs to S3 • HDFS • YARN • Pig • Spark
  • 82. How to handle certain cluster failure in your jobs?
  • 83. Automatic cleanup and restart Luigi: design for failure. A B
  • 84. Automatic cleanup and restart Luigi: design for failure. B
  • 85. Automatic cleanup and restart Luigi: design for failure. ❌
  • 86. Automatic cleanup and restart Luigi: design for failure.
  • 89. Recommendations 
 for Cloud Big Data • Use S3 for permanent data, not HDFS
  • 90. Recommendations 
 for Cloud Big Data • Use S3 for permanent data, not HDFS • Start from EMR if building yourself
  • 91. Recommendations 
 for Cloud Big Data • Use S3 for permanent data, not HDFS • Start from EMR if building yourself • Look into a PaaS: Netflix Genie, Qubole, Databricks
  • 92. Recommendations 
 for Cloud Big Data • Use S3 for permanent data, not HDFS • Start from EMR if building yourself • Look into a PaaS: Netflix Genie, Qubole, Databricks • Tag your clusters for dynamic monitoring
  • 93. Recommendations 
 for Cloud Big Data • Use S3 for permanent data, not HDFS • Start from EMR if building yourself • Look into a PaaS: Netflix Genie, Qubole, Databricks • Tag your clusters for dynamic monitoring • Design for failure with a workflow tool (Luigi, Airflow)
  • 94. Thanks! Want to work with us on Spark, Hadoop, Kafka, Parquet, and more? jobs.datadoghq.com DM me @ddaniels888 or doug@datadoghq.com
  • 95. Watch the video with slide synchronization on InfoQ.com! https://www.infoq.com/presentations/ datadog-cloud