Running YARN at Scale

At Yahoo! over the past year we have helped migrate hundreds of our grids' users to YARN. Our YARN clusters have in aggregate run over 18 million jobs with more than 3 billion tasks, consuming over 10 thousand years of compute time, with a single cluster running 90 thousand jobs a day. From this experience we would like to share what we have learned about running YARN well, how this differs from running a 1.0-based cluster, and what it takes to migrate your jobs from 1.0 to YARN.

Our Experience Running YARN at Scale
Bobby Evans

Agenda
• Who We Are
• Some Background on YARN and YARN at Yahoo!
• What Was Not So Good
• What Was Good

Who I Am
Robert (Bobby) Evans
• Technical Lead @ Yahoo!
• Apache Hadoop Committer and PMC Member
• Past
– Hardware Design
– Linux Kernel and Device Driver Development
– Machine Learning on Hadoop
• Current
– Hadoop Core Development (MapReduce and YARN)
– TEZ, Storm and Spark

Who I Represent
• Yahoo! Hadoop Team
– We are over 40 people developing, maintaining and
supporting a complete Hadoop stack including
Pig, Hive, HBase, Oozie, and HCatalog.
• Hadoop Users @ Yahoo!

Hadoop Releases
[Timeline figure, 2007 to 2013, showing release lines 0.14.X, 0.15.X, 0.16.X, 0.17.X, 0.18.X, 0.19.X, 0.20.X, 0.21.X, 0.20.2X, 0.22.X, 0.23.X, 1.X and 2.X, with the labels Security, YARN and HDFS HA.]
Source: http://hadoop.apache.org/releases.html http://is.gd/axRlgJ

Yahoo! Scale
• About 40,000 Nodes Running Hadoop.
• Around 500,000 Map/Reduce jobs a day.
• Consuming in excess of 230 compute years
every single day.
• Over 350 PB of Storage.
• On 0.23.X we have over 20,000 years of
compute time under our belts.
http://www.flickr.com/photos/doctorow/2699158636/

YARN Architecture
http://www.flickr.com/photos/bradhoc/7343761514/

The AM Runs on Unreliable Hardware
• Split Brain/AM Recovery (FIXED for MR but not perfect)
– For anyone else writing a YARN app, be aware you
have to handle this.
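
One common way to guard against split brain, sketched here in Python, is a fencing check: the shared state an application writes to remembers the highest AM attempt number it has seen and rejects writes from older attempts. This is a generic technique, not a description of how the MR AM handles recovery, and the class and method names are hypothetical.

```python
class FencedStore:
    """Hypothetical shared store that rejects writes from stale AM attempts."""

    def __init__(self):
        self.latest_attempt = 0  # highest AM attempt number seen so far
        self.committed = {}      # task id -> (attempt number, result)

    def commit(self, am_attempt, task_id, result):
        """Accept the write only if it comes from the newest known AM attempt."""
        if am_attempt < self.latest_attempt:
            return False  # an older AM is still running: fence it off
        self.latest_attempt = am_attempt
        self.committed[task_id] = (am_attempt, result)
        return True


store = FencedStore()
first_ok = store.commit(am_attempt=1, task_id="task_0", result="a")
second_ok = store.commit(am_attempt=2, task_id="task_0", result="b")  # restarted AM
stale_ok = store.commit(am_attempt=1, task_id="task_1", result="c")   # old AM wakes up
```

In this run the write from the stale first attempt is refused once the second attempt has been seen, so two AMs cannot both believe they own the job's output.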

The AM Runs on Unreliable Hardware
• Debugging the AM is hard when it does crash.
• AM can get overwhelmed if it is on a slow node or the
job is very large.
• Tuning the AM is difficult to get right for large jobs.
– Be sure to tune the heap/container size. 1GB heap
can fit about 100,000 task attempts in memory
(25,000 tasks worst case).
http://www.flickr.com/photos/cushinglibrary/3963200463/
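
The heap rule of thumb above can be turned into a rough estimate. The sketch below assumes memory use scales linearly with task attempts and that the worst case is 4 attempts per task (which is what 100,000 attempts for 25,000 tasks implies); both are assumptions, and the function name is hypothetical. In Hadoop 2.x the MR AM container size and JVM options are set with `yarn.app.mapreduce.am.resource.mb` and `yarn.app.mapreduce.am.command-opts`; check the names and defaults for your release.

```python
import math

ATTEMPTS_PER_GB = 100_000         # from the slide: ~100,000 task attempts per 1 GB heap
WORST_CASE_ATTEMPTS_PER_TASK = 4  # implied by "25,000 tasks worst case"


def estimate_am_heap_gb(num_tasks, attempts_per_task=WORST_CASE_ATTEMPTS_PER_TASK):
    """Rough AM heap estimate in whole GB, assuming linear scaling (an assumption)."""
    attempts = num_tasks * attempts_per_task
    return max(1, math.ceil(attempts / ATTEMPTS_PER_GB))


small_job = estimate_am_heap_gb(25_000)  # 100,000 attempts worst case
large_job = estimate_am_heap_gb(60_000)  # 240,000 attempts worst case
```

The container size should be set somewhat larger than the heap to leave room for non-heap JVM memory.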

Lack of Flow Control
• Both AM and RM based on an asynchronous event
framework that has no flow control.
http://www.flickr.com/photos/iz4aks/4085305231/
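
As a generic illustration of flow control (not the YARN event dispatcher itself), the sketch below uses a bounded queue from Python's standard `queue` module: once the backlog is full, producers are told to back off instead of the queue growing without limit. The queue size and event names are made up.

```python
import queue

events = queue.Queue(maxsize=3)  # bounded: at most 3 pending events


def try_publish(event):
    """Enqueue without blocking; report whether the event was accepted."""
    try:
        events.put_nowait(event)
        return True
    except queue.Full:
        return False  # backpressure: the caller must slow down or retry


# With no consumer draining the queue, only the first 3 events fit.
accepted = [try_publish(f"heartbeat-{i}") for i in range(5)]

events.get_nowait()  # the consumer handles one event, freeing a slot
accepted_after_drain = try_publish("heartbeat-5")
```

An unbounded queue would accept all five events and simply fall further behind, which is the failure mode an overwhelmed AM or RM runs into.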

Name Node Load
• YARN launches tasks faster than 1.0
• MR keeps a running history log for recovery
• Log Aggregation.
– 7 days of aggregated logs used up approximately
30% of the total namespace.
• 50% higher write load on HDFS for the same
jobs
• 160% more rename operations
• 60% more create, addBlock and fsync
operations

Web UI
• Resource Manager and History Server Forget Apps too Quickly
• Browser/Javascript Heavy
• Follows the YARN model, so it can be confusing for those used to
the old UI.

Binary Incompatibility
• Map/Reduce APIs are not binary compatible between 1.0
and 0.23. They are source compatible, though, so only a
recompile is required.

Operability
“The issues were not with
incompatibilities, but coupling between
applications and check-offs.”
-- Rajiv Chittajallu

Performance
Tests run on a 350-node cluster on top of JDK 1.6.0

Benchmark                                  1.0.2    0.23.3   Improvement
Sort (GB/s throughput)                     2.26     2.35     4%
Sort with compression (GB/s throughput)    4.5      4.5      0%
Shuffle (mean shuffle time, secs)          303.8    263.5    13%
Scan (GB/s throughput)                     25.2     22.9     -9%
Gridmix 3 replay (runtime, secs)           2817     2668     5%
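
The improvement column can be reproduced from the raw numbers. In the sketch below, throughput metrics count as improved when they rise and time metrics when they fall; that sign convention is inferred from the table, and the helper name is hypothetical.

```python
def improvement_pct(old, new, higher_is_better):
    """Percent improvement of `new` over `old`, rounded to a whole percent."""
    change = (new - old) if higher_is_better else (old - new)
    return round(100 * change / old)


results = {
    "Sort": improvement_pct(2.26, 2.35, higher_is_better=True),
    "Sort with compression": improvement_pct(4.5, 4.5, higher_is_better=True),
    "Shuffle": improvement_pct(303.8, 263.5, higher_is_better=False),
    "Scan": improvement_pct(25.2, 22.9, higher_is_better=True),
    "Gridmix 3 replay": improvement_pct(2817, 2668, higher_is_better=False),
}
```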

Web Services/Log Aggregation
• No more scraping of web pages needed
– Resource Manager
– Node Managers
– History Server
– MR App Master
• Deep analysis of log output using Map/Reduce
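
As an example of using the web services instead of scraping, the sketch below summarizes applications by state from the Resource Manager's REST resource `/ws/v1/cluster/apps`. The host name, the default port 8088, and the sample payload are assumptions for illustration; the helper names are hypothetical, and field availability may differ between releases.

```python
import json
from collections import Counter
from urllib.request import Request, urlopen


def apps_url(rm_host, rm_port=8088):
    """URL of the Resource Manager's cluster applications REST resource."""
    return f"http://{rm_host}:{rm_port}/ws/v1/cluster/apps"


def count_by_state(payload):
    """Count applications per state in a decoded cluster apps response."""
    apps = (payload.get("apps") or {}).get("app", [])
    return Counter(app["state"] for app in apps)


def fetch_state_counts(rm_host):
    """Fetch and summarize live data; needs a reachable Resource Manager."""
    request = Request(apps_url(rm_host), headers={"Accept": "application/json"})
    with urlopen(request) as response:
        return count_by_state(json.load(response))


# Made-up sample payload in the shape the service returns, for illustration only.
sample = json.loads(
    '{"apps": {"app": ['
    '{"id": "application_1_0001", "state": "FINISHED"},'
    '{"id": "application_1_0002", "state": "RUNNING"},'
    '{"id": "application_1_0003", "state": "FINISHED"}]}}'
)
counts = count_by_state(sample)
```

Against a live cluster, `fetch_state_counts("rm.example.com")` would return the same kind of summary, where `rm.example.com` stands in for your Resource Manager host.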

Non-MapReduce Applications*
• Storm
• TEZ
• Spark
• …
* Coming Soon

Total Capacity
Our most heavily used cluster was able to increase from
80,000 jobs a day to 125,000 jobs a day.
That is more than a 50% increase. It is like we bought over
1000 new servers and added them to the cluster.
This is primarily due to the removal of the artificial split
between maps and reduces, but also because the Job
Tracker could not keep up with tracking/launching all the
tasks.
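
The effect of removing the fixed map/reduce split can be illustrated with a toy model. The slot and container counts below are invented for illustration and are not Yahoo!'s cluster configuration: a node with 6 map slots and 4 reduce slots is compared with a node offering 10 generic containers while the pending work is all maps. The last line checks the jobs-per-day increase quoted above.

```python
def running_with_split(map_slots, reduce_slots, pending_maps, pending_reduces):
    """Tasks that can run at once when slots are dedicated to one task type."""
    return min(map_slots, pending_maps) + min(reduce_slots, pending_reduces)


def running_unified(containers, pending_maps, pending_reduces):
    """Tasks that can run at once when any container can host any task."""
    return min(containers, pending_maps + pending_reduces)


split = running_with_split(6, 4, pending_maps=20, pending_reduces=0)
unified = running_unified(10, pending_maps=20, pending_reduces=0)

# Jobs-per-day increase reported on the slide: 80,000 -> 125,000.
increase_pct = 100 * (125_000 - 80_000) / 80_000
```

In this toy case the split node leaves its 4 reduce slots idle while maps wait, whereas the unified node keeps all 10 containers busy.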

Conclusion
Upgrading to 0.23 from 1.0 took a lot of planning and effort.
Most of that was stabilization and hardening of Hadoop for
the scale that we run at, but it was worth it.

Questions?

Editor's Notes

The MR AM abandons containers that were already running. Testing recovery code is difficult because it is a path rarely taken.
Uber-AM also saw big performance gains for small jobs. We have run other performance tests, but most of them are on different hardware and compare different versions of 0.23. Sorry, we are not going to release the code for the benchmarks.
That is more than a 50% increase.