SlideShare a Scribd company logo
1 of 22
Gordon: Using Flash Memory to
  Build Fast, Power-efficient
  Clusters for Data-intensive
         Applications



            Presenter: He Wang
    Department of Electrical and Computer
                Engineering
             University of Florida
Outline
• Motivation and Background

• Introduction to Gordon’s system
  architecture

• Gordon’s storage system

• Configuring Gordon
Wiki
• Gordon
  o A flash-based system architecture for massively parallel,
    data-centric computing

• Feature
  o Power efficiency
  o Performance advantage
  o Aimed at data-centric applications
Motivation and Background
• Challenges with large-scale data processing
  o Slowdown in uni-processor performance
  o Latency and BW bottleneck of HDD
  o Power constraints
• Improve performance and power efficiency
• Progresses
  o Programming model that parallelizing data-processing program
  o Increased BW and reduced latency with SSD
  o Recent power efficient processors
Motivation and Background(cont)

• Gordon
  o Programming system that parallelizing data-processing program(i.e.
    MapReduce)
     • Abstractions for specifying data-parallel compution
     • Automating the parallelism
  o SSD
     • Improved flash translation layer(FTL)
  o Power efficient processors
     • 100s or 1000s
     • simple interconnect
Gordon system architecture
• Gordon nodes
  o 256GB Flash mem, flash storage controller, 2GB SDRAM,
    1.9Ghz Intel Atom processor
  o Connected through 1Gb ethernet-style network
  o A standard rack hols 16 enclosures for 256 nodes with 64TB
    storage and 230GB/s I/O BW
  o Independent computer
      • OS
      • Network interfaces
Gordon system architecture
• Gordon nodes features
  o Power efficient
     • 19W to 81W
  o High BW
     • 900MB/S




                      Figure 1. Gordon system architecture
Storage system
• Key to power efficiency and performance
• Support Erase, Program, Read operations
• Reliability issue
   o Wear out, needs wear-leveling
• Flash translation layer(FTL)
Storage system

• Flash controller
  o Implements FTL
  o Link between CPU and flash array
     • Shared buses, up to 4 packages
Storage system
• Gordon FTL
  o Operate a write point
      • Pointer to a page of flash memory
  o Maintain a summary page in each block
      • Logical block address(LBA)-to-physical mapping
      • Benefit of this indirection
               • Address organization
               • Wear-leveling
• Working flow
  o Receive write command
  o Locate data by write point
  o update LBA table
Storage system

• Gordon FTL advantage---Write point
  o Original FTL has only one write point, no parrallel
  o Multiple write points with spread access
  o Sequence number
     • Avoid conflict with occupied write point
     • Assign the write point with smallest available
Storage system

• Gordon FTL advantage---super-page
  o Manage flash array with larger granularity with one write
    point for each
  o Horizontal striping
  o Vertical striping
  o 2D striping
Storage system
• Super-page stripping approaches




    Figure 2. Three approachs to striping data across flash arrays
Storage system

• Super-page
  o Pros
    • Reduced overhead
  o Cons
    • Latency for sub-page access
    • Wear out effect larger portion
Storage system
• Super-page evaluation




         Figure 3. Flash storage array performance
Configuring Gordon
• Workloads
  o Benchmarks that use MapReduce
• Power model
  o Direct mesure of a running system
  o Datasheet




  P = IdlePower * (1-ActivityFactor) + ActivePower * ActivityFactor
Configuring Gordon
• Measuring cluster performance
  o High-level simulator to measure overall performance
     • Model 32 node by running 4 Vmware on 8 servers
         o Sync mode, provides upper bound of exe time
         o nosync mode, provides lower bound
  o Storage simulator
Configuring Gordon
• Parato-optimal Gordon system design




                  Figure 6. Parato-optimal Gordon system designs
Configuring Gordon
• Optimal Gordon configurations
 Out-perform disk-based by 1.5X and deliver 2.5X more performance per watt




                 Figure 5. Optimal Gordon configuration
Configuring Gordon
• Gordon power consumption
  o   MaxE-flash consumes 40% of the energy of the disk-based configuration
  o   A factor of two increase in performance




                        Figure 6. Relative energy consumption
Discussions

• Exploit disks for cheap redundancy
• Virtualizing Gordon
Thanks

More Related Content

What's hot

Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit
 
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
VMware Tanzu
 
Thrashing allocation frames.43
Thrashing allocation frames.43Thrashing allocation frames.43
Thrashing allocation frames.43
myrajendra
 
41 page replacement fifo
41 page replacement fifo41 page replacement fifo
41 page replacement fifo
myrajendra
 

What's hot (20)

Distributed Processing Frameworks
Distributed Processing FrameworksDistributed Processing Frameworks
Distributed Processing Frameworks
 
Imetal
ImetalImetal
Imetal
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
 
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
 
MariaDB ColumnStore
MariaDB ColumnStoreMariaDB ColumnStore
MariaDB ColumnStore
 
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
Achieve Extreme Simplicity and Superior Price/Performance with Greenplum Buil...
 
PGConf.ASIA 2019 Bali - Upcoming Features in PostgreSQL 12 - John Naylor
PGConf.ASIA 2019 Bali - Upcoming Features in PostgreSQL 12 - John NaylorPGConf.ASIA 2019 Bali - Upcoming Features in PostgreSQL 12 - John Naylor
PGConf.ASIA 2019 Bali - Upcoming Features in PostgreSQL 12 - John Naylor
 
Thrashing allocation frames.43
Thrashing allocation frames.43Thrashing allocation frames.43
Thrashing allocation frames.43
 
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
 
20190909_PGconf.ASIA_KaiGai
20190909_PGconf.ASIA_KaiGai20190909_PGconf.ASIA_KaiGai
20190909_PGconf.ASIA_KaiGai
 
In-memory database
In-memory databaseIn-memory database
In-memory database
 
Public Cloud Performance Measurement Report
Public Cloud Performance Measurement ReportPublic Cloud Performance Measurement Report
Public Cloud Performance Measurement Report
 
DB2 10 & 11 for z/OS System Performance Monitoring and Optimisation
DB2 10 & 11 for z/OS System Performance Monitoring and OptimisationDB2 10 & 11 for z/OS System Performance Monitoring and Optimisation
DB2 10 & 11 for z/OS System Performance Monitoring and Optimisation
 
In-Memory Computing: How, Why? and common Patterns
In-Memory Computing: How, Why? and common PatternsIn-Memory Computing: How, Why? and common Patterns
In-Memory Computing: How, Why? and common Patterns
 
Optimizing MariaDB for maximum performance
Optimizing MariaDB for maximum performanceOptimizing MariaDB for maximum performance
Optimizing MariaDB for maximum performance
 
Kafka streams fifth elephant 2018
Kafka streams fifth elephant 2018Kafka streams fifth elephant 2018
Kafka streams fifth elephant 2018
 
505 kobal exadata
505 kobal exadata505 kobal exadata
505 kobal exadata
 
SPE effiency on modern hardware paper presentation
SPE effiency on modern hardware   paper presentationSPE effiency on modern hardware   paper presentation
SPE effiency on modern hardware paper presentation
 
Intro to column stores
Intro to column storesIntro to column stores
Intro to column stores
 
41 page replacement fifo
41 page replacement fifo41 page replacement fifo
41 page replacement fifo
 

Similar to Presentation gordon

Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014
marvin herrera
 
stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4
Gaurav "GP" Pal
 
The Secret Guide to Cloud Performance - Cloudlook
The Secret Guide to Cloud Performance - CloudlookThe Secret Guide to Cloud Performance - Cloudlook
The Secret Guide to Cloud Performance - Cloudlook
gidgreen
 

Similar to Presentation gordon (20)

Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014Colvin exadata mistakes_ioug_2014
Colvin exadata mistakes_ioug_2014
 
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and ChefDevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
DevOps for ETL processing at scale with MongoDB, Solr, AWS and Chef
 
stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4stackArmor presentation for DevOpsDC ver 4
stackArmor presentation for DevOpsDC ver 4
 
Taking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout SessionTaking Splunk to the Next Level - Architecture Breakout Session
Taking Splunk to the Next Level - Architecture Breakout Session
 
Tuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy WorkloadTuning Linux Windows and Firebird for Heavy Workload
Tuning Linux Windows and Firebird for Heavy Workload
 
August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation August 2013 HUG: Removing the NameNode's memory limitation
August 2013 HUG: Removing the NameNode's memory limitation
 
Accelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket CacheAccelerating HBase with NVMe and Bucket Cache
Accelerating HBase with NVMe and Bucket Cache
 
MongoDB Internals
MongoDB InternalsMongoDB Internals
MongoDB Internals
 
Running MongoDB 3.0 on AWS
Running MongoDB 3.0 on AWSRunning MongoDB 3.0 on AWS
Running MongoDB 3.0 on AWS
 
Linux tuning to improve PostgreSQL performance
Linux tuning to improve PostgreSQL performanceLinux tuning to improve PostgreSQL performance
Linux tuning to improve PostgreSQL performance
 
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten RachfahlStorage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
Storage Spaces Direct - the new Microsoft SDS star - Carsten Rachfahl
 
Accelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cacheAccelerating hbase with nvme and bucket cache
Accelerating hbase with nvme and bucket cache
 
The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)The state of Hive and Spark in the Cloud (July 2017)
The state of Hive and Spark in the Cloud (July 2017)
 
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
AWS Summit London 2014 | Uses and Best Practices for Amazon Redshift (200)
 
The Secret Guide to Cloud Performance - Cloudlook
The Secret Guide to Cloud Performance - CloudlookThe Secret Guide to Cloud Performance - Cloudlook
The Secret Guide to Cloud Performance - Cloudlook
 
Presentation database on flash
Presentation   database on flashPresentation   database on flash
Presentation database on flash
 
week1slides1704202828322.pdf
week1slides1704202828322.pdfweek1slides1704202828322.pdf
week1slides1704202828322.pdf
 
071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen071410 sun a_1515_feldman_stephen
071410 sun a_1515_feldman_stephen
 
To Serverless and Beyond
To Serverless and BeyondTo Serverless and Beyond
To Serverless and Beyond
 
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar AhmedPGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
PGConf.ASIA 2019 Bali - Tune Your LInux Box, Not Just PostgreSQL - Ibrar Ahmed
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

Presentation gordon

  • 1. Gordon: Using Flash Memory to Build Fast, Power-efficient Clusters for Data-intensive Applications Presenter: He Wang Department of Electrical and Computer Engineering University of Florida
  • 2. Outline • Motivation and Background • Introduction to Gordon’s system architecture • Gordon’s storage system • Configuring Gordon
  • 3. Wiki • Gordon o A flash-based system architecture for massively parallel, data-centric computing • Feature o Power efficiency o Performance advantage o Aimed at data-centric applications
  • 4. Motivation and Background • Challenges with large-scale data processing o Slowdown in uni-processor performance o Latency and BW bottleneck of HDD o Power constraints • Improve performance and power efficiency • Progresses o Programming model that parallelizing data-processing program o Increased BW and reduced latency with SSD o Recent power efficient processors
  • 5. Motivation and Background(cont) • Gordon o Programming system that parallelizing data-processing program(i.e. MapReduce) • Abstractions for specifying data-parallel compution • Automating the parallelism o SSD • Improved flash translation layer(FTL) o Power efficient processors • 100s or 1000s • simple interconnect
  • 6. Gordon system architecture • Gordon nodes o 256GB Flash mem, flash storage controller, 2GB SDRAM, 1.9Ghz Intel Atom processor o Connected through 1Gb ethernet-style network o A standard rack hols 16 enclosures for 256 nodes with 64TB storage and 230GB/s I/O BW o Independent computer • OS • Network interfaces
  • 7. Gordon system architecture • Gordon nodes features o Power efficient • 19W to 81W o High BW • 900MB/S Figure 1. Gordon system architecture
  • 8. Storage system • Key to power efficiency and performance • Support Erase, Program, Read operations • Reliability issue o Wear out, needs wear-leveling • Flash translation layer(FTL)
  • 9. Storage system • Flash controller o Implements FTL o Link between CPU and flash array • Shared buses, up to 4 packages
  • 10. Storage system • Gordon FTL o Operate a write point • Pointer to a page of flash memory o Maintain a summary page in each block • Logical block address(LBA)-to-physical mapping • Benefit of this indirection • Address organization • Wear-leveling • Working flow o Receive write command o Locate data by write point o update LBA table
  • 11. Storage system • Gordon FTL advantage---Write point o Original FTL has only one write point, no parrallel o Multiple write points with spread access o Sequence number • Avoid conflict with occupied write point • Assign the write point with smallest available
  • 12. Storage system • Gordon FTL advantage---super-page o Manage flash array with larger granularity with one write point for each o Horizontal striping o Vertical striping o 2D striping
  • 13. Storage system • Super-page stripping approaches Figure 2. Three approachs to striping data across flash arrays
  • 14. Storage system • Super-page o Pros • Reduced overhead o Cons • Latency for sub-page access • Wear out effect larger portion
  • 15. Storage system • Super-page evaluation Figure 3. Flash storage array performance
  • 16. Configuring Gordon • Workloads o Benchmarks that use MapReduce • Power model o Direct mesure of a running system o Datasheet P = IdlePower * (1-ActivityFactor) + ActivePower * ActivityFactor
  • 17. Configuring Gordon • Measuring cluster performance o High-level simulator to measure overall performance • Model 32 node by running 4 Vmware on 8 servers o Sync mode, provides upper bound of exe time o nosync mode, provides lower bound o Storage simulator
  • 18. Configuring Gordon • Parato-optimal Gordon system design Figure 6. Parato-optimal Gordon system designs
  • 19. Configuring Gordon • Optimal Gordon configurations Out-perform disk-based by 1.5X and deliver 2.5X more performance per watt Figure 5. Optimal Gordon configuration
  • 20. Configuring Gordon • Gordon power consumption o MaxE-flash consumes 40% of the energy of the disk-based configuration o A factor of two increase in performance Figure 6. Relative energy consumption
  • 21. Discussions • Exploit disks for cheap redundancy • Virtualizing Gordon