SlideShare a Scribd company logo
1 of 15
Download to read offline
Cumulus:
 Filesystem Backup to the Cloud
Michael Vrable, Stefan Savage, and Geoffrey M.
                    Voelker



      Presented by Hakim Weatherspoon
Good „ole Days


     Replace your tape drives
     with something truly scalable
Amazon S3 to the rescue
In Spring 2006, Amazon released a new
storage API: Put, Get, List, Delete




           Build whatever you want!

                      Quickly
Backing up the new way (S3)
Smart




    Scales


     - no longer our concern... Amazon's concern
     - all servers backup in parallel
Cheap

     - old cost =XXXper year
     - new cost =YYYper year
         where YYY < XXX
     
Thin vs Thick Cloud
• E.g. Amazons S3 vs EMC‟s MozyPro
• Thin
  – Can change provider easier
  – Applications can work across providers
• Thick
  – Better performance
  – Locked into a provider
  – Provider can go out of business
Cumulus
• Simple storage backup utility for Thin Clouds
• Evaluates efficacy of cloud storage
• Working prototype
  – http://www.cs.ucsd.edu/~mvrable/cumulus/
Outline
•   Motivation/Intro
•   Related Work
•   Design
•   Evaluation
•   Thoughts and Conclusions
Related Work
Outline
• Motivation/Intro
• Related Work
• Design
  –   API
  –   Segments
  –   Snapshots
  –   Subfileincrementals
  –   Cleaning
  –   restoring
• Evaluation
• Thoughts and Conclusions
API
• Same as S3
  – Put, Get, List, Delete


• Thin cloud – does not rely on integrated services
  – Can easily change provider and network protocols
  – S3, FTP, SFTP


• WORM Model
  – Write-once, read-many
  – Requires writing new entirely file if changes occur
  – What are the cleaning overheads?
Segments
• Aggregation via Segment Goals
  – Avoid costs due to small files
     • S3 charges on per file bases
     • Many small files
  – Avoid costs in network protocols
     • Small files have higher latency and other overheads
  – Compression
     • inter-file similarities
  – Privacy
     • Hide file boundaries
• Negative consequences?
  – Need an entire segment to write
Snapshots
Sub-File Incrementals
• Only stored changed part of files
• New snapshots point to old objects when data
  unchanged
  • Byte ranges – portions of old objcts to be reused
Segment Cleaning
• Similar to a log-structured file system (LFS)
• Clean based on utilization of segment, α
  – α = 0, no cleaning
  – α = 1, clean with the slightest change
• Cumulus
  –   attempts to find an equilibrium for α
  –   Uses a different process to clean
  –   Marks a local database as “expired”
  –   Then, next snapshot will not refer to expired segment
Restore
• Full Restore
  – Download all segments for a snapshot
• Partial Restore
  – Download snapshot descriptor, metadata, and only
    necessary segments


• What happens if client machine dies?
• How is latest snapshot descriptors identified?
• What about sharing between client machines?

More Related Content

What's hot

Load Balancing
Load BalancingLoad Balancing
Load Balancingnashniv
 
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScyllaDB
 
eZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingeZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingBertrand Dunogier
 
Cloud brew cloudcamp
Cloud brew cloudcampCloud brew cloudcamp
Cloud brew cloudcampHenry Been
 
Selecting the Right Cloud Host
Selecting the Right Cloud HostSelecting the Right Cloud Host
Selecting the Right Cloud HostAhsan Saleem
 
Windows Azure Service Bus
Windows Azure Service BusWindows Azure Service Bus
Windows Azure Service BusPavel Revenkov
 
AWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAmazon Web Services
 
A brief introduction to CloudFormation
A brief introduction to CloudFormationA brief introduction to CloudFormation
A brief introduction to CloudFormationSWIFTotter Solutions
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPythian
 
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte IEscalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte ILeandro Silva
 
Experiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlExperiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlOkis Chuang
 
Getting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex QueriesGetting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex Queriestimonk
 
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore
 
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...DataStax Academy
 
Microsoft Azure Media Services
Microsoft Azure Media ServicesMicrosoft Azure Media Services
Microsoft Azure Media ServicesPavel Revenkov
 
Heap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesHeap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesKanika Gera
 
Pagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationPagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationRalf Schwoebel
 

What's hot (20)

Load Balancing
Load BalancingLoad Balancing
Load Balancing
 
Aws, an intro to startups
Aws, an intro to startupsAws, an intro to startups
Aws, an intro to startups
 
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop SeamonstersScylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
Scylla Summit 2018: Meshify - A Case Study, or Petshop Seamonsters
 
eZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content PublishingeZ Publish Asynchronous Content Publishing
eZ Publish Asynchronous Content Publishing
 
Cloud brew cloudcamp
Cloud brew cloudcampCloud brew cloudcamp
Cloud brew cloudcamp
 
Architecture et coût
Architecture et coûtArchitecture et coût
Architecture et coût
 
Selecting the Right Cloud Host
Selecting the Right Cloud HostSelecting the Right Cloud Host
Selecting the Right Cloud Host
 
Windows Azure Service Bus
Windows Azure Service BusWindows Azure Service Bus
Windows Azure Service Bus
 
AWS Customer Presentation - HotPads
AWS Customer Presentation - HotPadsAWS Customer Presentation - HotPads
AWS Customer Presentation - HotPads
 
A brief introduction to CloudFormation
A brief introduction to CloudFormationA brief introduction to CloudFormation
A brief introduction to CloudFormation
 
Percona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWSPercona Live 2014 - Scaling MySQL in AWS
Percona Live 2014 - Scaling MySQL in AWS
 
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte IEscalabilidade com Lambda e Elastic Beanstalk – Parte I
Escalabilidade com Lambda e Elastic Beanstalk – Parte I
 
Experiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and PostgresqlExperiences sharing about Lambda, Kinesis, and Postgresql
Experiences sharing about Lambda, Kinesis, and Postgresql
 
Getting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex QueriesGetting Maximum Performance from Amazon Redshift: Complex Queries
Getting Maximum Performance from Amazon Redshift: Complex Queries
 
Performance stack
Performance stackPerformance stack
Performance stack
 
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePointRencore Webinar: Developing Secure and Performant JavaScript for SharePoint
Rencore Webinar: Developing Secure and Performant JavaScript for SharePoint
 
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
Cassandra Day Atlanta 2015: Recording the Web: High-Fidelity Storage and Play...
 
Microsoft Azure Media Services
Microsoft Azure Media ServicesMicrosoft Azure Media Services
Microsoft Azure Media Services
 
Heap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World IssuesHeap Dump Analysis - AEM: Real World Issues
Heap Dump Analysis - AEM: Real World Issues
 
Pagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index OptimizationPagespeed SEO 2018: Google Mobile Index Optimization
Pagespeed SEO 2018: Google Mobile Index Optimization
 

Viewers also liked (7)

Creating Resuable Learning Objects for Your Classroom
Creating Resuable Learning Objects for Your ClassroomCreating Resuable Learning Objects for Your Classroom
Creating Resuable Learning Objects for Your Classroom
 
2008 Q1 Google Earnings Slides
2008 Q1 Google Earnings Slides2008 Q1 Google Earnings Slides
2008 Q1 Google Earnings Slides
 
20070930 Google 10 Q
20070930 Google 10 Q20070930 Google 10 Q
20070930 Google 10 Q
 
citigroup October 16, 2008 - Third Quarter Financial Supplement
citigroup October 16, 2008 - Third Quarter  Financial Supplementcitigroup October 16, 2008 - Third Quarter  Financial Supplement
citigroup October 16, 2008 - Third Quarter Financial Supplement
 
Janneth
JannethJanneth
Janneth
 
Ebay News 2002 7 18 Earnings
Ebay News 2002 7 18 EarningsEbay News 2002 7 18 Earnings
Ebay News 2002 7 18 Earnings
 
Web Development or SDK?
Web Development or SDK?Web Development or SDK?
Web Development or SDK?
 

Similar to Quixote

Solaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningSolaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningAdrian Cockcroft
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11CloudExpoEurope
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11aseager
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11aseager
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentationandyman3000
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationAmazon Web Services
 
Our Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudOur Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudHostedbyConfluent
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterTim Lossen
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon AuroraAmazon Web Services
 
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Andrejs Prokopjevs
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics PlatformSantanu Dey
 
Getting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoGetting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoAmazon Web Services
 
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksDeep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksAmazon Web Services
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailInternet World
 
Benchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsBenchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsdata://disrupted®
 
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Andrejs Prokopjevs
 
Perfmon And Profiler 101
Perfmon And Profiler 101Perfmon And Profiler 101
Perfmon And Profiler 101Quest Software
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon AuroraAmazon Web Services
 

Similar to Quixote (20)

Solaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and TuningSolaris Linux Performance, Tools and Tuning
Solaris Linux Performance, Tools and Tuning
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Data storage for the cloud ce11
Data storage for the cloud ce11Data storage for the cloud ce11
Data storage for the cloud ce11
 
Magee Dday2 Fixing App Performance Italiano
Magee Dday2 Fixing App Performance ItalianoMagee Dday2 Fixing App Performance Italiano
Magee Dday2 Fixing App Performance Italiano
 
Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard Consolidation
 
Our Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent CloudOur Multi-Year Journey to a 10x Faster Confluent Cloud
Our Multi-Year Journey to a 10x Faster Confluent Cloud
 
How to build a state-of-the-art rails cluster
How to build a state-of-the-art rails clusterHow to build a state-of-the-art rails cluster
How to build a state-of-the-art rails cluster
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon Aurora
 
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)Running Oracle EBS in the cloud (UKOUG APPS16 edition)
Running Oracle EBS in the cloud (UKOUG APPS16 edition)
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Getting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoGetting started with amazon aurora - Toronto
Getting started with amazon aurora - Toronto
 
Amazon Aurora (Debanjan Saha) - AWS DB Day
Amazon Aurora (Debanjan Saha) - AWS DB DayAmazon Aurora (Debanjan Saha) - AWS DB Day
Amazon Aurora (Debanjan Saha) - AWS DB Day
 
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech TalksDeep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
Deep Dive on Amazon EC2 Instances - January 2017 AWS Online Tech Talks
 
Storage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, WhiptailStorage and performance- Batch processing, Whiptail
Storage and performance- Batch processing, Whiptail
 
Benchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public cloudsBenchmarking your cloud performance with top 4 global public clouds
Benchmarking your cloud performance with top 4 global public clouds
 
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
Running Oracle EBS in the cloud (OAUG Collaborate 18 edition)
 
Perfmon And Profiler 101
Perfmon And Profiler 101Perfmon And Profiler 101
Perfmon And Profiler 101
 
Getting Started with Amazon Aurora
Getting Started with Amazon AuroraGetting Started with Amazon Aurora
Getting Started with Amazon Aurora
 

More from ceiparua

Rosalia de castro
Rosalia de castroRosalia de castro
Rosalia de castroceiparua
 
Moluscomania
MoluscomaniaMoluscomania
Moluscomaniaceiparua
 
Roni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoRoni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoceiparua
 
Roni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseRoni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseceiparua
 
Traballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias CorreaTraballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias Correaceiparua
 
Competencias Basicas Xunta
Competencias Basicas XuntaCompetencias Basicas Xunta
Competencias Basicas Xuntaceiparua
 
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08ceiparua
 
Reciclamos
ReciclamosReciclamos
Reciclamosceiparua
 
Comparámonos
ComparámonosComparámonos
Comparámonosceiparua
 
Carapuchiña vermella xigante
Carapuchiña vermella xiganteCarapuchiña vermella xigante
Carapuchiña vermella xiganteceiparua
 
Cartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez PazCartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez Pazceiparua
 
Empusa Pennata
Empusa PennataEmpusa Pennata
Empusa Pennataceiparua
 
Horta A RúA 08
Horta A RúA 08Horta A RúA 08
Horta A RúA 08ceiparua
 
FelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAFelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAceiparua
 

More from ceiparua (14)

Rosalia de castro
Rosalia de castroRosalia de castro
Rosalia de castro
 
Moluscomania
MoluscomaniaMoluscomania
Moluscomania
 
Roni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadandoRoni e Carapuchiña xigante namóranse nadando
Roni e Carapuchiña xigante namóranse nadando
 
Roni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranseRoni e Carapuchiña xigante namóranse
Roni e Carapuchiña xigante namóranse
 
Traballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias CorreaTraballo por proxectos, de Azucena Arias Correa
Traballo por proxectos, de Azucena Arias Correa
 
Competencias Basicas Xunta
Competencias Basicas XuntaCompetencias Basicas Xunta
Competencias Basicas Xunta
 
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
Isabel Monguilot Abeti.Competencias BáSicas. Galicia. 11 4 08
 
Reciclamos
ReciclamosReciclamos
Reciclamos
 
Comparámonos
ComparámonosComparámonos
Comparámonos
 
Carapuchiña vermella xigante
Carapuchiña vermella xiganteCarapuchiña vermella xigante
Carapuchiña vermella xigante
 
Cartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez PazCartas de felicitación a Agustín Fdez Paz
Cartas de felicitación a Agustín Fdez Paz
 
Empusa Pennata
Empusa PennataEmpusa Pennata
Empusa Pennata
 
Horta A RúA 08
Horta A RúA 08Horta A RúA 08
Horta A RúA 08
 
FelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúAFelicitacióN Nadal 08 Ceip A RúA
FelicitacióN Nadal 08 Ceip A RúA
 

Recently uploaded

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 

Quixote

  • 1. Cumulus: Filesystem Backup to the Cloud Michael Vrable, Stefan Savage, and Geoffrey M. Voelker Presented by Hakim Weatherspoon
  • 2. Good „ole Days Replace your tape drives with something truly scalable
  • 3. Amazon S3 to the rescue In Spring 2006, Amazon released a new storage API: Put, Get, List, Delete Build whatever you want! Quickly
  • 4. Backing up the new way (S3) Smart Scales  - no longer our concern... Amazon's concern - all servers backup in parallel Cheap - old cost =XXXper year - new cost =YYYper year where YYY < XXX 
  • 5. Thin vs Thick Cloud • E.g. Amazons S3 vs EMC‟s MozyPro • Thin – Can change provider easier – Applications can work across providers • Thick – Better performance – Locked into a provider – Provider can go out of business
  • 6. Cumulus • Simple storage backup utility for Thin Clouds • Evaluates efficacy of cloud storage • Working prototype – http://www.cs.ucsd.edu/~mvrable/cumulus/
  • 7. Outline • Motivation/Intro • Related Work • Design • Evaluation • Thoughts and Conclusions
  • 9. Outline • Motivation/Intro • Related Work • Design – API – Segments – Snapshots – Subfileincrementals – Cleaning – restoring • Evaluation • Thoughts and Conclusions
  • 10. API • Same as S3 – Put, Get, List, Delete • Thin cloud – does not rely on integrated services – Can easily change provider and network protocols – S3, FTP, SFTP • WORM Model – Write-once, read-many – Requires writing new entirely file if changes occur – What are the cleaning overheads?
  • 11. Segments • Aggregation via Segment Goals – Avoid costs due to small files • S3 charges on per file bases • Many small files – Avoid costs in network protocols • Small files have higher latency and other overheads – Compression • inter-file similarities – Privacy • Hide file boundaries • Negative consequences? – Need an entire segment to write
  • 13. Sub-File Incrementals • Only stored changed part of files • New snapshots point to old objects when data unchanged • Byte ranges – portions of old objcts to be reused
  • 14. Segment Cleaning • Similar to a log-structured file system (LFS) • Clean based on utilization of segment, α – α = 0, no cleaning – α = 1, clean with the slightest change • Cumulus – attempts to find an equilibrium for α – Uses a different process to clean – Marks a local database as “expired” – Then, next snapshot will not refer to expired segment
  • 15. Restore • Full Restore – Download all segments for a snapshot • Partial Restore – Download snapshot descriptor, metadata, and only necessary segments • What happens if client machine dies? • How is latest snapshot descriptors identified? • What about sharing between client machines?
  • 16. Outline • Motivation/Intro • Related Work • Design • Evaluation – Performance Case Study – Monetary Case Study • Thoughts and Conclusions
  • 18. Backup over time (user trace)
  • 19. Backup w/out Segment Cleaning (user trace)
  • 20. Average Daily Storage (fileserver)
  • 21. Average Daily Upload (fileserver)
  • 22. Average Segments per Day (fileserver)
  • 23. Storage overhead for 16MB Segment (fileserver)
  • 26. Monetary Case Study • Storage: $0.15 per GB . Month • Upload: $0.10 per GB • Segment: $0.01 per 1000 files uploaded • We are charged this amount, so please be careful with your labs and projects!!!
  • 28. Costs for Backup (fileserver)
  • 29. Monetary Cost Comparison (user trace)
  • 30. Outline • Motivation/Intro • Related Work • Design • Evaluation • Thoughts and Conclusions
  • 31. Discussion • Thoughts? • Did paper make case for Thin Clouds? • Sharing between clients ignored? • What every happened to P2P?!
  • 33. Next Time • Read NFS and write review: – Design and Implementation of a Network File System, Sandberg, Goldberg, Kleiman, Walsh, and Lyon, USENIX 1985 • Do Lab 0 • Check website for updated schedule