Cloud Storage Migration, Backup, and Archive

Feb 2014
Google Cloud Storage Backup and Archive

Who? Why?
Ido Green
Solutions Architect
plus.google.com/greenido

greenido.wordpress.com

Topics We Cover in This Lesson
● Copying/Migrating Data to GCS
● Object Composition
● Durable Reduced Availability Storage

Copying/Migrating Data to Google Cloud Storage
● How fast can you copy data to Google Cloud Storage?
○ There are many factors
Exercise

Using gsutil 101
● Installation
○ developers.google.com/storage/docs/gsutil_install
○ gsutil update
● Set Up Credentials to Access Protected Data
○ gsutil config
● Test
○ Create a new bucket: cloud.google.com/console/project/YourID/storage
○ Upload a file: gsutil cp rand_10m.txt gs://paris1
○ List the bucket: gsutil ls gs://paris1

Using gsutil perfdiag
● gsutil perfdiag gs://<bucket>
● Exercise:
○ Run gsutil perfdiag now
○ Look for the Write Throughput output

-----------------------------------------------------------------------------
Write Throughput
-----------------------------------------------------------------------------
Copied a 1 MB file 5 times for a total transfer size of 5 MB.
Write throughput: 6.16 Mbit/s

○ Use the throughput to estimate how long it will take to upload a 10 MB file, a 100 MB file, 1 GB (1024 MB), and 1 TB (1048576 MB)
○ Create a 10 MB file: head -c 10485760 /dev/urandom > rand.txt
○ Run gsutil cp <file> gs://<bucket> and time the upload
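
The estimate asked for in this exercise is simple arithmetic: convert the file size to megabits and divide by the measured throughput. The sketch below is one way to do it in the shell. The function name estimate_upload_seconds is hypothetical (it is not part of gsutil), and it assumes that 1 MB equals 8 Mbit and that throughput stays constant for the whole transfer, which real networks rarely guarantee.

```shell
# Hypothetical helper, not a gsutil command: estimate upload time in seconds.
#   $1 = file size in MB
#   $2 = measured write throughput in Mbit/s (e.g. from gsutil perfdiag)
# Assumption: 1 MB = 8 Mbit and the throughput is constant.
estimate_upload_seconds() {
  LC_ALL=C awk -v size_mb="$1" -v mbit_per_s="$2" \
    'BEGIN { printf "%.1f\n", size_mb * 8 / mbit_per_s }'
}

# Sizes from the exercise, using the sample throughput of 6.16 Mbit/s.
for size_mb in 10 100 1024 1048576; do
  printf '%8s MB: %s s\n' "$size_mb" "$(estimate_upload_seconds "$size_mb" 6.16)"
done
```

With the sample figure of 6.16 Mbit/s this works out to roughly 13 seconds for 10 MB, a little over 2 minutes for 100 MB, about 22 minutes for 1 GB, and close to 16 days for 1 TB. Substitute your own perfdiag result and compare it with the timed gsutil cp run.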

Copying Data to Google Cloud Storage
● Use the -m option for parallel copying
○ gsutil -m cp <file1> <file2> <file3> gs://<bucket>
● Use offline disk import
○ Limited preview for customers with return address in the United States
○ Flat fee of $80 per HDD irrespective of the drive capacity or data size

Migrating Data to Google Cloud Storage
What if you have petabytes of data to move to Google Cloud Storage while keeping your production system running?
○ Need to minimize the migration window
○ No impact to production system
○ Need to minimize storage cost

Migrating Data to Google Cloud Storage
● Architecture from a case study

Object Composition
● Allow parallel uploads, followed by
○ gsutil compose <file1> .. <file32> <final_object>
● Can append to an existing object
○ gsutil compose <final_object> <file_to_append> <final_object>
● Can do limited editing by replacing one of the components
○ gsutil compose <file1> <edited file n> ... <final_object>
● Note: For a composite object, the ETag value is not the MD5 hash of the object.
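
The compose example on this slide lists components up to <file32>, so a file that is split for parallel upload should produce no more pieces than that if it is to be composed in one command. A minimal sketch of the arithmetic follows. The function name min_piece_size is hypothetical, and the default of 32 is taken from the slide as an assumption about the per-command limit.

```shell
# Hypothetical helper: smallest piece size in bytes that splits a file of
# $1 bytes into at most $2 pieces. $2 defaults to 32, the component count
# shown in the compose example (assumed here to be the per-command limit).
min_piece_size() {
  total_bytes="$1"
  max_pieces="${2:-32}"
  # Integer ceiling division: round up so the last piece is never dropped.
  echo $(( (total_bytes + max_pieces - 1) / max_pieces ))
}

# A 10 MB file (10485760 bytes) needs pieces of at least this many bytes.
min_piece_size 10485760
```

The result can be passed to split -b as the piece size. Larger pieces are fine too. They simply yield fewer components.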

Object Composition
To upload in parallel, split your file into smaller pieces, upload them using
“gsutil -m cp”, compose the results, and delete the pieces:
$ split -b 1000000 rand-splity.txt rand-s-part-
$ gsutil -m cp rand-s-part-* gs://bucket/dir/
$ rm rand-s-part-*
$ gsutil compose gs://bucket/dir/rand-s-part-* gs://bucket/big-file
$ gsutil -m rm gs://bucket/dir/rand-s-part-*
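
Before running this against a real bucket, the split-and-rejoin idea can be checked locally. The sketch below is a dry run under stated assumptions: it never calls gsutil, plain cat stands in for gsutil compose, and the file names and the 3,500,000-byte test size are illustrative only.

```shell
# Local dry run of split-then-concatenate; no bucket is touched.
# cat stands in for "gsutil compose"; names and sizes are illustrative.
workdir="$(mktemp -d)"
head -c 3500000 /dev/urandom > "$workdir/rand.txt"

# Split into 1,000,000-byte pieces named rand-s-part-aa, -ab, ...
split -b 1000000 "$workdir/rand.txt" "$workdir/rand-s-part-"

# The shell expands the glob in sorted order, which matches the split order.
cat "$workdir"/rand-s-part-* > "$workdir/rejoined.txt"

piece_count="$(ls "$workdir"/rand-s-part-* | wc -l | tr -d ' ')"
if cmp -s "$workdir/rand.txt" "$workdir/rejoined.txt"; then
  roundtrip="identical"
else
  roundtrip="different"
fi
echo "$piece_count pieces, round trip: $roundtrip"
rm -r "$workdir"
```

The point of the check is that the wildcard must list the pieces in the same order that split produced them. If it does, the rejoined file matches the original byte for byte.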
Exercise

Object Composition Exercise
1. Create three files and upload them to a storage bucket
echo "ONE" > one.txt
echo "TWO" > two.txt
echo "THREE" > three.txt
gsutil cp *.txt gs://<bucket>

2. Use gsutil ls -L to examine the metadata of the objects
gsutil ls -L gs://<bucket> | grep -v ACL

3. Run gsutil to compose them into a single object
gsutil compose gs://<bucket>/{one,two,three}.txt gs://<bucket>/composite.txt

4. Use gsutil ls -L to examine the metadata of the composite
5. Examine the Hash and ETag values of the composite object
6. Use gsutil cat to view the contents of the composite object
a. Please do NOT run it on binary files
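
To know what to expect from step 6, the same three files can be concatenated locally. This is only a stand-in: it assumes that compose joins its components in the order they are listed, and it uses plain cat in a temporary directory rather than any gsutil command.

```shell
# Local preview of the exercise: concatenate the three files in the same
# order that the compose command lists them. No bucket is involved.
workdir="$(mktemp -d)"
echo "ONE" > "$workdir/one.txt"
echo "TWO" > "$workdir/two.txt"
echo "THREE" > "$workdir/three.txt"

composite="$(cat "$workdir/one.txt" "$workdir/two.txt" "$workdir/three.txt")"
composite_bytes="$(cat "$workdir/one.txt" "$workdir/two.txt" "$workdir/three.txt" | wc -c | tr -d ' ')"

printf '%s\n' "$composite"
echo "size: $composite_bytes bytes"
rm -r "$workdir"
```

The local result is the three words on separate lines, 14 bytes in total. The size reported by gsutil ls -L for composite.txt is a useful value to compare against.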

Durable Reduced Availability (DRA) Buckets
● Enable you to store data at lower cost than standard storage (via fewer replicas)
● Have the following characteristics compared to standard buckets:
○ lower costs
○ lower availability
○ same durability
○ same performance !!!
● Create a DRA bucket
○ gsutil mb -c DRA gs://<bucketname>/

Moving Data Between DRA and Standard Buckets
● Must download and upload
● gsutil provides a daisy chain copy mode
○ gsutil cp -D -R gs://<standard_bucket>/* gs://<durable_reduced_availability_bucket>

● Object ACL is not preserved

Thank you!
Questions?
