SlideShare una empresa de Scribd logo
1 de 43
Descargar para leer sin conexión
Best Practices for S3
Bruno Laurenti
Solutions Architect
• Storage Classes
• Bucket Settings & Features
• Managing Data at Scale
• Performance
• Data Transfer
Agenda
Enterprise
applicationsAnalytics
Archiving
Backup &
restore
Origin storage
for CDN
Benefits of Amazon S3
Website hosting
Mobile sync and
storage
Storage Classes
Decreasing prices and more storage options
1 2
Decreasing storage prices
S3 Standard
(2006)
Glacier
(2012)
S-IA
(2015)
Z-IA
(H1-2018)
INT
(Q4-2018)
Accelerating innovation
2006 2018
Choose the storage class that fits best
≥ 3 AZs 1 AZ
99.99% 99.5%
Milliseconds Hours
Hours YearsFrequent Infrequent
0 Bytes 5 Terabytes
Reduce storage cost > 80% by choosing the
storage class option that best fits your use case
2 Regions
99.9%
Your choice of Amazon S3 storage classes
Access FrequencyFrequent Infrequent
• Active, frequently
accessed data
• Milliseconds access
• > 3 AZ
• From: $0.0210/GB
• Data with changing access
pattern
• Milliseconds access
• > 3 AZ
• From: $0.0210 to
$0.0125/GB
• Monitoring fee per obj.
• Min storage duration
• Infrequently accessed
data
• Milliseconds access
• > 3 AZ
• From: $0.0125/GB
• Retrieval fee per GB
• Min storage duration
• Min object size
S3 Standard S3 Standard-IA S3 One Zone-IA S3 Glacier
• Re-creatable less accessed
data
• Milliseconds access
• 1 AZ
• From: $0.0100/GB
• Retrieval fee per GB
• Min storage duration
• Min object size
• Archive data
• Minutes to hours
access
• > 3 AZ
• From: $0.0040/GB
• Retrieval fee per GB
• Min storage duration
• Min object size
S3 Intelligent-
Tiering
S3 Glacier
Deep Archive
• Archive data
• Hours access
• > 3 AZ
• From: $0.00099/GB
• Retrieval fee per GB
• Min storage duration
• Min object size
S3 Storage Class Analysis and S3 Lifecycle Policy
Use S3 Storage Class Analysis to identify
storage age groups that are less
frequently accessed
Set S3 Lifecycle Policy to tier storage to
lower cost storage classes and expire
storage based on age of object
Great for predictable workloads (object
age indicates access frequency)
Fine tune analysis by bucket, prefix, or
object tag
Set S3 Lifecycle Policy to tier and
expire storage
S3 Lifecycle Policy to tier to lower
cost storage classes and expire
storage
S3 Storage Class Analysis results
help set up a S3 Lifecycle Policy
Policies are based on age of object
and set by bucket, prefix, or object
tag
S3 Standard S3 S-IA S3 Glacier
S3 Intelligent-Tiering
Automatically optimizes storage costs for
data with changing access patterns
Moves objects between two storage tiers:
• Frequent Access Tier
• Infrequent Access Tier
Monitors access patterns and auto-tiers on
granular object level
Milliseconds access, > 3 AZ, Monitoring fee
per Object, minimum storage duration
Ideal use cases for S3 Intelligent-Tiering
Dynamic cost optimization with no performance impact and no operational overhead
Big Data, Data Lakes
Storage with changing access
patterns used by multiple applications
Enterprises
Storage accessed by fragmented
applications from various
organizations
Startups
Constrained on resources and
experience to optimize storage
themselves
Amazon S3
Bucket Settings & Features
Permissions
Permissions
• IAM Policies
• S3 Bucket Policies
• S3 ACLs
As a general rule, AWS recommends using S3 bucket
policies or IAM policies for access control. S3 ACLs is a
legacy access control mechanism that predates IAM.
Use IAM policies if:
• You need to control access to AWS services other than S3. IAM policies will be easier to manage since you
can centrally manage all of your permissions in IAM, instead of spreading them between IAM and S3.
• You have numerous S3 buckets each with different permissions requirements. IAM policies will be easier
to manage since you don’t have to define a large number of S3 bucket policies and can instead rely on
fewer, more detailed IAM policies.
• You prefer to keep access control policies in the IAM environment.
Use S3 bucket policies if:
• You want a simple way to grant cross-account access to your S3 environment, without using IAM roles.
• Your IAM policies bump up against the size limit (up to 2 kb for users, 5 kb for groups, and 10 kb for roles). S3
supports bucket policies of up 20 kb.
• You prefer to keep access control policies in the S3 environment.
Permissions
With a few clicks in the S3
management console, you can
apply S3 Block Public Access to
every bucket in your account –
both existing and any new
buckets created in the future –
and make sure that there is no
public access to any object
S3 Block Public Access
Set at the account or bucket-level
Versioning
Protect your data from accidental deletion
• Create a new version with every upload
• Previous versions are retained, not overwritten
• Protect from unintended user deletes
• Making delete requests without a version ID removes
access to objects, but keeps the data
• Manage previous versions with lifecycle
• Transition or expire objects a specified number of
days after they are no longer the current version
S3 Object Lock
Immutable S3 Objects
• Write Once Read Many (WORM) Protection for S3 Objects
• Object or bucket control of WORM & retention attributes
Retention Management Controls
• Define retention periods in your app or with bucket-level defaults
• Objects Locked for the Duration of the Retention Period
• Support for Legal Hold scenarios
Data Protection and Compliance
• Assessed for use in SEC 17a-4, CFTC, and FINRA environments
• Extra protection against accidental or malicious delete
S3 Object Lock Modes
Compliance
Mode
• Intended for Compliance
• Deletes disallowed, even for root account
• Assessed for SEC 17a-4 by Cohasset Associates
Governance
Mode
• Intended for Data Protection
• Enables privileged delete of WORM-protected objects
• Protects against account compromise & rogue actors
• Retention can be changed to Compliance Mode
S3 Object Lock Legal Hold
Amazon S3 Glacier Enhancements
Restore
Notifications
Notifications fire when a S3 Glacier restore
starts and completes
Restore
Speed Upgrade
Upgrade an in-progress restore to a faster restore
speed
Direct
PUT
Direct access to S3 Glacier through the S3 PUT API
CRR direct to
Glacier
Replicate data direct to S3 Glacier in a
secondary AWS region
Object Tags
Add up to ten tags to your objects to control access and drive actions
For example:
• Grant an IAM user permissions to read only objects with specific tags
• Use tags to indicate which objects should be replicated
• Apply tags to specify granular lifecycle policies
• Filter metrics and reports based on tags
photos/photo1.jpg
project/projectX/document.pdf
project/projectY/document2.pdf
projectX
Cross-Region Replication
Flexibility to replicate data:
• At the bucket, prefix, or object level
• From any region to any region
• To any storage class
• Across AWS accounts
• Change the object owner in the destination region
Managing Data at Scale
Amazon S3 Batch Operations
Take large-scale actions on Amazon S3 objects
Amazon S3 Batch Operations
Take large-scale actions on Amazon S3 objects
S3 Batch Operations
Perform API actions across thousands, millions, or billions of objects
S3 Batch Operations
Choose
Objects
Select an
Operation
View
Progress
Perform API actions across thousands, millions, or billions of objects
Performance
Amazon S3 performance optimization
• What we hear from you …
§ How do I get the highest request rates?
§ How do I saturate my compute, network, and storage resources?
§ How do I maximize my single-threaded throughput?
§ How do I achieve more predictable outlier performance?
§ How do I optimize data query performance against Amazon S3?
• Where do you start?
§ Your object naming scheme!
§ We’ll review this first, then walk through a streaming video workflow use
case as an example.
Amazon S3 performance increase
Amazon S3 for data analytics
BEFORE
Compute
W R I T E T I M E R E A D T I M E
5TB of 2MB objects
S3
1with prefix
Amazon S3 performance increase
Amazon S3 request performance increase
NOW
Compute
W R I T E T I M E R E A D T I M E
S3
41m 40s 13m 52s
5TB of 2MB objects
1with prefix0
Amazon S3 performance increase
Amazon S3 request performance increase
PARALLEL PROCESSING
Compute
W R I T E T I M E R E A D T I M E
S3
12m 00s 7m 00s
41m 40s 13m 52s
5TB of 2MB objects
1with 0 prefix
Object naming scheme
ExampleAWSbucket/Logistics/packing-list.pdf
Up to 3,500 PUT* tps
Up to 5,500 GET tps
ExampleAWSbucket
Let’s look at how prefixes scale
request rate performance
BucketName/Prefix:
ExampleAWSbucket/LogFiles/
ExampleAWSbucket/Logistics/
ExampleAWSbucket/…
* PUT, POST, and DELETE are all included in the PUT tps
tps = transactions per second
Initial partition
3,500 PUT tps
5,500 GET tps
3,500 PUT tps
5,500 GET tps
2nd partition
Initial partition
ExampleAWSbucket
BucketName/Prefix:
ExampleAWSbucket/LogFiles/
ExampleAWSbucket/Logistics/
ExampleAWSbucket/…
Let’s look at how prefixes scale
request rate performance
/Log
/…
PUT, POST, and DELETE are all included in the PUT tps
tps = transactions per second
3,500 PUT tps
5,500 GET tps
3,500 PUT tps
5,500 GET tps
3,500 PUT tps
5,500 GET tps
2nd partition
Files/
istics/
3rd partition
3,500 PUT tps
5,500 GET tps
…
ExampleAWSbucket
BucketName/Prefix:
ExampleAWSbucket/LogFiles/
ExampleAWSbucket/Logistics/
ExampleAWSbucket/…
Let’s look at how prefixes scale
request rate performance
/Log
/…
PUT, POST, and DELETE are all included in the PUT tps
tps = transactions per second
Initial partition
3,500 PUT tps
5,500 GET tps
3,500 PUT tps
0 GET tps
0 PUT tps
5,500 GET tps
Request rate performance is allocated proportionally
PUT, POST, and DELETE are all included in the PUT tps
tps = transactions per second
Data Transfer
Flexible Data Transfer
More ways to get data into Amazon S3
Database
Migration
Service
AWS Transfer for SFTP
Fully-managed service enabling transfer
of data over SFTP, while stored in Amazon S3
Seamless migration
of existing workflows
Native integration
with AWS services
Simple
to use
Cost
effective
Secure and CompliantFully managed
in AWS
AWS
integrated
AWS
Transfer service that simplifies, automates, and accelerates data movement
Transfers up
to 10 Gbps
per agent
Pay as you
go
Secure and
reliable
transfers
Replicate data to AWS
for business continuity
Transfer data for timely
in-cloud analysis
Migrate active application
data to AWS
Combines the speed and reliability of network acceleration
software with the cost-effectiveness of open source tools
Simple data
movement to S3
or Amazon EFS
AWS DataSync
Let’s Build!
Bruno Laurenti
Solutions Architect
brulau@amazon.com

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

AWS Cloud Kata 2014 | Jakarta - 2-3 Big Data
 AWS Cloud Kata 2014 | Jakarta - 2-3 Big Data AWS Cloud Kata 2014 | Jakarta - 2-3 Big Data
AWS Cloud Kata 2014 | Jakarta - 2-3 Big Data
 
Building a Server-less Data Lake on AWS - Technical 301
Building a Server-less Data Lake on AWS - Technical 301Building a Server-less Data Lake on AWS - Technical 301
Building a Server-less Data Lake on AWS - Technical 301
 
ENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS ResourcesENT314 Automate Best Practices and Operational Health for Your AWS Resources
ENT314 Automate Best Practices and Operational Health for Your AWS Resources
 
February 2016 Webinar Series - Architectural Patterns for Big Data on AWS
February 2016 Webinar Series - Architectural Patterns for Big Data on AWSFebruary 2016 Webinar Series - Architectural Patterns for Big Data on AWS
February 2016 Webinar Series - Architectural Patterns for Big Data on AWS
 
Amazon S3 & Amazon Glacier - Object Storage Overview
Amazon S3 & Amazon Glacier - Object Storage OverviewAmazon S3 & Amazon Glacier - Object Storage Overview
Amazon S3 & Amazon Glacier - Object Storage Overview
 
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
 
Best Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWSBest Practices for Building a Data Lake on AWS
Best Practices for Building a Data Lake on AWS
 
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
AWS re:Invent 2016: How Amazon S3 Storage Management Helps Optimize Storage a...
 
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech TalksHow to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
How to Build a Data Lake in Amazon S3 & Amazon Glacier - AWS Online Tech Talks
 
AWS Big Data Landscape
AWS Big Data LandscapeAWS Big Data Landscape
AWS Big Data Landscape
 
Big Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best PracticesBig Data Architectural Patterns and Best Practices
Big Data Architectural Patterns and Best Practices
 
Active Archiving with Amazon S3 and Tiering to Amazon Glacier - March 2017 AW...
Active Archiving with Amazon S3 and Tiering to Amazon Glacier - March 2017 AW...Active Archiving with Amazon S3 and Tiering to Amazon Glacier - March 2017 AW...
Active Archiving with Amazon S3 and Tiering to Amazon Glacier - March 2017 AW...
 
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
ABD318_Architecting a data lake with Amazon S3, Amazon Kinesis, AWS Glue and ...
 
Building a Modern Data Architecture on AWS - Webinar
Building a Modern Data Architecture on AWS - WebinarBuilding a Modern Data Architecture on AWS - Webinar
Building a Modern Data Architecture on AWS - Webinar
 
Building Data Lakes in the AWS Cloud
Building Data Lakes in the AWS CloudBuilding Data Lakes in the AWS Cloud
Building Data Lakes in the AWS Cloud
 
Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS
 
Deep Dive on Amazon S3 - AWS Online Tech Talks
Deep Dive on Amazon S3 - AWS Online Tech TalksDeep Dive on Amazon S3 - AWS Online Tech Talks
Deep Dive on Amazon S3 - AWS Online Tech Talks
 
ENT316 Keeping Pace With The Cloud: Managing and Optimizing as You Scale
ENT316 Keeping Pace With The Cloud: Managing and Optimizing as You ScaleENT316 Keeping Pace With The Cloud: Managing and Optimizing as You Scale
ENT316 Keeping Pace With The Cloud: Managing and Optimizing as You Scale
 
Workshop: Building Your First Big Data Application on AWS
Workshop: Building Your First Big Data Application on AWSWorkshop: Building Your First Big Data Application on AWS
Workshop: Building Your First Big Data Application on AWS
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWS
 

Similar a Builders' Day - Best Practises for S3 - BL

Similar a Builders' Day - Best Practises for S3 - BL (20)

SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon GlacierSRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
 
Best Practices for Amazon S3 and Amazon Glacier (STG203-R2) - AWS re:Invent 2018
Best Practices for Amazon S3 and Amazon Glacier (STG203-R2) - AWS re:Invent 2018Best Practices for Amazon S3 and Amazon Glacier (STG203-R2) - AWS re:Invent 2018
Best Practices for Amazon S3 and Amazon Glacier (STG203-R2) - AWS re:Invent 2018
 
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
Deep Dive On Object Storage: Amazon S3 and Amazon Glacier - AWS PS Summit Can...
 
Deep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
Deep Dive on Amazon S3 - March 2017 AWS Online Tech TalksDeep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
Deep Dive on Amazon S3 - March 2017 AWS Online Tech Talks
 
AWS S3 Cost Optimization
AWS S3 Cost OptimizationAWS S3 Cost Optimization
AWS S3 Cost Optimization
 
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
Protect & Manage Amazon S3 & Amazon Glacier Objects at Scale (STG316-R1) - AW...
 
Visualizing Amazon S3 Storage Management with QuickSight - AWS Online Tech Talks
Visualizing Amazon S3 Storage Management with QuickSight - AWS Online Tech TalksVisualizing Amazon S3 Storage Management with QuickSight - AWS Online Tech Talks
Visualizing Amazon S3 Storage Management with QuickSight - AWS Online Tech Talks
 
AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying a...
AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying a...AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying a...
AWS re:Invent 2016: Workshop: AWS S3 Deep-Dive Hands-On Workshop: Deploying a...
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
Supercharging the Value of Your Data with Amazon S3
Supercharging the Value of Your Data with Amazon S3Supercharging the Value of Your Data with Amazon S3
Supercharging the Value of Your Data with Amazon S3
 
Object Storage: Amazon S3 and Amazon Glacier
Object Storage: Amazon S3 and Amazon GlacierObject Storage: Amazon S3 and Amazon Glacier
Object Storage: Amazon S3 and Amazon Glacier
 
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
Deep Dive on Object Storage: Amazon S3 and Amazon Glacier | AWS Public Sector...
 
Cost efficiencies and security best practices with Amazon S3 storage - STG301...
Cost efficiencies and security best practices with Amazon S3 storage - STG301...Cost efficiencies and security best practices with Amazon S3 storage - STG301...
Cost efficiencies and security best practices with Amazon S3 storage - STG301...
 
Compliance-Data-Archival
Compliance-Data-ArchivalCompliance-Data-Archival
Compliance-Data-Archival
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
AWS Storage - S3 Fundamentals
AWS Storage - S3 FundamentalsAWS Storage - S3 Fundamentals
AWS Storage - S3 Fundamentals
 
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
AWS re:Invent 2016: Deep Dive on Amazon Glacier (STG302)
 
Storage with Amazon S3 and Amazon Glacier
Storage with Amazon S3 and Amazon GlacierStorage with Amazon S3 and Amazon Glacier
Storage with Amazon S3 and Amazon Glacier
 
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon GlacierSRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
SRV403 Deep Dive on Object Storage: Amazon S3 and Amazon Glacier
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 

Más de Amazon Web Services LATAM

Más de Amazon Web Services LATAM (20)

AWS para terceiro setor - Sessão 1 - Introdução à nuvem
AWS para terceiro setor - Sessão 1 - Introdução à nuvemAWS para terceiro setor - Sessão 1 - Introdução à nuvem
AWS para terceiro setor - Sessão 1 - Introdução à nuvem
 
AWS para terceiro setor - Sessão 2 - Armazenamento e Backup
AWS para terceiro setor - Sessão 2 - Armazenamento e BackupAWS para terceiro setor - Sessão 2 - Armazenamento e Backup
AWS para terceiro setor - Sessão 2 - Armazenamento e Backup
 
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
 
AWS para terceiro setor - Sessão 1 - Introdução à nuvem
AWS para terceiro setor - Sessão 1 - Introdução à nuvemAWS para terceiro setor - Sessão 1 - Introdução à nuvem
AWS para terceiro setor - Sessão 1 - Introdução à nuvem
 
AWS para terceiro setor - Sessão 2 - Armazenamento e Backup
AWS para terceiro setor - Sessão 2 - Armazenamento e BackupAWS para terceiro setor - Sessão 2 - Armazenamento e Backup
AWS para terceiro setor - Sessão 2 - Armazenamento e Backup
 
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
AWS para terceiro setor - Sessão 3 - Protegendo seus dados.
 
Automatice el proceso de entrega con CI/CD en AWS
Automatice el proceso de entrega con CI/CD en AWSAutomatice el proceso de entrega con CI/CD en AWS
Automatice el proceso de entrega con CI/CD en AWS
 
Automatize seu processo de entrega de software com CI/CD na AWS
Automatize seu processo de entrega de software com CI/CD na AWSAutomatize seu processo de entrega de software com CI/CD na AWS
Automatize seu processo de entrega de software com CI/CD na AWS
 
Cómo empezar con Amazon EKS
Cómo empezar con Amazon EKSCómo empezar con Amazon EKS
Cómo empezar con Amazon EKS
 
Como começar com Amazon EKS
Como começar com Amazon EKSComo começar com Amazon EKS
Como começar com Amazon EKS
 
Ransomware: como recuperar os seus dados na nuvem AWS
Ransomware: como recuperar os seus dados na nuvem AWSRansomware: como recuperar os seus dados na nuvem AWS
Ransomware: como recuperar os seus dados na nuvem AWS
 
Ransomware: cómo recuperar sus datos en la nube de AWS
Ransomware: cómo recuperar sus datos en la nube de AWSRansomware: cómo recuperar sus datos en la nube de AWS
Ransomware: cómo recuperar sus datos en la nube de AWS
 
Ransomware: Estratégias de Mitigação
Ransomware: Estratégias de MitigaçãoRansomware: Estratégias de Mitigação
Ransomware: Estratégias de Mitigação
 
Ransomware: Estratégias de Mitigación
Ransomware: Estratégias de MitigaciónRansomware: Estratégias de Mitigación
Ransomware: Estratégias de Mitigación
 
Aprenda a migrar y transferir datos al usar la nube de AWS
Aprenda a migrar y transferir datos al usar la nube de AWSAprenda a migrar y transferir datos al usar la nube de AWS
Aprenda a migrar y transferir datos al usar la nube de AWS
 
Aprenda como migrar e transferir dados ao utilizar a nuvem da AWS
Aprenda como migrar e transferir dados ao utilizar a nuvem da AWSAprenda como migrar e transferir dados ao utilizar a nuvem da AWS
Aprenda como migrar e transferir dados ao utilizar a nuvem da AWS
 
Cómo mover a un almacenamiento de archivos administrados
Cómo mover a un almacenamiento de archivos administradosCómo mover a un almacenamiento de archivos administrados
Cómo mover a un almacenamiento de archivos administrados
 
Simplifique su BI con AWS
Simplifique su BI con AWSSimplifique su BI con AWS
Simplifique su BI con AWS
 
Simplifique o seu BI com a AWS
Simplifique o seu BI com a AWSSimplifique o seu BI com a AWS
Simplifique o seu BI com a AWS
 
Os benefícios de migrar seus workloads de Big Data para a AWS
Os benefícios de migrar seus workloads de Big Data para a AWSOs benefícios de migrar seus workloads de Big Data para a AWS
Os benefícios de migrar seus workloads de Big Data para a AWS
 

Último

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Builders' Day - Best Practises for S3 - BL

  • 1. Best Practices for S3 Bruno Laurenti Solutions Architect
  • 2. • Storage Classes • Bucket Settings & Features • Managing Data at Scale • Performance • Data Transfer Agenda
  • 3. Enterprise applicationsAnalytics Archiving Backup & restore Origin storage for CDN Benefits of Amazon S3 Website hosting Mobile sync and storage
  • 5. Decreasing prices and more storage options 1 2 Decreasing storage prices S3 Standard (2006) Glacier (2012) S-IA (2015) Z-IA (H1-2018) INT (Q4-2018) Accelerating innovation 2006 2018
  • 6. Choose the storage class that fits best ≥ 3 AZs 1 AZ 99.99% 99.5% Milliseconds Hours Hours YearsFrequent Infrequent 0 Bytes 5 Terabytes Reduce storage cost > 80% by choosing the storage class option that best fits your use case 2 Regions 99.9%
  • 7. Your choice of Amazon S3 storage classes Access FrequencyFrequent Infrequent • Active, frequently accessed data • Milliseconds access • > 3 AZ • From: $0.0210/GB • Data with changing access pattern • Milliseconds access • > 3 AZ • From: $0.0210 to $0.0125/GB • Monitoring fee per obj. • Min storage duration • Infrequently accessed data • Milliseconds access • > 3 AZ • From: $0.0125/GB • Retrieval fee per GB • Min storage duration • Min object size S3 Standard S3 Standard-IA S3 One Zone-IA S3 Glacier • Re-creatable less accessed data • Milliseconds access • 1 AZ • From: $0.0100/GB • Retrieval fee per GB • Min storage duration • Min object size • Archive data • Minutes to hours access • > 3 AZ • From: $0.0040/GB • Retrieval fee per GB • Min storage duration • Min object size S3 Intelligent- Tiering S3 Glacier Deep Archive • Archive data • Hours access • > 3 AZ • From: $0.00099/GB • Retrieval fee per GB • Min storage duration • Min object size
  • 8. S3 Storage Class Analysis and S3 Lifecycle Policy Use S3 Storage Class Analysis to identify storage age groups that are less frequently accessed Set S3 Lifecycle Policy to tier storage to lower cost storage classes and expire storage based on age of object Great for predictable workloads (object age indicates access frequency) Fine tune analysis by bucket, prefix, or object tag
  • 9. Set S3 Lifecycle Policy to tier and expire storage S3 Lifecycle Policy to tier to lower cost storage classes and expire storage S3 Storage Class Analysis results help set up a S3 Lifecycle Policy Policies are based on age of object and set by bucket, prefix, or object tag S3 Standard S3 S-IA S3 Glacier
  • 10. S3 Intelligent-Tiering Automatically optimizes storage costs for data with changing access patterns Moves objects between two storage tiers: • Frequent Access Tier • Infrequent Access Tier Monitors access patterns and auto-tiers on granular object level Milliseconds access, > 3 AZ, Monitoring fee per Object, minimum storage duration
  • 11. Ideal use cases for S3 Intelligent-Tiering Dynamic cost optimization with no performance impact and no operational overhead Big Data, Data Lakes Storage with changing access patterns used by multiple applications Enterprises Storage accessed by fragmented applications from various organizations Startups Constrained on resources and experience to optimize storage themselves Amazon S3
  • 12. Bucket Settings & Features
  • 13. Permissions Permissions • IAM Policies • S3 Bucket Policies • S3 ACLs As a general rule, AWS recommends using S3 bucket policies or IAM policies for access control. S3 ACLs is a legacy access control mechanism that predates IAM.
  • 14. Use IAM policies if: • You need to control access to AWS services other than S3. IAM policies will be easier to manage since you can centrally manage all of your permissions in IAM, instead of spreading them between IAM and S3. • You have numerous S3 buckets each with different permissions requirements. IAM policies will be easier to manage since you don’t have to define a large number of S3 bucket policies and can instead rely on fewer, more detailed IAM policies. • You prefer to keep access control policies in the IAM environment. Use S3 bucket policies if: • You want a simple way to grant cross-account access to your S3 environment, without using IAM roles. • Your IAM policies bump up against the size limit (up to 2 kb for users, 5 kb for groups, and 10 kb for roles). S3 supports bucket policies of up 20 kb. • You prefer to keep access control policies in the S3 environment. Permissions
  • 15. With a few clicks in the S3 management console, you can apply S3 Block Public Access to every bucket in your account – both existing and any new buckets created in the future – and make sure that there is no public access to any object S3 Block Public Access Set at the account or bucket-level
  • 16. Versioning Protect your data from accidental deletion • Create a new version with every upload • Previous versions are retained, not overwritten • Protect from unintended user deletes • Making delete requests without a version ID removes access to objects, but keeps the data • Manage previous versions with lifecycle • Transition or expire objects a specified number of days after they are no longer the current version
  • 17. S3 Object Lock Immutable S3 Objects • Write Once Read Many (WORM) Protection for S3 Objects • Object or bucket control of WORM & retention attributes Retention Management Controls • Define retention periods in your app or with bucket-level defaults • Objects Locked for the Duration of the Retention Period • Support for Legal Hold scenarios Data Protection and Compliance • Assessed for use in SEC 17a-4, CFTC, and FINRA environments • Extra protection against accidental or malicious delete
  • 18. S3 Object Lock Modes Compliance Mode • Intended for Compliance • Deletes disallowed, even for root account • Assessed for SEC 17a-4 by Cohasset Associates Governance Mode • Intended for Data Protection • Enables privileged delete of WORM-protected objects • Protects against account compromise & rogue actors • Retention can be changed to Compliance Mode
  • 19. S3 Object Lock Legal Hold
  • 20. Amazon S3 Glacier Enhancements Restore Notifications Notifications fire when a S3 Glacier restore starts and completes Restore Speed Upgrade Upgrade an in-progress restore to a faster restore speed Direct PUT Direct access to S3 Glacier through the S3 PUT API CRR direct to Glacier Replicate data direct to S3 Glacier in a secondary AWS region
  • 21. Object Tags Add up to ten tags to your objects to control access and drive actions For example: • Grant an IAM user permissions to read only objects with specific tags • Use tags to indicate which objects should be replicated • Apply tags to specify granular lifecycle policies • Filter metrics and reports based on tags photos/photo1.jpg project/projectX/document.pdf project/projectY/document2.pdf projectX
  • 22. Cross-Region Replication Flexibility to replicate data: • At the bucket, prefix, or object level • From any region to any region • To any storage class • Across AWS accounts • Change the object owner in the destination region
  • 24. Amazon S3 Batch Operations Take large-scale actions on Amazon S3 objects
  • 25. Amazon S3 Batch Operations Take large-scale actions on Amazon S3 objects
  • 26. S3 Batch Operations Perform API actions across thousands, millions, or billions of objects
  • 27. S3 Batch Operations Choose Objects Select an Operation View Progress Perform API actions across thousands, millions, or billions of objects
  • 29. Amazon S3 performance optimization • What we hear from you … § How do I get the highest request rates? § How do I saturate my compute, network, and storage resources? § How do I maximize my single-threaded throughput? § How do I achieve more predictable outlier performance? § How do I optimize data query performance against Amazon S3? • Where do you start? § Your object naming scheme! § We’ll review this first, then walk through a streaming video workflow use case as an example.
  • 30. Amazon S3 performance increase Amazon S3 for data analytics BEFORE Compute W R I T E T I M E R E A D T I M E 5TB of 2MB objects S3 1with prefix
  • 31. Amazon S3 performance increase Amazon S3 request performance increase NOW Compute W R I T E T I M E R E A D T I M E S3 41m 40s 13m 52s 5TB of 2MB objects 1with prefix0
  • 32. Amazon S3 performance increase Amazon S3 request performance increase PARALLEL PROCESSING Compute W R I T E T I M E R E A D T I M E S3 12m 00s 7m 00s 41m 40s 13m 52s 5TB of 2MB objects 1with 0 prefix
  • 34. Up to 3,500 PUT* tps Up to 5,500 GET tps ExampleAWSbucket Let’s look at how prefixes scale request rate performance BucketName/Prefix: ExampleAWSbucket/LogFiles/ ExampleAWSbucket/Logistics/ ExampleAWSbucket/… * PUT, POST, and DELETE are all included in the PUT tps tps = transactions per second Initial partition
  • 35. 3,500 PUT tps 5,500 GET tps 3,500 PUT tps 5,500 GET tps 2nd partition Initial partition ExampleAWSbucket BucketName/Prefix: ExampleAWSbucket/LogFiles/ ExampleAWSbucket/Logistics/ ExampleAWSbucket/… Let’s look at how prefixes scale request rate performance /Log /… PUT, POST, and DELETE are all included in the PUT tps tps = transactions per second
  • 36. 3,500 PUT tps 5,500 GET tps 3,500 PUT tps 5,500 GET tps 3,500 PUT tps 5,500 GET tps 2nd partition Files/ istics/ 3rd partition 3,500 PUT tps 5,500 GET tps … ExampleAWSbucket BucketName/Prefix: ExampleAWSbucket/LogFiles/ ExampleAWSbucket/Logistics/ ExampleAWSbucket/… Let’s look at how prefixes scale request rate performance /Log /… PUT, POST, and DELETE are all included in the PUT tps tps = transactions per second Initial partition
  • 37. 3,500 PUT tps 5,500 GET tps 3,500 PUT tps 0 GET tps 0 PUT tps 5,500 GET tps Request rate performance is allocated proportionally PUT, POST, and DELETE are all included in the PUT tps tps = transactions per second
  • 39. Flexible Data Transfer More ways to get data into Amazon S3 Database Migration Service
  • 40. AWS Transfer for SFTP Fully-managed service enabling transfer of data over SFTP, while stored in Amazon S3 Seamless migration of existing workflows Native integration with AWS services Simple to use Cost effective Secure and CompliantFully managed in AWS
  • 41. AWS integrated AWS Transfer service that simplifies, automates, and accelerates data movement Transfers up to 10 Gbps per agent Pay as you go Secure and reliable transfers Replicate data to AWS for business continuity Transfer data for timely in-cloud analysis Migrate active application data to AWS Combines the speed and reliability of network acceleration software with the cost-effectiveness of open source tools Simple data movement to S3 or Amazon EFS AWS DataSync