SlideShare una empresa de Scribd logo
1 de 16
Descargar para leer sin conexión
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Modernize your data warehouse with
Amazon Redshift
Matt Scaer
Principal DW specialist SA
AWS
A D B 3 0 1
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Agenda
Startup:
1. Log in to the AWS Console (using your account and credits, or workshop account)
2. Switch to the Oregon region (us-west-2)
3. Create an IAM role for Amazon Redshift Spectrum
4. Create an Amazon Redshift cluster and associate the IAM role
5. Update the security group to allow Amazon Redshift
Refresher on Amazon Redshift
Workshop time
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://tinyurl.com/y33amykm
1. Log in to the AWS Console (using your account and credits, or workshop
account)
2. Switch to the Oregon region (us-west-2)
3. Create an IAM role for Amazon Redshift Spectrum
4. Create an Amazon Redshift cluster and associate the IAM role
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Load
Unload
Backup
Restore
Massively parallel, shared nothing columnar
architecture
Leader node
• SQL endpoint
• Stores metadata
• Coordinates parallel SQL processing
Compute nodes
• Local, columnar storage
• Executes queries in parallel
• Load, unload, backup, restore
Amazon Redshift Spectrum nodes
• Execute queries directly against
Amazon Simple Storage Service (Amazon
S3)
SQL Clients/BI Tools
128GB RAM
16TB disk
16 cores
JDBC/ODBC
128GB RAM
16TB disk
16 coresCompute
Node
128GB RAM
16TB disk
16 coresCompute
Node
128GB RAM
16TB disk
16 coresCompute
Node
Leader
Node
Amazon S3
...
1 2 3 4 N
Amazon
Redshift
Spectrum
Load
Query
Amazon Redshift architecture
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift Advisor: Your DBA’s best friend
• Amazon Redshift expert system available in AWS Console
• Identifies undesirable user behaviors for resolution by providing
high-impact recommendations to improve performance and
reduce cost
• >96% of clusters have tailored feedback
• Actionable WLM, COPY, storage, and system maintenance
feedback
• Analyses have doubled since launch (July ‘18); will double again
by EOY
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Five points of guidance for Amazon Redshift (set DW)
Sort Key (to improve filter performance) Choose up to 3 columns (Compound Sort Key)
ordered in increasing order of specificity, balanced with likelihood of use.
Encoding of Columns Compress all columns except for the first sort key column.
Table Maintenance VACUUM and ANALYZE tables weekly (use the Amazon Redshift Advisor or
STL_ALERT_EVENT_LOG as a guide for frequency).
Distribution Key (to improve join performance) strategy that:
• Follows the common join pattern for the table and evenly distributes the data across the database
slices on the cluster.
• DISTSTYLE AUTO is a great go-to for all tables < ~5 million rows.
• DISTSTYLE EVEN is a good fail-safe, but remember data redistribution.
Workload Management (WLM) and Query Monitoring Rules (QMR)
• Start with defining up to ~3 queues.
• Split up the memory across the queues. Monitor the percent of each queue’s workload
going to disk.
• Anticipate changing WLM settings to match the workload changes (day|night, weekday|weekend).
• Use QMR.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Availability of intelligent administration and maintenance features
Distribution key Recommendation for distribution key
Sort key Recommendation for sort key
Concurrency
setting
Automation for concurrent setting,
making it dynamic
Vacuum Auto vacuum in the background
Analyze Auto analyze in the background
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Concurrency scaling
Amazon Redshift automatically adds transient
clusters in seconds to serve sudden spike in concurrent requests
with consistently fast performance.
Backup Caching layer
How it works:
All queries go to the leader node. User
experiences less wait for queries.
When queries in designated WLM
queue begin queuing, Amazon
Redshift automatically routes them to
the new clusters, enabling
Concurrency Scaling automatically.
Amazon Redshift automatically spins
up a new cluster, processes waiting
queries and automatically shuts down
the Concurrency Scaling cluster.
1
2
3
For every 24 hours that your main
cluster is in use, you accrue a one-
hour credit for concurrency
scaling. This means that
Concurrency Scaling is free for
>97% of customers.
GA March 21,
2019
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Matt Scaer
Principal DW specialist SA
AWS

Más contenido relacionado

Más de Amazon Web Services

Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 
Come costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSCome costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSAmazon Web Services
 
AWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAmazon Web Services
 
Crea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightCrea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightAmazon Web Services
 
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotCostruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotAmazon Web Services
 
Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Amazon Web Services
 
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?Amazon Web Services
 
Protect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksProtect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksAmazon Web Services
 
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Amazon Web Services
 

Más de Amazon Web Services (20)

Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 
Come costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSCome costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWS
 
AWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei server
 
Crea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightCrea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSight
 
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotCostruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
 
Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows
 
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
 
Protect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksProtect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced Attacks
 
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
 

Modernize Your Data Warehouse with Amazon Redshift - ADB301 - Anaheim AWS Summit

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Modernize your data warehouse with Amazon Redshift Matt Scaer Principal DW specialist SA AWS A D B 3 0 1
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Agenda Startup: 1. Log in to the AWS Console (using your account and credits, or workshop account) 2. Switch to the Oregon region (us-west-2) 3. Create an IAM role for Amazon Redshift Spectrum 4. Create an Amazon Redshift cluster and associate the IAM role 5. Update the security group to allow Amazon Redshift Refresher on Amazon Redshift Workshop time
  • 3. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 4. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. https://tinyurl.com/y33amykm 1. Log in to the AWS Console (using your account and credits, or workshop account) 2. Switch to the Oregon region (us-west-2) 3. Create an IAM role for Amazon Redshift Spectrum 4. Create an Amazon Redshift cluster and associate the IAM role
  • 5. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Load Unload Backup Restore Massively parallel, shared nothing columnar architecture Leader node • SQL endpoint • Stores metadata • Coordinates parallel SQL processing Compute nodes • Local, columnar storage • Executes queries in parallel • Load, unload, backup, restore Amazon Redshift Spectrum nodes • Execute queries directly against Amazon Simple Storage Service (Amazon S3) SQL Clients/BI Tools 128GB RAM 16TB disk 16 cores JDBC/ODBC 128GB RAM 16TB disk 16 coresCompute Node 128GB RAM 16TB disk 16 coresCompute Node 128GB RAM 16TB disk 16 coresCompute Node Leader Node Amazon S3 ... 1 2 3 4 N Amazon Redshift Spectrum Load Query Amazon Redshift architecture
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift Advisor: Your DBA’s best friend • Amazon Redshift expert system available in AWS Console • Identifies undesirable user behaviors for resolution by providing high-impact recommendations to improve performance and reduce cost • >96% of clusters have tailored feedback • Actionable WLM, COPY, storage, and system maintenance feedback • Analyses have doubled since launch (July ‘18); will double again by EOY
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Five points of guidance for Amazon Redshift (set DW) Sort Key (to improve filter performance) Choose up to 3 columns (Compound Sort Key) ordered in increasing order of specificity, balanced with likelihood of use. Encoding of Columns Compress all columns except for the first sort key column. Table Maintenance VACUUM and ANALYZE tables weekly (use the Amazon Redshift Advisor or STL_ALERT_EVENT_LOG as a guide for frequency). Distribution Key (to improve join performance) strategy that: • Follows the common join pattern for the table and evenly distributes the data across the database slices on the cluster. • DISTSTYLE AUTO is a great go-to for all tables < ~5 million rows. • DISTSTYLE EVEN is a good fail-safe, but remember data redistribution. Workload Management (WLM) and Query Monitoring Rules (QMR) • Start with defining up to ~3 queues. • Split up the memory across the queues. Monitor the percent of each queue’s workload going to disk. • Anticipate changing WLM settings to match the workload changes (day|night, weekday|weekend). • Use QMR.
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Availability of intelligent administration and maintenance features Distribution key Recommendation for distribution key Sort key Recommendation for sort key Concurrency setting Automation for concurrent setting, making it dynamic Vacuum Auto vacuum in the background Analyze Auto analyze in the background
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Concurrency scaling Amazon Redshift automatically adds transient clusters in seconds to serve sudden spike in concurrent requests with consistently fast performance. Backup Caching layer How it works: All queries go to the leader node. User experiences less wait for queries. When queries in designated WLM queue begin queuing, Amazon Redshift automatically routes them to the new clusters, enabling Concurrency Scaling automatically. Amazon Redshift automatically spins up a new cluster, processes waiting queries and automatically shuts down the Concurrency Scaling cluster. 1 2 3 For every 24 hours that your main cluster is in use, you accrue a one- hour credit for concurrency scaling. This means that Concurrency Scaling is free for >97% of customers. GA March 21, 2019
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 15. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 16. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Matt Scaer Principal DW specialist SA AWS