SlideShare una empresa de Scribd logo
1 de 30
Descargar para leer sin conexión
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
What’s new with Amazon Redshift
Dennis J. Waldron
Principal Business Development Manager, Amazon Redshift
AWS
A D B 2 0 3
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS databases and analytics
Broad and deep portfolio, built for builders
AWS Marketplace
Amazon Redshift
Data warehousing
Amazon EMR
Hadoop + Spark
Amazon Athena
Interactive analytics
Amazon Kinesis
Data Analytics Real-
time
Amazon Elasticsearch
Service
Operational Analytics
Amazon RDS
MySQL, PostgreSQL, MariaDB,
Oracle, SQL Server
Amazon Aurora
MySQL, PostgreSQL
Amazon
QuickSight
Amazon
SageMaker
Amazon
DynamoDB
Key value, Document
Amazon ElastiCache
Redis, Memcached
Amazon Neptune
Graph
Amazon
Timestream
Time Series
Amazon
QLDB
Ledger Database
Amazon S3 Glacier
AWS Glue
ETL & Data Catalog
AWS Lake Formation
Data lakes
AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose
Data Movement
AnalyticsDatabases
Business Intelligence & Machine Learning
Data Lake
Amazon
Managed
Blockchain
Blockchain
Templates
Blockchain
Amazon
Comprehend
Amazon
Rekognition
Amazon
Lex
Amazon
Transcribe
AWS
DeepLens 250+ solutions
730+ Database
solutions
600+ Analytics
solutions
25+ Blockchain
solutions
20+ Data lake
solutions
30+ Solutions
RDS on VMWare
Amazon Kinesis Data Streams | AWS Data Pipeline | AWS Direct Connect
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
process more than 2 Exabytes of data
Most popular Fastest
More than 15K
customers
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS received a score of 5/5 (highest score possible) in the market presence category, use cases, ability to execute, and
road map criteria.
AWSrecognizedasaleaderindatawarehousingandanalyticsbyForresterandGartner
Gartner Magic Quadrant for Data Management
Solutions for Analytics, 2018Forrester Wave™ Cloud Data Warehouse Q4 2018
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Based on the cloud DWH benchmark derived from TPS-DS 30 TB dataset, 4-node cluster
Redshift Vendor 1 Vendor 2
Queries Per Hour
(Higher is better)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Fastest Most cost-effective
up to 75%
Use RI to reduce the price up to 75% with Reserved Instances (RIs)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Fastest Most cost-effective Integrates with your data lake
Amazon
Redshift
Amazon
Redshift
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
What’s new with Amazon Redshift?
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Unload
to Parquet
Amazon Redshift: Newly launched features
Speed
Scale
Dynamic WLM
ConcurrencySimplicity
AWS Lake
Formation
integration
Security
Auto-Vacuum
& Auto-
Analyze
Auto Data
Distribution
Deferred
Maintenance
Snapshot
Scheduler
Spectrum
Request
Accelerator
10x average
performance
improvement
Elastic resize
Concurrency
Scaling
Improving short
query
acceleration
Stored
procedures
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Concurrency Scaling
Backup
Amazon Redshift automatically adds transient clusters,
in seconds, to serve sudden spikes in concurrent requests with consistently
fast performance. No hydration required.
Caching Layer
How it works:
All queries go to the leader node;
user only sees less wait
for queries.
When queries in designated WLM
queue begin queuing, Amazon
Redshift automatically routes them
to the new clusters, enabling
Concurrency Scaling automatically.
Amazon Redshift automatically spins
up a new cluster, processes waiting
queries and automatically shuts
down the Concurrency Scaling
cluster.
1
2
3
For every 24 hours that your main
cluster is in use, you accrue a one-
hour credit for Concurrency
Scaling. This means that
Concurrency Scaling is free for >
97% of customers.
New!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Enabling Concurrency Scaling
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
For every 24 hours your main cluster
is in use, we’ll provide a one-hour
credit for concurrent cluster usage
97% of users will never see a
charge for auto-scale resources
0
2000
4000
6000
8000
10000
12000
5 40 80 120 150 180
QueriesperHour(QpH)
Number of concurrently active users
Throughput scales linearly
Amazon Redshift’s throughput scales linearly with
concurrent users
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Concurrency Scaling is a powerful new feature of
Amazon Redshift. We were really impressed by the
performance of the feature, especially with its
ability to instantly add transient capacity with
nothing to manage on our end. As Redshift
administrators at Yelp, we think that Concurrency
Scaling will keep our many users happy, even under
peak load. We’re excited that Concurrency Scaling
provides the flexibility to handle significant variance
in our workloads over the course of a day.
Shahid Chohan
Software Engineer, Yelp
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift elastic resize
New!
Amazon Redshift
Cluster
Amazon Redshift
Managed S3
JDBC/ODBC
1
2
3
Leader Node
Backup
How it works:
Amazon Redshift updates the snapshot on
S3 with the most recent data.
New nodes are added (for scaling up) or
removed (for scaling down) during this
period.
The cluster is fully available for read and
write queries. Queries that were being held
are queued for execution automatically.
1
2
3
Scale your Amazon Redshift
clusters up and down in
minutes to get the optimal
performance.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Ease of use
• Auto Analyze (GA): Automatically collects table statistics to deliver enhanced query
performance
• Auto data distribution (GA): Automatically selects table distribution style based on table
size
• Auto Vacuum Delete (GA): Automatically re-sorts and reclaims space from deleted rows,
improving performance and space utilization
• Snapshot scheduler enhancements (GA): Provides more control over automated snapshot
schedule and allows setting snapshot expiration date, and bulk removal of expired manual snapshots
• Stored procedures in Amazon Redshift (GA): Makes migration to Amazon Redshift easier;
you can bring your existing stored procedures
New!
New!
New!
New!
New!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
New!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift: Auto Vacuum
• Amazon Redshift automatically runs the
VACUUM DELETE operation to reclaim
disk space occupied by rows that were
marked for deletion by previous UPDATE
and DELETE operations during idle
periods.
• It defragments the tables to free up
consumed space and improves
performance for your workloads.
Vacuum Delete
Less storage used,
higher performance
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift: Auto Vacuum
https://twitter.com/esh/status/1076239047813545984
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
stored procedures
New!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift integrates
seamlessly with your data
lake
DATE data type
Retrieving metadata for late-binding views
Support for Enhanced VPC Routing
IN-list predicate processing in
Spectrum scans
Query external tables during
a resize operation
Specify the root of an S3
bucket as the source for
an existing table
Spectrum queries with aggregations on
partition columns
Renaming external
table columns
Table property to specify the file compression
type for external tables
Push the LENGTH()
string function to
Spectrum
ALTER TABLE ADD/DROP COLUMN for
external tables is now supported via
standard JDBC calls
Map datatypes in
Spectrum to contain
arrays
Support for Parquet, ORC, Avro, CSV, and
other open file formats
New Spectrum
regions
Spectrum support for
JSON and ION
Spectrum support for
nested data
Arrays of arrays and arrays
of maps
S U M M I T
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift Spectrum
Amazon Redshift Spectrum
query engine
Query across Amazon
Redshift and S3
Amazon
Redshift data
S3
data lake
Extend the data warehouse to exabytes of data in Amazon S3 data lake
No data loading required
Scale compute and storage separately
Directly query data stored in Amazon S3
Parquet, ORC, Avro, JSON, and CSV data formats
Spectrum Request Accelerator
→ Unload to Parquet
Coming
Soon!
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon Redshift is scalable
Amazon Redshift Spectrum: Exabyte data lake query in
under three minutes
Compression
Columnar file format
Scanning with 2,500 nodes
Static partition elimination
Dynamic partition elimination
Amazon Redshift query optimizer
* Query used a 20 node DC1.8XLarge Amazon Redshift cluster
* Not actual sales data—generated for this demo based on data format used by Amazon Retail.
Imagine you are the manager at a Seattle bookstore. An
author released her 8th book in a popular series, and you
need to figure out how many copies to order.
Amazon S3
Amazon Redshift Spectrum
<3 minutes
5X
10X
2,500X
2X
350X
40X
Roughly 140 terabytes of customer item
order detail records for each day over
the past 20 years
190 million files across 15,000 partitions
in S3
One partition per day for USA and rest of
world
Total data size is over an exabyte
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
The power of data lakes
Most ways to bring data in
Terabyte–exabyte scale
Security
compliance, and audit capabilities
Run any analytics
on the same data without movement
Scale
storage and compute independently
Designed for low-cost
storage and analytics
Amazon
Redshift
EMR Athena
AI services
ElasticsearchKinesis
Snowball
Kinesis
Video Streams
Kinesis
Data
Streams
Kinesis
Data Firehose
Snowmobile
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Integration with AWS Lake Formation Coming Soon!
KinesisSocial Web
Sensors Devices
LOBCRM
ERPOLTP
IAM AWS
KMS
Data
Catalog
Amazon
Athena
Amazon EMR
Amazon Elasticsearch Service
AI services
Amazon
QuickSight
Amazon
Redshift
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Built-in security, w/o extra cost
Compliance certifications
10 GigE (HPC)
Customer
VPC
Internal
VPC
JDBC/ODBC
Compute
Nodes
Leader
Node
End-to-end encryption
Integration with AWS Key
Management Service
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Unload Amazon Redshift data as
Parquet to Amazon S3
Amazon Redshift now
supports exporting data
to Amazon S3 in Parquet
format. This makes
sharing data across the
data lake easier and
faster, without
conversion.
supported by
Amazon EMR, Amazon
Athena,
and Amazon Redshift.
Amazon Redshift Unload command
now supports Parquet format. This
allows data in Amazon Redshift to be
exported as Parquet to be processed
by Amazon EMR or Amazon Athena
without any data conversion.
The feature is in preview now and
GA in Q3 ’19.
Amazon EMR
Amazon
Redshift
Amazon
Athena
Amazon S3
AWS Glue
Private
Preview
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Focus areas for 2019…
• Concurrency, elasticity, performance, zero tuning
Amazon Redshift performance & out-of-box performance improvement
Automate remaining tuning knobs
• Ease of use, zero admin, migration
Simplify console, operation, management
• Data lake
More data lake integration for security and access control
Leverage the scale of data lake and different processing engines
• Security, availability
Azure AD integration, more flexible access control
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Dennis J. Waldron
walddenn@amazon.com

Más contenido relacionado

La actualidad más candente

Resiliency-and-Availability-Design-Patterns-for-the-Cloud
Resiliency-and-Availability-Design-Patterns-for-the-CloudResiliency-and-Availability-Design-Patterns-for-the-Cloud
Resiliency-and-Availability-Design-Patterns-for-the-CloudAmazon Web Services
 
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 Series
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 SeriesAmazon Aurora Relational Database Built for the AWS Cloud, Version 1 Series
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 SeriesDataLeader.io
 
Innovation-at-Hyper-scale-Outlook-on-Emerging-Technologies
Innovation-at-Hyper-scale-Outlook-on-Emerging-TechnologiesInnovation-at-Hyper-scale-Outlook-on-Emerging-Technologies
Innovation-at-Hyper-scale-Outlook-on-Emerging-TechnologiesAmazon Web Services
 
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Amazon Web Services
 
Building a global database with MongoDB Atlas - DEM16-S - New York AWS Summit
Building a global database with MongoDB Atlas - DEM16-S - New York AWS SummitBuilding a global database with MongoDB Atlas - DEM16-S - New York AWS Summit
Building a global database with MongoDB Atlas - DEM16-S - New York AWS SummitAmazon Web Services
 
Cloud_Data_Management_with_Veeam_and_AWS
Cloud_Data_Management_with_Veeam_and_AWSCloud_Data_Management_with_Veeam_and_AWS
Cloud_Data_Management_with_Veeam_and_AWSAmazon Web Services
 
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018Amazon Web Services
 
Make your data move: Best practices for migrating data to AWS - STG201 - New ...
Make your data move: Best practices for migrating data to AWS - STG201 - New ...Make your data move: Best practices for migrating data to AWS - STG201 - New ...
Make your data move: Best practices for migrating data to AWS - STG201 - New ...Amazon Web Services
 
From raw data to business insights. A modern data lake
From raw data to business insights. A modern data lakeFrom raw data to business insights. A modern data lake
From raw data to business insights. A modern data lakejavier ramirez
 
How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019Randall Hunt
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019javier ramirez
 
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksIntroducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksAmazon Web Services
 
What would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSWhat would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSAmazon Web Services
 
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS Summit
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS SummitScalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS Summit
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS SummitAmazon Web Services
 
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...Amazon Web Services
 
Databases - Choosing the right Database on AWS
Databases - Choosing the right Database on AWSDatabases - Choosing the right Database on AWS
Databases - Choosing the right Database on AWSAmazon Web Services
 
Design, Deploy, and Optimize Microsoft SQL Server on AWS
Design, Deploy, and Optimize Microsoft SQL Server on AWSDesign, Deploy, and Optimize Microsoft SQL Server on AWS
Design, Deploy, and Optimize Microsoft SQL Server on AWSAmazon Web Services
 
What’s new in Amazon RDS - ADB207 - Chicago AWS Summit
What’s new in Amazon RDS - ADB207 - Chicago AWS SummitWhat’s new in Amazon RDS - ADB207 - Chicago AWS Summit
What’s new in Amazon RDS - ADB207 - Chicago AWS SummitAmazon Web Services
 

La actualidad más candente (20)

EC2_and_VPC_workshop
EC2_and_VPC_workshopEC2_and_VPC_workshop
EC2_and_VPC_workshop
 
Resiliency-and-Availability-Design-Patterns-for-the-Cloud
Resiliency-and-Availability-Design-Patterns-for-the-CloudResiliency-and-Availability-Design-Patterns-for-the-Cloud
Resiliency-and-Availability-Design-Patterns-for-the-Cloud
 
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 Series
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 SeriesAmazon Aurora Relational Database Built for the AWS Cloud, Version 1 Series
Amazon Aurora Relational Database Built for the AWS Cloud, Version 1 Series
 
Innovation-at-Hyper-scale-Outlook-on-Emerging-Technologies
Innovation-at-Hyper-scale-Outlook-on-Emerging-TechnologiesInnovation-at-Hyper-scale-Outlook-on-Emerging-Technologies
Innovation-at-Hyper-scale-Outlook-on-Emerging-Technologies
 
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
Accelerate Analytics at Scale with Amazon EMR - AWS Summit Sydney 2018
 
Building a global database with MongoDB Atlas - DEM16-S - New York AWS Summit
Building a global database with MongoDB Atlas - DEM16-S - New York AWS SummitBuilding a global database with MongoDB Atlas - DEM16-S - New York AWS Summit
Building a global database with MongoDB Atlas - DEM16-S - New York AWS Summit
 
Cloud_Data_Management_with_Veeam_and_AWS
Cloud_Data_Management_with_Veeam_and_AWSCloud_Data_Management_with_Veeam_and_AWS
Cloud_Data_Management_with_Veeam_and_AWS
 
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018
What's New in Amazon Aurora (DAT204-R1) - AWS re:Invent 2018
 
Make your data move: Best practices for migrating data to AWS - STG201 - New ...
Make your data move: Best practices for migrating data to AWS - STG201 - New ...Make your data move: Best practices for migrating data to AWS - STG201 - New ...
Make your data move: Best practices for migrating data to AWS - STG201 - New ...
 
From raw data to business insights. A modern data lake
From raw data to business insights. A modern data lakeFrom raw data to business insights. A modern data lake
From raw data to business insights. A modern data lake
 
How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019How to Choose The Right Database on AWS - Berlin Summit - 2019
How to Choose The Right Database on AWS - Berlin Summit - 2019
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
 
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksIntroducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
 
What would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSWhat would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWS
 
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS Summit
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS SummitScalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS Summit
Scalable, secure log analytics with Amazon ES - ADB302 - Chicago AWS Summit
 
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...
What’s new with Amazon Redshift, featuring ZS Associates - ADB205 - Chicago A...
 
Databases - Choosing the right Database on AWS
Databases - Choosing the right Database on AWSDatabases - Choosing the right Database on AWS
Databases - Choosing the right Database on AWS
 
Design, Deploy, and Optimize Microsoft SQL Server on AWS
Design, Deploy, and Optimize Microsoft SQL Server on AWSDesign, Deploy, and Optimize Microsoft SQL Server on AWS
Design, Deploy, and Optimize Microsoft SQL Server on AWS
 
Oracle on AWS
Oracle on AWSOracle on AWS
Oracle on AWS
 
What’s new in Amazon RDS - ADB207 - Chicago AWS Summit
What’s new in Amazon RDS - ADB207 - Chicago AWS SummitWhat’s new in Amazon RDS - ADB207 - Chicago AWS Summit
What’s new in Amazon RDS - ADB207 - Chicago AWS Summit
 

Similar a What's new with Amazon Redshift - ADB203 - New York AWS Summit

What's New with Amazon Redshift - ADB202 - Anaheim AWS Summit
What's New with Amazon Redshift - ADB202 - Anaheim AWS SummitWhat's New with Amazon Redshift - ADB202 - Anaheim AWS Summit
What's New with Amazon Redshift - ADB202 - Anaheim AWS SummitAmazon Web Services
 
Data Warehousing in the Cloud - AWS Summit Sydney
Data Warehousing in the Cloud - AWS Summit SydneyData Warehousing in the Cloud - AWS Summit Sydney
Data Warehousing in the Cloud - AWS Summit SydneyAmazon Web Services
 
Big Data@Scale_AWSPSSummit_Singapore
Big Data@Scale_AWSPSSummit_SingaporeBig Data@Scale_AWSPSSummit_Singapore
Big Data@Scale_AWSPSSummit_SingaporeAmazon Web Services
 
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit SydneyMigrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit SydneyAmazon Web Services
 
Database Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitDatabase Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitAmazon Web Services
 
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftBuilding a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftAmazon Web Services
 
Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Amazon Web Services
 
Enterprise-Database-Migration-Strategies-and-Options-on-AWS
Enterprise-Database-Migration-Strategies-and-Options-on-AWSEnterprise-Database-Migration-Strategies-and-Options-on-AWS
Enterprise-Database-Migration-Strategies-and-Options-on-AWSAmazon Web Services
 
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...Amazon Web Services
 
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...Amazon Web Services
 
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWS
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWSAmazon Aurora, funzionalità e best practice per la migrazione di database su AWS
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWSAmazon Web Services
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSAmazon Web Services
 
Best Practices for Migrating Databases to the Cloud - AWS Summit Sydney
Best Practices for Migrating Databases to the Cloud - AWS Summit SydneyBest Practices for Migrating Databases to the Cloud - AWS Summit Sydney
Best Practices for Migrating Databases to the Cloud - AWS Summit SydneyAmazon Web Services
 
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Amazon Web Services
 
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon RedshiftBDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon RedshiftAmazon Web Services
 
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...Amazon Web Services
 
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioBursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioAlluxio, Inc.
 
Using Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDUsing Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDAmazon Web Services
 

Similar a What's new with Amazon Redshift - ADB203 - New York AWS Summit (20)

What's New with Amazon Redshift - ADB202 - Anaheim AWS Summit
What's New with Amazon Redshift - ADB202 - Anaheim AWS SummitWhat's New with Amazon Redshift - ADB202 - Anaheim AWS Summit
What's New with Amazon Redshift - ADB202 - Anaheim AWS Summit
 
Data Warehousing in the Cloud - AWS Summit Sydney
Data Warehousing in the Cloud - AWS Summit SydneyData Warehousing in the Cloud - AWS Summit Sydney
Data Warehousing in the Cloud - AWS Summit Sydney
 
Big Data@Scale_AWSPSSummit_Singapore
Big Data@Scale_AWSPSSummit_SingaporeBig Data@Scale_AWSPSSummit_Singapore
Big Data@Scale_AWSPSSummit_Singapore
 
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit SydneyMigrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
 
Database Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS SummitDatabase Freedom - ADB304 - Santa Clara AWS Summit
Database Freedom - ADB304 - Santa Clara AWS Summit
 
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon RedshiftBuilding a Modern Data Warehouse - Deep Dive on Amazon Redshift
Building a Modern Data Warehouse - Deep Dive on Amazon Redshift
 
Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]
 
Enterprise-Database-Migration-Strategies-and-Options-on-AWS
Enterprise-Database-Migration-Strategies-and-Options-on-AWSEnterprise-Database-Migration-Strategies-and-Options-on-AWS
Enterprise-Database-Migration-Strategies-and-Options-on-AWS
 
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...
Best Practices for Migrating Oracle Databases to the Cloud - AWS Online Tech ...
 
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20...
 
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWS
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWSAmazon Aurora, funzionalità e best practice per la migrazione di database su AWS
Amazon Aurora, funzionalità e best practice per la migrazione di database su AWS
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDS
 
Best Practices for Migrating Databases to the Cloud - AWS Summit Sydney
Best Practices for Migrating Databases to the Cloud - AWS Summit SydneyBest Practices for Migrating Databases to the Cloud - AWS Summit Sydney
Best Practices for Migrating Databases to the Cloud - AWS Summit Sydney
 
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
 
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon RedshiftBDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
 
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...
Amazon Redshift Update and How Equinox Fitness Clubs Migrated to a Modern Dat...
 
Managed Relational Databases
Managed Relational DatabasesManaged Relational Databases
Managed Relational Databases
 
Big Data@Scale
 Big Data@Scale Big Data@Scale
Big Data@Scale
 
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using AlluxioBursting on-premise analytic workloads to Amazon EMR using Alluxio
Bursting on-premise analytic workloads to Amazon EMR using Alluxio
 
Using Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMDUsing Tableau and AWS for Fearless Reporting at UMD
Using Tableau and AWS for Fearless Reporting at UMD
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

What's new with Amazon Redshift - ADB203 - New York AWS Summit

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T What’s new with Amazon Redshift Dennis J. Waldron Principal Business Development Manager, Amazon Redshift AWS A D B 2 0 3
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS databases and analytics Broad and deep portfolio, built for builders AWS Marketplace Amazon Redshift Data warehousing Amazon EMR Hadoop + Spark Amazon Athena Interactive analytics Amazon Kinesis Data Analytics Real- time Amazon Elasticsearch Service Operational Analytics Amazon RDS MySQL, PostgreSQL, MariaDB, Oracle, SQL Server Amazon Aurora MySQL, PostgreSQL Amazon QuickSight Amazon SageMaker Amazon DynamoDB Key value, Document Amazon ElastiCache Redis, Memcached Amazon Neptune Graph Amazon Timestream Time Series Amazon QLDB Ledger Database Amazon S3 Glacier AWS Glue ETL & Data Catalog AWS Lake Formation Data lakes AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose Data Movement AnalyticsDatabases Business Intelligence & Machine Learning Data Lake Amazon Managed Blockchain Blockchain Templates Blockchain Amazon Comprehend Amazon Rekognition Amazon Lex Amazon Transcribe AWS DeepLens 250+ solutions 730+ Database solutions 600+ Analytics solutions 25+ Blockchain solutions 20+ Data lake solutions 30+ Solutions RDS on VMWare Amazon Kinesis Data Streams | AWS Data Pipeline | AWS Direct Connect
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T process more than 2 Exabytes of data Most popular Fastest More than 15K customers
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS received a score of 5/5 (highest score possible) in the market presence category, use cases, ability to execute, and road map criteria. AWSrecognizedasaleaderindatawarehousingandanalyticsbyForresterandGartner Gartner Magic Quadrant for Data Management Solutions for Analytics, 2018Forrester Wave™ Cloud Data Warehouse Q4 2018
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Based on the cloud DWH benchmark derived from TPS-DS 30 TB dataset, 4-node cluster Redshift Vendor 1 Vendor 2 Queries Per Hour (Higher is better)
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Fastest Most cost-effective up to 75% Use RI to reduce the price up to 75% with Reserved Instances (RIs)
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Fastest Most cost-effective Integrates with your data lake Amazon Redshift Amazon Redshift
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T What’s new with Amazon Redshift?
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Unload to Parquet Amazon Redshift: Newly launched features Speed Scale Dynamic WLM ConcurrencySimplicity AWS Lake Formation integration Security Auto-Vacuum & Auto- Analyze Auto Data Distribution Deferred Maintenance Snapshot Scheduler Spectrum Request Accelerator 10x average performance improvement Elastic resize Concurrency Scaling Improving short query acceleration Stored procedures
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Concurrency Scaling Backup Amazon Redshift automatically adds transient clusters, in seconds, to serve sudden spikes in concurrent requests with consistently fast performance. No hydration required. Caching Layer How it works: All queries go to the leader node; user only sees less wait for queries. When queries in designated WLM queue begin queuing, Amazon Redshift automatically routes them to the new clusters, enabling Concurrency Scaling automatically. Amazon Redshift automatically spins up a new cluster, processes waiting queries and automatically shuts down the Concurrency Scaling cluster. 1 2 3 For every 24 hours that your main cluster is in use, you accrue a one- hour credit for Concurrency Scaling. This means that Concurrency Scaling is free for > 97% of customers. New!
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Enabling Concurrency Scaling
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T For every 24 hours your main cluster is in use, we’ll provide a one-hour credit for concurrent cluster usage 97% of users will never see a charge for auto-scale resources 0 2000 4000 6000 8000 10000 12000 5 40 80 120 150 180 QueriesperHour(QpH) Number of concurrently active users Throughput scales linearly Amazon Redshift’s throughput scales linearly with concurrent users
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Concurrency Scaling is a powerful new feature of Amazon Redshift. We were really impressed by the performance of the feature, especially with its ability to instantly add transient capacity with nothing to manage on our end. As Redshift administrators at Yelp, we think that Concurrency Scaling will keep our many users happy, even under peak load. We’re excited that Concurrency Scaling provides the flexibility to handle significant variance in our workloads over the course of a day. Shahid Chohan Software Engineer, Yelp © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift elastic resize New! Amazon Redshift Cluster Amazon Redshift Managed S3 JDBC/ODBC 1 2 3 Leader Node Backup How it works: Amazon Redshift updates the snapshot on S3 with the most recent data. New nodes are added (for scaling up) or removed (for scaling down) during this period. The cluster is fully available for read and write queries. Queries that were being held are queued for execution automatically. 1 2 3 Scale your Amazon Redshift clusters up and down in minutes to get the optimal performance.
  • 16. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Ease of use • Auto Analyze (GA): Automatically collects table statistics to deliver enhanced query performance • Auto data distribution (GA): Automatically selects table distribution style based on table size • Auto Vacuum Delete (GA): Automatically re-sorts and reclaims space from deleted rows, improving performance and space utilization • Snapshot scheduler enhancements (GA): Provides more control over automated snapshot schedule and allows setting snapshot expiration date, and bulk removal of expired manual snapshots • Stored procedures in Amazon Redshift (GA): Makes migration to Amazon Redshift easier; you can bring your existing stored procedures New! New! New! New! New!
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T New!
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift: Auto Vacuum • Amazon Redshift automatically runs the VACUUM DELETE operation to reclaim disk space occupied by rows that were marked for deletion by previous UPDATE and DELETE operations during idle periods. • It defragments the tables to free up consumed space and improves performance for your workloads. Vacuum Delete Less storage used, higher performance
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift: Auto Vacuum https://twitter.com/esh/status/1076239047813545984
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T stored procedures New!
  • 21. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
  • 22. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift integrates seamlessly with your data lake DATE data type Retrieving metadata for late-binding views Support for Enhanced VPC Routing IN-list predicate processing in Spectrum scans Query external tables during a resize operation Specify the root of an S3 bucket as the source for an existing table Spectrum queries with aggregations on partition columns Renaming external table columns Table property to specify the file compression type for external tables Push the LENGTH() string function to Spectrum ALTER TABLE ADD/DROP COLUMN for external tables is now supported via standard JDBC calls Map datatypes in Spectrum to contain arrays Support for Parquet, ORC, Avro, CSV, and other open file formats New Spectrum regions Spectrum support for JSON and ION Spectrum support for nested data Arrays of arrays and arrays of maps S U M M I T
  • 23. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift Spectrum Amazon Redshift Spectrum query engine Query across Amazon Redshift and S3 Amazon Redshift data S3 data lake Extend the data warehouse to exabytes of data in Amazon S3 data lake No data loading required Scale compute and storage separately Directly query data stored in Amazon S3 Parquet, ORC, Avro, JSON, and CSV data formats Spectrum Request Accelerator → Unload to Parquet Coming Soon!
  • 24. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon Redshift is scalable Amazon Redshift Spectrum: Exabyte data lake query in under three minutes Compression Columnar file format Scanning with 2,500 nodes Static partition elimination Dynamic partition elimination Amazon Redshift query optimizer * Query used a 20 node DC1.8XLarge Amazon Redshift cluster * Not actual sales data—generated for this demo based on data format used by Amazon Retail. Imagine you are the manager at a Seattle bookstore. An author released her 8th book in a popular series, and you need to figure out how many copies to order. Amazon S3 Amazon Redshift Spectrum <3 minutes 5X 10X 2,500X 2X 350X 40X Roughly 140 terabytes of customer item order detail records for each day over the past 20 years 190 million files across 15,000 partitions in S3 One partition per day for USA and rest of world Total data size is over an exabyte
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T The power of data lakes Most ways to bring data in Terabyte–exabyte scale Security compliance, and audit capabilities Run any analytics on the same data without movement Scale storage and compute independently Designed for low-cost storage and analytics Amazon Redshift EMR Athena AI services ElasticsearchKinesis Snowball Kinesis Video Streams Kinesis Data Streams Kinesis Data Firehose Snowmobile
  • 26. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Integration with AWS Lake Formation Coming Soon! KinesisSocial Web Sensors Devices LOBCRM ERPOLTP IAM AWS KMS Data Catalog Amazon Athena Amazon EMR Amazon Elasticsearch Service AI services Amazon QuickSight Amazon Redshift
  • 27. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Built-in security, w/o extra cost Compliance certifications 10 GigE (HPC) Customer VPC Internal VPC JDBC/ODBC Compute Nodes Leader Node End-to-end encryption Integration with AWS Key Management Service
  • 28. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Unload Amazon Redshift data as Parquet to Amazon S3 Amazon Redshift now supports exporting data to Amazon S3 in Parquet format. This makes sharing data across the data lake easier and faster, without conversion. supported by Amazon EMR, Amazon Athena, and Amazon Redshift. Amazon Redshift Unload command now supports Parquet format. This allows data in Amazon Redshift to be exported as Parquet to be processed by Amazon EMR or Amazon Athena without any data conversion. The feature is in preview now and GA in Q3 ’19. Amazon EMR Amazon Redshift Amazon Athena Amazon S3 AWS Glue Private Preview
  • 29. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Focus areas for 2019… • Concurrency, elasticity, performance, zero tuning Amazon Redshift performance & out-of-box performance improvement Automate remaining tuning knobs • Ease of use, zero admin, migration Simplify console, operation, management • Data lake More data lake integration for security and access control Leverage the scale of data lake and different processing engines • Security, availability Azure AD integration, more flexible access control
  • 30. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Dennis J. Waldron walddenn@amazon.com