SlideShare una empresa de Scribd logo
1 de 38
Descargar para leer sin conexión
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Migrate Your On-Premises Data
Warehouse to Amazon Redshift
with AWS DMS and AWS SCT
Shree Kenghe
Solutions Architect
AWS
A N T 3 7 1
Wesley Wilk
Solutions Architect
AWS
Ram Palaniappan
Sr. Director Data Analytics and Insights
TEKsystems
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
• Workshop Introduction
• Lab architecture and environment setup
• Amazon Redshift overview
• AWS Schema Conversion Tool overview
• Migration considerations
• Workshop lab
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workshop Details
• Workshop duration
• Proposed solution and AWS services presentation - 20 minutes
• Hands-on workshop - 1.5 hours
• Workshop team
• Shree Kenghe, AWS Solutions Architect
• Wesley Wilk, AWS Solutions Architect
• Arun Kannan, Partner Solutions Architect
• Ram Palaniappan, Sr. Director Data Analytics and Insights, TEKSystems
• Requirements and expectations
• Students use their own AWS accounts to run the lab with IAM admin permissions
• Basic knowledge of AWS services (Amazon RDS, Amazon Simple Storage Service (Amazon S3), AWS
Database Migration Service, AWS SCT, Amazon Redshift)
• Comfortable working on the AWS console and configure AWS services
• Working knowledge of relational databases (Oracle)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Related sessions
Monday, November 26th Builder’s Session
Modernize Your Data Warehouse with Amazon Redshift
11:30am PST | Aria
Tuesday, November 27th Chalk Talk
Migrate from Teradata to Amazon Redshift: Best Practices with McDonald's
9:15am PST | Aria
Thursday, November 29th Chalk Talk
Migrate from Netezza to Amazon Redshift Best Practices with Financial Engines
11:30am PST | MGM
Friday, November 30th Chalk Talk
Migrating Workloads from Oracle to Amazon Redshift: Best Practices with Pfizer
11:30am PST | Mirage
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Lab Architecture
Availability Zone
DB Subnet (Public)
Internet gateway
Amazon RDS
Oracle
(Source)
Amazon
Redshift (Target)
AWS Management
Console
Internet
SQL client and AWS SCT
on
EC2 Instance
AWS SCT
Extraction
Agents
Schema
Amazon
S3
Data
Data
Data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Lab Setup and Environment
• Download zip file: https://bit.ly/2TkGyiu
• AWS CloudFormation template
• Lab guide
• SQL file
• PowerPoint presentation
• Policy file
• In AWS Management Console choose AWS Region eu-west-1 (Ireland)
• Follow lab guide Step 1 - Launch AWS CloudFormation template
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
10x faster at 1/10th the cost
Fast
Delivers fast results for all
types of workloads
Simple
Create and start using a data
warehouse in minutes
Cost-effective
No upfront costs, start small,
and pay as you go
Scalable
Gigabytes to petabytes
to exabytes
Integrated
Integrated with Amazon S3 data lakes,
AWS services and third-party tools
Secure
Audit everything; encrypt
data end-to-end; extensive
certification and compliance
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift Architecture
• Leader Node
• SQL endpoint, JDBC/ODBC
• Stores metadata
• Coordinates query execution
• Compute Nodes
• Local, columnar storage
• Execute queries in parallel
• Load, backup, restore via Amazon S3
• Load from Amazon DynamoDB or SSH
• Fault Tolerant
• Two hardware platforms
• Optimized for data processing
• DS2: HDD; scale from 2TB to 2PB
• DC2: SSD; scale from 160GB to 326TB
10 GigE
(HPC)
Ingestion
Backup
Restore
JDBC/ODBC
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
Columnar Storage
Row Storage
Columnar Storage
Customer ID Name Age Address Phone
10987823 SALAZAR 88 899 FIRST ST 919-555-0100
89287899 STILES 37 16137 MAIN ST 312-555-0187
31856293 MAJOR 12 42 JUNE ST 704-555-0142
109878823|SALAZAR|88|899 FIRST ST|919-555-0100 89287899|STILES|37|16137 MAIN ST|312-555-0187 31856293|MAJOR|12|42 JUNE ST|704-555-0142
Customer ID Name Age Address Phone
10987823 SALAZAR 88 899 FIRST ST 919-555-0100
89287899 STILES 37 16137 MAIN ST 312-555-0187
31856293 MAJOR 12 42 JUNE ST 704-555-0142
109878823 | 89287899 | 31856293 | 39867232 | 27467298 | 87234892 | 23987278 | 41098739 | 87290312 | 50729812 | 21989734 | 28730912 | 90376879 | 28743448
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
Z o n e M a p s a n d S o r t K e y s
• Track the minimum and
maximum value for each block
• Skip over blocks that don’t
contain relevant data
• Single Column
• Compound
• Interleaved
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
S o r t K e y s – S i n g l e C o l u m n
• Best for Queries that use 1st column as primary
filter
• Can Speed up joins and group bys
• Quickest to VACUUM
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
S o r t K e y s – C o m p o u n d
• Table is sorted by 1st column, then 2nd, etc
• Best for Queries that use 1st column as primary filter, then others
• Can Speed up joins and group bys
• Slower to VACUUM
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
S o r t K e y s - I n t e r l e a v e d
• Equal weight is given to each column
• Best for queries that use different columns in filter
• Queries get faster the more columns used in the filter
• Slowest to VACUUM
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
D i s t r i b u t i o n K e y s
• EVEN
• Tables with no joins or group by
• Small Dimension tables (<1000)
• KEY
• Large Fact tables
• Large Dimension tables
• ALL
• Medium Dimension tables (1K-2M)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
D i s t r i b u t i o n K e y s - E v e n
ID Gender Name
101 M Carlos Salazar
292 F Li Juan
139 M John Stiles
446 M Arnav Desai
658 F Mary Major
164 M Mateo Jackson
209 M Zhang Wei
306 F Wang Xiulan
ID Gender Name
101 M Carlos Salazar
306 F Wang Xiulan
ID Gender Name
292 F Li Juan
209 M Zhang Wei
ID Gender Name
139 M John Stiles
164 M Mateo Jackson
ID Gender Name
446 M Arnav Desai
658 F Mary Major
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
D i s t r i b u t i o n K e y s - E v e n
ID Gender Name
101 M Carlos Salazar
292 F Li Juan
139 M John Stiles
446 M Arnav Desai
658 F Mary Major
164 M Mateo Jackson
209 M Zhang Wei
306 F Wang Xiulan
ID Gender Name
101 M Carlos Salazar
306 F Wang Xiulan
ID Gender Name
292 F Li Juan
209 M Zhang Wei
ID Gender Name
139 M John Stiles
164 M Mateo Jackson
ID Gender Name
446 M Arnav Desai
658 F Mary Major
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift
B e s t P r a c t i c e s & P o i n t e r s
• Maximize Load Performance
• COPY multiple files
• COPY to multiple nodes
• Compress source data
• Use a manifest file
• Amazon Redshift does not enforce primary key
constraints
• If you load data multiple times, Amazon Redshift will not complain
• After Loading
• Data all added at end of columns for speed
• Fully functional, but not set for optimum performance
• VACUUM command
• Massages data to optimum disk organization for performance
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What are AWS Database Migration Service and AWS
SCT?
AWS DMS easily and securely migrates and/or
replicate your databases and data warehouses
to AWS
AWS SCT converts your commercial database and
data warehouse schemas to open-source engines
or AWS-native services, such as Amazon Aurora
and Amazon Redshift
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
When to use AWS SCT?
Modernize
Modernize your database
tier
Modernize and Migrate your
Data Warehouse to Amazon
Redshift
Amazon Aurora
Amazon Redshift
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS SCT helps with converting tables, views, and
code Sequences
User-defined types
Synonyms
Packages
Stored procedures
Functions
Triggers
Schemas
Tables
Indexes
Views
Sort and
distribution keys
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS SCT Migration Assessment Report
•Assessment of migration compatibility of
source databases with open-source
database engines – Amazon RDS MySQL,
Amazon RDS PostgreSQL and Amazon
Aurora
•Recommends best target engine
•Provides details level of efforts to
complete migration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS SCT Data Extractors
Extract Data from your data warehouse and migrate
to Amazon Redshift
•Extracts data through local migration agents
•Data is optimized for Amazon Redshift and saved in local files
•Files are loaded to an Amazon S3 bucket (through network or AWS Snowball)
and then to Amazon Redshift
Amazon RedshiftAWS SCT
S3 Bucket
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS DMS + Snowball
Common use cases
• Migrate large databases (over 5TB)
• Migrate many databases at once
• Migrate over slow network
• Push vs. Pull
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Migration Objectives
• Protect current
investment
• Reduce migration
cost
• Less disruption
• Successful user
adoption
• Dynamic Scaling
• Better governance
• Business Agility
• Optimize performance
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Considerations for Migration
One Time: AWS SCT, AWS DMS Incremental: Informatica, Attunity, Talend,
Alooma, Oracle DI
Data Movement – Source to DW
Amazon S3, Amazon Redshift, Amazon DynamoDBStar Schema & modeling
AWS Data Pipeline, AWS Glue, Informatica, TalendTransformation Logic
AWS Lambda function, ELT toolsAggregates, Snapshots
SchedulingData refresh
TokenizationData Security.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
TEKsystems Approach
 Discover & Profile Current
Datawarehouse - Tools:
AWS SCT
 Assess & Capture Metadata
- Tools: AWS SCT
Define
Design
Move Data Optimize
Move Process
 Establish Reference
Architecture - TEKsystems
 Schema Generation - Tools:
AWS SCT
 ETL Mapping migration plan
– TEKsystems Lineage
 Initial data migration
– AWS SCT DE, AWS
DMS
 Data Validation –
TEKsystems DW
Scanner
 Segregate ETL jobs –
TEKsystems Lineage
 Retrofit or migrate to
AWS ecosystems –
TEKsystems Genie, Glue
Converter
 Tune for cloud
native functions and
features –
TEKsystems
Optimizer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Technology
AWS SCT
AWS Lambda
AWS Glue
Amazon Athena
Amazon S3
Amazon Redshift
Vision :
 Build a global view of franchise operations
 Modernize the existing data platform that can be agile and scale
 Drive more self-service reporting for the sales to get better insights
Solution Delivered
 Data engineering using AWS Lambda, AWS Glue, AWS Step Functions and Amazon Athena
 Data warehousing using Amazon Redshift
 Data lake with Amazon S3 and Amazon Athena
 Analytics with Tableau Server on Amazon Elastic Compute Cloud (Amazon EC2) – in a load-
balanced secure configuration using AWS WAF
Benefits
Single canonical view of all of the
franchise data
Reduced the overall licensing cost and
footprint on Oracle, SQL Server workloads
Success Story
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Learnings
 Pilot the migration with migrating incremental data periodically
 Create design patterns based on data type, volume, frequency
 Leverage automation approach to convert transformation and
aggregation ETLs to cloud native (AWS Glue)
 Keep your target data model same to retrofit current reporting structure
 Lot to gain when you go cloud native and tune your Amazon Redshift
after migration
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Experience the power of real partnership. TEKsystems.com
35
At a glance
As an industry leader in Full-Stack Technology Services,
Talent Services and real-world application, we work with
progressive leaders to drive change.
We’re partners in transformation
$650 million
annual revenue
TEKsystems Global Services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
https://bit.ly/2TkGyiu
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Shree Kenghe
Solutions Architect
AWS
Wesley Wilk
Solutions Architect
AWS
Ram Palaniappan
Sr. Director Data Analytics and Insights
TEKsystems
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Más contenido relacionado

Más de Amazon Web Services

Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 
Come costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSCome costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSAmazon Web Services
 
AWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAmazon Web Services
 
Crea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightCrea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightAmazon Web Services
 
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotCostruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotAmazon Web Services
 
Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Amazon Web Services
 
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?Amazon Web Services
 
Protect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksProtect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksAmazon Web Services
 
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Amazon Web Services
 

Más de Amazon Web Services (20)

Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 
Come costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWSCome costruire un'architettura Serverless nel Cloud AWS
Come costruire un'architettura Serverless nel Cloud AWS
 
AWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei serverAWS Serverless per startup: come innovare senza preoccuparsi dei server
AWS Serverless per startup: come innovare senza preoccuparsi dei server
 
Crea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSightCrea dashboard interattive con Amazon QuickSight
Crea dashboard interattive con Amazon QuickSight
 
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker AutopilotCostruisci modelli di Machine Learning con Amazon SageMaker Autopilot
Costruisci modelli di Machine Learning con Amazon SageMaker Autopilot
 
Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows Migra le tue file shares in cloud con FSx for Windows
Migra le tue file shares in cloud con FSx for Windows
 
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
La tua organizzazione è pronta per adottare una strategia di cloud ibrido?
 
Protect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced AttacksProtect your applications from DDoS/BOT & Advanced Attacks
Protect your applications from DDoS/BOT & Advanced Attacks
 
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
Track 6 Session 6_ 透過 AWS AI 服務模擬、部署機器人於產業之應用
 

Migrate Your On-Premises Data Warehouse to Amazon Redshift with AWS DMS and AWS SCT (ANT371) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Migrate Your On-Premises Data Warehouse to Amazon Redshift with AWS DMS and AWS SCT Shree Kenghe Solutions Architect AWS A N T 3 7 1 Wesley Wilk Solutions Architect AWS Ram Palaniappan Sr. Director Data Analytics and Insights TEKsystems
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda • Workshop Introduction • Lab architecture and environment setup • Amazon Redshift overview • AWS Schema Conversion Tool overview • Migration considerations • Workshop lab
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workshop Details • Workshop duration • Proposed solution and AWS services presentation - 20 minutes • Hands-on workshop - 1.5 hours • Workshop team • Shree Kenghe, AWS Solutions Architect • Wesley Wilk, AWS Solutions Architect • Arun Kannan, Partner Solutions Architect • Ram Palaniappan, Sr. Director Data Analytics and Insights, TEKSystems • Requirements and expectations • Students use their own AWS accounts to run the lab with IAM admin permissions • Basic knowledge of AWS services (Amazon RDS, Amazon Simple Storage Service (Amazon S3), AWS Database Migration Service, AWS SCT, Amazon Redshift) • Comfortable working on the AWS console and configure AWS services • Working knowledge of relational databases (Oracle)
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Related sessions Monday, November 26th Builder’s Session Modernize Your Data Warehouse with Amazon Redshift 11:30am PST | Aria Tuesday, November 27th Chalk Talk Migrate from Teradata to Amazon Redshift: Best Practices with McDonald's 9:15am PST | Aria Thursday, November 29th Chalk Talk Migrate from Netezza to Amazon Redshift Best Practices with Financial Engines 11:30am PST | MGM Friday, November 30th Chalk Talk Migrating Workloads from Oracle to Amazon Redshift: Best Practices with Pfizer 11:30am PST | Mirage
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Lab Architecture Availability Zone DB Subnet (Public) Internet gateway Amazon RDS Oracle (Source) Amazon Redshift (Target) AWS Management Console Internet SQL client and AWS SCT on EC2 Instance AWS SCT Extraction Agents Schema Amazon S3 Data Data Data
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Lab Setup and Environment • Download zip file: https://bit.ly/2TkGyiu • AWS CloudFormation template • Lab guide • SQL file • PowerPoint presentation • Policy file • In AWS Management Console choose AWS Region eu-west-1 (Ireland) • Follow lab guide Step 1 - Launch AWS CloudFormation template
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift 10x faster at 1/10th the cost Fast Delivers fast results for all types of workloads Simple Create and start using a data warehouse in minutes Cost-effective No upfront costs, start small, and pay as you go Scalable Gigabytes to petabytes to exabytes Integrated Integrated with Amazon S3 data lakes, AWS services and third-party tools Secure Audit everything; encrypt data end-to-end; extensive certification and compliance
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift Architecture • Leader Node • SQL endpoint, JDBC/ODBC • Stores metadata • Coordinates query execution • Compute Nodes • Local, columnar storage • Execute queries in parallel • Load, backup, restore via Amazon S3 • Load from Amazon DynamoDB or SSH • Fault Tolerant • Two hardware platforms • Optimized for data processing • DS2: HDD; scale from 2TB to 2PB • DC2: SSD; scale from 160GB to 326TB 10 GigE (HPC) Ingestion Backup Restore JDBC/ODBC
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift Columnar Storage Row Storage Columnar Storage Customer ID Name Age Address Phone 10987823 SALAZAR 88 899 FIRST ST 919-555-0100 89287899 STILES 37 16137 MAIN ST 312-555-0187 31856293 MAJOR 12 42 JUNE ST 704-555-0142 109878823|SALAZAR|88|899 FIRST ST|919-555-0100 89287899|STILES|37|16137 MAIN ST|312-555-0187 31856293|MAJOR|12|42 JUNE ST|704-555-0142 Customer ID Name Age Address Phone 10987823 SALAZAR 88 899 FIRST ST 919-555-0100 89287899 STILES 37 16137 MAIN ST 312-555-0187 31856293 MAJOR 12 42 JUNE ST 704-555-0142 109878823 | 89287899 | 31856293 | 39867232 | 27467298 | 87234892 | 23987278 | 41098739 | 87290312 | 50729812 | 21989734 | 28730912 | 90376879 | 28743448
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift Z o n e M a p s a n d S o r t K e y s • Track the minimum and maximum value for each block • Skip over blocks that don’t contain relevant data • Single Column • Compound • Interleaved
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift S o r t K e y s – S i n g l e C o l u m n • Best for Queries that use 1st column as primary filter • Can Speed up joins and group bys • Quickest to VACUUM
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift S o r t K e y s – C o m p o u n d • Table is sorted by 1st column, then 2nd, etc • Best for Queries that use 1st column as primary filter, then others • Can Speed up joins and group bys • Slower to VACUUM
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift S o r t K e y s - I n t e r l e a v e d • Equal weight is given to each column • Best for queries that use different columns in filter • Queries get faster the more columns used in the filter • Slowest to VACUUM
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift D i s t r i b u t i o n K e y s • EVEN • Tables with no joins or group by • Small Dimension tables (<1000) • KEY • Large Fact tables • Large Dimension tables • ALL • Medium Dimension tables (1K-2M)
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift D i s t r i b u t i o n K e y s - E v e n ID Gender Name 101 M Carlos Salazar 292 F Li Juan 139 M John Stiles 446 M Arnav Desai 658 F Mary Major 164 M Mateo Jackson 209 M Zhang Wei 306 F Wang Xiulan ID Gender Name 101 M Carlos Salazar 306 F Wang Xiulan ID Gender Name 292 F Li Juan 209 M Zhang Wei ID Gender Name 139 M John Stiles 164 M Mateo Jackson ID Gender Name 446 M Arnav Desai 658 F Mary Major
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift D i s t r i b u t i o n K e y s - E v e n ID Gender Name 101 M Carlos Salazar 292 F Li Juan 139 M John Stiles 446 M Arnav Desai 658 F Mary Major 164 M Mateo Jackson 209 M Zhang Wei 306 F Wang Xiulan ID Gender Name 101 M Carlos Salazar 306 F Wang Xiulan ID Gender Name 292 F Li Juan 209 M Zhang Wei ID Gender Name 139 M John Stiles 164 M Mateo Jackson ID Gender Name 446 M Arnav Desai 658 F Mary Major
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift B e s t P r a c t i c e s & P o i n t e r s • Maximize Load Performance • COPY multiple files • COPY to multiple nodes • Compress source data • Use a manifest file • Amazon Redshift does not enforce primary key constraints • If you load data multiple times, Amazon Redshift will not complain • After Loading • Data all added at end of columns for speed • Fully functional, but not set for optimum performance • VACUUM command • Massages data to optimum disk organization for performance
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What are AWS Database Migration Service and AWS SCT? AWS DMS easily and securely migrates and/or replicate your databases and data warehouses to AWS AWS SCT converts your commercial database and data warehouse schemas to open-source engines or AWS-native services, such as Amazon Aurora and Amazon Redshift
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. When to use AWS SCT? Modernize Modernize your database tier Modernize and Migrate your Data Warehouse to Amazon Redshift Amazon Aurora Amazon Redshift
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS SCT helps with converting tables, views, and code Sequences User-defined types Synonyms Packages Stored procedures Functions Triggers Schemas Tables Indexes Views Sort and distribution keys
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS SCT Migration Assessment Report •Assessment of migration compatibility of source databases with open-source database engines – Amazon RDS MySQL, Amazon RDS PostgreSQL and Amazon Aurora •Recommends best target engine •Provides details level of efforts to complete migration
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS SCT Data Extractors Extract Data from your data warehouse and migrate to Amazon Redshift •Extracts data through local migration agents •Data is optimized for Amazon Redshift and saved in local files •Files are loaded to an Amazon S3 bucket (through network or AWS Snowball) and then to Amazon Redshift Amazon RedshiftAWS SCT S3 Bucket
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS DMS + Snowball Common use cases • Migrate large databases (over 5TB) • Migrate many databases at once • Migrate over slow network • Push vs. Pull
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Migration Objectives • Protect current investment • Reduce migration cost • Less disruption • Successful user adoption • Dynamic Scaling • Better governance • Business Agility • Optimize performance
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Considerations for Migration One Time: AWS SCT, AWS DMS Incremental: Informatica, Attunity, Talend, Alooma, Oracle DI Data Movement – Source to DW Amazon S3, Amazon Redshift, Amazon DynamoDBStar Schema & modeling AWS Data Pipeline, AWS Glue, Informatica, TalendTransformation Logic AWS Lambda function, ELT toolsAggregates, Snapshots SchedulingData refresh TokenizationData Security.
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. TEKsystems Approach  Discover & Profile Current Datawarehouse - Tools: AWS SCT  Assess & Capture Metadata - Tools: AWS SCT Define Design Move Data Optimize Move Process  Establish Reference Architecture - TEKsystems  Schema Generation - Tools: AWS SCT  ETL Mapping migration plan – TEKsystems Lineage  Initial data migration – AWS SCT DE, AWS DMS  Data Validation – TEKsystems DW Scanner  Segregate ETL jobs – TEKsystems Lineage  Retrofit or migrate to AWS ecosystems – TEKsystems Genie, Glue Converter  Tune for cloud native functions and features – TEKsystems Optimizer
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Technology AWS SCT AWS Lambda AWS Glue Amazon Athena Amazon S3 Amazon Redshift Vision :  Build a global view of franchise operations  Modernize the existing data platform that can be agile and scale  Drive more self-service reporting for the sales to get better insights Solution Delivered  Data engineering using AWS Lambda, AWS Glue, AWS Step Functions and Amazon Athena  Data warehousing using Amazon Redshift  Data lake with Amazon S3 and Amazon Athena  Analytics with Tableau Server on Amazon Elastic Compute Cloud (Amazon EC2) – in a load- balanced secure configuration using AWS WAF Benefits Single canonical view of all of the franchise data Reduced the overall licensing cost and footprint on Oracle, SQL Server workloads Success Story
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Learnings  Pilot the migration with migrating incremental data periodically  Create design patterns based on data type, volume, frequency  Leverage automation approach to convert transformation and aggregation ETLs to cloud native (AWS Glue)  Keep your target data model same to retrofit current reporting structure  Lot to gain when you go cloud native and tune your Amazon Redshift after migration
  • 35. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Experience the power of real partnership. TEKsystems.com 35 At a glance As an industry leader in Full-Stack Technology Services, Talent Services and real-world application, we work with progressive leaders to drive change. We’re partners in transformation $650 million annual revenue TEKsystems Global Services
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. https://bit.ly/2TkGyiu
  • 37. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Shree Kenghe Solutions Architect AWS Wesley Wilk Solutions Architect AWS Ram Palaniappan Sr. Director Data Analytics and Insights TEKsystems
  • 38. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.