SlideShare una empresa de Scribd logo
1 de 59
Descargar para leer sin conexión
November 12, 2014 | Las Vegas, NV 
BDT206See How Amazon Redshift is Powering Business Intelligence in the Enterprise 
Rahul Pathak, Amazon Redshift 
Jason Timmes, Nasdaq 
Kevin Diamond, Hautelook
Amazon 
Redshift 
Amazon Elastic 
MapReduce 
Amazon EC2 
Analyze 
AWS Data 
Pipeline 
Amazon 
Glacier 
Amazon 
DynamoDB 
Store 
AWS Direct 
Connect 
Collect 
Amazon Kinesis 
Amazon 
S3
10 GigE 
(HPC) 
Ingestion 
Backup 
Restore 
JDBC/ODBC
10 GigE 
(HPC) 
Ingestion 
Backup 
Restore 
Customer VPC 
InternalVPC 
JDBC/ODBC
Data Source ET 
Direct 
Connect 
Client 
Forwarder 
State Management Loader 
Amazon Redshift Sandbox 
S3
11 
LEADING INDEX PROVIDER WITH 
41,000+ INDEXES 
ACROSS ASSET CLASSES AND GEOGRAPHIES 
Over 10,000 Corporate Clients in 
60 countries 
Our technology powers over 
70 MARKETPLACES, regulators, CSDs and clearing- houses 
in over 
50 COUNTRIES 
100+ DATA 
PRODUCT OFFERINGS 
supporting 2.5+ million 
investment professionals and users 
IN 98 COUNTRIES 
26Markets 
3 Clearing Houses 
5Central Securities Depositories 
Lists more than 3,500 
companies in 35 countries, representing more than $8.8 trillionin total market value
Our warehouse can be used to analyze market share, client activity, surveillance, power our billing, and more…
•Pay close attention to manifest mandatory flag! 
–Amazon Redshift UNLOAD always sets this to false!!!
•TableIngestStatus 
–We originally put this table in Amazon Redshift itself 
–Turns out Amazon Redshift is not efficient on really small data sets 
–Significantly impacted performance, and increased concurrency contention 
•Solution: Moved TableIngestStatus to a separate transactional RDBMS (MySQL) 
–We were already using a MySQL instance to persist workflow states
•Direct Connect (private lines) 
•VPC 
•Encryption in flight(HTTPS/SSL/TLS on API, JDBC) 
–Parameter Group: require_ssl = true 
–Use Amazon Redshift cluster SSL certificate to verify cluster identity 
•Encryption at rest 
–AES-256 encrypt files prior to loading to S3 (not using S3 SSE) 
–Amazon Redshift encryption 
•Specified at cluster creation, applies to backups/snapshots too
•Amazon Redshift will store the cluster key in a singlecustomer premise HSM (or CloudHSM) 
–SafeNet Luna SA HSM, firmware version should match CloudHSM 
–Requires certificate exchange between cluster and HSM 
–Requires cluster have an EIP 
•On our side, required static 1-to-1 NAT of HSM private IP 
•VPC Security Groups still apply; can still isolate cluster from others 
–Encrypted database key decrypted in HSM, passed over encrypted channel to cluster on startup, stored in memory to decrypt data encryption (block) keys 
–If running an HSM HA group, must synchronize keys after creation
•HSM integration was critical to Nasdaq adoption 
•Monitor cluster access, react to any unauthorized connections 
–STL_CONNECTION_LOG 
•Query system table on a timed basis, alert to any unexpected access 
–CloudTrail to Splunk Amazon Redshift connection & user logs 
•Captures all API calls, not activity inside Amazon Redshift 
–STL_DDLTEXT 
•Audits all schema changes in the cluster 
•In response to an alert, Amazon Redshift/HSM connectivity is severed, and cluster is immediately shut down
•With validation, data integrity, and security requirements met, the challenge remains to optimize ingest 
•Why? 
–Concurrency is a huge performance factor; can’t afford to be loading yesterday’s data when clients are running queries
- 
20 
40 
60 
80 
100 
120 
140 
1 
2 
4 
6 
8 
10 
12 
14 
16 
18 
Throughput (MB/sec) 
Concurrent Threads 
S3 (over HTTPS) Multithreaded Throughput
On premises 
AWS Regional (Multi-AZ) Scope 
AWS (US-East, primary AZ/VPC) 
S3 
Amazon SNS 
Redshift Database Cluster 
HSM Key Appliance Cluster 
MySQL 
Redshift Load files/ Manifests 
Redshift Snapshots/ Backups 
Data Loaded Topic 
RMS Input Sources (multiple systems) 
Data Ingest Process
November 12, 2014 | Las Vegas, NV 
BDT206See How Amazon Redshift is Powering Business Intelligence in the Enterprise 
Kevin Diamond, Nordstromrack.com | HauteLook
Amazon Redshift
Staging 
Prod 
EMR 
Data Pipeline 
Data Pipeline
Staging 
Prod
medium speed 
medium storage 
$3.7k/month 
awesome support 
small storage 
$3.7k/month 
awesome support 
medium concurrency 
$10k/month 
awesome support
Total Storage 
Daily Transfer 
Monthly Growth 
Monthly Spend 
Estimated 3yr Savings
http://bit.ly/awsevals

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Database Services at AWS - Darin Briskman
Database Services at AWS - Darin BriskmanDatabase Services at AWS - Darin Briskman
Database Services at AWS - Darin Briskman
 
Migrating Oracle Databases to AWS
Migrating Oracle Databases to AWSMigrating Oracle Databases to AWS
Migrating Oracle Databases to AWS
 
Upgrade your SQL Server like a Ninja
Upgrade your SQL Server like a NinjaUpgrade your SQL Server like a Ninja
Upgrade your SQL Server like a Ninja
 
Migrating and modernizing your data estate to Azure with Data Migration Services
Migrating and modernizing your data estate to Azure with Data Migration ServicesMigrating and modernizing your data estate to Azure with Data Migration Services
Migrating and modernizing your data estate to Azure with Data Migration Services
 
Modernize Legacy and Enterprise Application Through Implementation of Cloud N...
Modernize Legacy and Enterprise Application Through Implementation of Cloud N...Modernize Legacy and Enterprise Application Through Implementation of Cloud N...
Modernize Legacy and Enterprise Application Through Implementation of Cloud N...
 
A Quick Guide to Sql Server Availability Groups
A Quick Guide to Sql Server Availability GroupsA Quick Guide to Sql Server Availability Groups
A Quick Guide to Sql Server Availability Groups
 
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...
 
New Database Migration Services & RDS Updates
New Database Migration Services & RDS UpdatesNew Database Migration Services & RDS Updates
New Database Migration Services & RDS Updates
 
Module 2 - Datalake
Module 2 - DatalakeModule 2 - Datalake
Module 2 - Datalake
 
Azure SQL Data Warehouse
Azure SQL Data Warehouse Azure SQL Data Warehouse
Azure SQL Data Warehouse
 
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
Migrate from Oracle to Aurora PostgreSQL: Best Practices, Design Patterns, & ...
 
Oracle and SQL Server on the Cloud - Bill Baldwin
Oracle and SQL Server on the Cloud - Bill BaldwinOracle and SQL Server on the Cloud - Bill Baldwin
Oracle and SQL Server on the Cloud - Bill Baldwin
 
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL DatabaseModern ETL: Azure Data Factory, Data Lake, and SQL Database
Modern ETL: Azure Data Factory, Data Lake, and SQL Database
 
How to Set Up ApsaraDB for RDS on Alibaba Cloud
How to Set Up ApsaraDB for RDS on Alibaba CloudHow to Set Up ApsaraDB for RDS on Alibaba Cloud
How to Set Up ApsaraDB for RDS on Alibaba Cloud
 
Migrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration ServiceMigrating to Amazon RDS with Database Migration Service
Migrating to Amazon RDS with Database Migration Service
 
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
AWS re:Invent 2016: Migrating a Highly Available and Scalable Database from O...
 
PaaSport to Paradise: Lifting & Shifting with Azure SQL Database/Managed Inst...
PaaSport to Paradise: Lifting & Shifting with Azure SQL Database/Managed Inst...PaaSport to Paradise: Lifting & Shifting with Azure SQL Database/Managed Inst...
PaaSport to Paradise: Lifting & Shifting with Azure SQL Database/Managed Inst...
 
SQL Server Database as a Cloud Service
SQL Server Database as a Cloud ServiceSQL Server Database as a Cloud Service
SQL Server Database as a Cloud Service
 
Start your datacentre transformation journey with azure migrate
Start your datacentre transformation journey with azure migrateStart your datacentre transformation journey with azure migrate
Start your datacentre transformation journey with azure migrate
 
Azure database services for PostgreSQL and MySQL
Azure database services for PostgreSQL and MySQLAzure database services for PostgreSQL and MySQL
Azure database services for PostgreSQL and MySQL
 

Destacado

Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
Amazon Web Services
 

Destacado (20)

AWS Customer Presentation - WeoGeo
AWS Customer Presentation - WeoGeo AWS Customer Presentation - WeoGeo
AWS Customer Presentation - WeoGeo
 
(APP311) Lessons Learned From Over a Decade of Deployments at Amazon | AWS re...
(APP311) Lessons Learned From Over a Decade of Deployments at Amazon | AWS re...(APP311) Lessons Learned From Over a Decade of Deployments at Amazon | AWS re...
(APP311) Lessons Learned From Over a Decade of Deployments at Amazon | AWS re...
 
AWS Customer Presentation - Skipso
AWS Customer Presentation - Skipso AWS Customer Presentation - Skipso
AWS Customer Presentation - Skipso
 
Migrating Entire Enterprise IT
Migrating Entire Enterprise ITMigrating Entire Enterprise IT
Migrating Entire Enterprise IT
 
AWS Webcast - Sumo Logic
AWS Webcast - Sumo LogicAWS Webcast - Sumo Logic
AWS Webcast - Sumo Logic
 
AWS Customer Presentation - Admeld
AWS Customer Presentation - Admeld AWS Customer Presentation - Admeld
AWS Customer Presentation - Admeld
 
(MBL305) The World Cup Second Screen Experience | AWS re:Invent 2014
(MBL305) The World Cup Second Screen Experience | AWS re:Invent 2014(MBL305) The World Cup Second Screen Experience | AWS re:Invent 2014
(MBL305) The World Cup Second Screen Experience | AWS re:Invent 2014
 
AWS Customer Presentation - Mind Touch
AWS Customer Presentation - Mind TouchAWS Customer Presentation - Mind Touch
AWS Customer Presentation - Mind Touch
 
AWS GovCloud (US) Fundamentals: Past, Present, and Future - AWS Symposium 201...
AWS GovCloud (US) Fundamentals: Past, Present, and Future - AWS Symposium 201...AWS GovCloud (US) Fundamentals: Past, Present, and Future - AWS Symposium 201...
AWS GovCloud (US) Fundamentals: Past, Present, and Future - AWS Symposium 201...
 
Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
Media Processing and Delivery on AWS, Santa Monica Meetup 6/25/14
 
(BDT205) Your First Big Data Application on AWS | AWS re:Invent 2014
(BDT205) Your First Big Data Application on AWS | AWS re:Invent 2014(BDT205) Your First Big Data Application on AWS | AWS re:Invent 2014
(BDT205) Your First Big Data Application on AWS | AWS re:Invent 2014
 
AWS Paris Summit 2014 - T2 - Optimisation du coût total de possession avec AWS
AWS Paris Summit 2014 - T2 - Optimisation du coût total de possession avec AWSAWS Paris Summit 2014 - T2 - Optimisation du coût total de possession avec AWS
AWS Paris Summit 2014 - T2 - Optimisation du coût total de possession avec AWS
 
DevOps for the Enterprise: Automated Testing and Monitoring
DevOps for the Enterprise: Automated Testing and Monitoring DevOps for the Enterprise: Automated Testing and Monitoring
DevOps for the Enterprise: Automated Testing and Monitoring
 
Intro to Amazon Web Services at Edinburgh Startup Event
Intro to Amazon Web Services at Edinburgh Startup EventIntro to Amazon Web Services at Edinburgh Startup Event
Intro to Amazon Web Services at Edinburgh Startup Event
 
(GAM201) Scalable Game Architectures That Don't Break the Bank | AWS re:Inven...
(GAM201) Scalable Game Architectures That Don't Break the Bank | AWS re:Inven...(GAM201) Scalable Game Architectures That Don't Break the Bank | AWS re:Inven...
(GAM201) Scalable Game Architectures That Don't Break the Bank | AWS re:Inven...
 
The Great Cloud Migration
The Great Cloud MigrationThe Great Cloud Migration
The Great Cloud Migration
 
Masterclass Webinar: Amazon DynamoDB July 2014
Masterclass Webinar: Amazon DynamoDB July 2014Masterclass Webinar: Amazon DynamoDB July 2014
Masterclass Webinar: Amazon DynamoDB July 2014
 
APN Partner Webinar - Security & Compliance for AWS EMEA Partners
APN Partner Webinar - Security & Compliance for AWS EMEA PartnersAPN Partner Webinar - Security & Compliance for AWS EMEA Partners
APN Partner Webinar - Security & Compliance for AWS EMEA Partners
 
10 Pro Tips for Scaling Your Startup from 0-10M Users
10 Pro Tips for Scaling Your Startup from 0-10M Users10 Pro Tips for Scaling Your Startup from 0-10M Users
10 Pro Tips for Scaling Your Startup from 0-10M Users
 
Continuous Integration and Deployment Best Practices on AWS - AWS Symposium 2...
Continuous Integration and Deployment Best Practices on AWS - AWS Symposium 2...Continuous Integration and Deployment Best Practices on AWS - AWS Symposium 2...
Continuous Integration and Deployment Best Practices on AWS - AWS Symposium 2...
 

Similar a (BDT206) See How Amazon Redshift is Powering Business Intelligence in the Enterprise | AWS re:Invent 2014

Similar a (BDT206) See How Amazon Redshift is Powering Business Intelligence in the Enterprise | AWS re:Invent 2014 (20)

[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift in 大阪]Amazon Redshift最新情報と導入事例のご紹介
 
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
[よくわかるAmazon Redshift]Amazon Redshift最新情報と導入事例のご紹介
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
AWS March 2016 Webinar Series - Building Big Data Solutions with Amazon EMR a...
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Migrating Your Data Warehouse to Amazon Redshift (DAT337) - AWS re:Invent 2018
Migrating Your Data Warehouse to Amazon Redshift (DAT337) - AWS re:Invent 2018Migrating Your Data Warehouse to Amazon Redshift (DAT337) - AWS re:Invent 2018
Migrating Your Data Warehouse to Amazon Redshift (DAT337) - AWS re:Invent 2018
 
Migrating your Databases to AWS: Deep Dive on Amazon RDS and AWS Database Mig...
Migrating your Databases to AWS: Deep Dive on Amazon RDS and AWS Database Mig...Migrating your Databases to AWS: Deep Dive on Amazon RDS and AWS Database Mig...
Migrating your Databases to AWS: Deep Dive on Amazon RDS and AWS Database Mig...
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
Serverless Analytics with Amazon Redshift Spectrum, AWS Glue, and Amazon Quic...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Databases - State of the Union
Databases - State of the UnionDatabases - State of the Union
Databases - State of the Union
 
Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018
 
Deep Dive on Amazon Relational Database Service
Deep Dive on Amazon Relational Database ServiceDeep Dive on Amazon Relational Database Service
Deep Dive on Amazon Relational Database Service
 
London Redshift Meetup - July 2017
London Redshift Meetup - July 2017London Redshift Meetup - July 2017
London Redshift Meetup - July 2017
 
(ISM304) Oracle to Amazon RDS MySQL & Aurora: How Gallup Made the Move
(ISM304) Oracle to Amazon RDS MySQL & Aurora: How Gallup Made the Move(ISM304) Oracle to Amazon RDS MySQL & Aurora: How Gallup Made the Move
(ISM304) Oracle to Amazon RDS MySQL & Aurora: How Gallup Made the Move
 
Introduction to Amazon Redshift
Introduction to Amazon RedshiftIntroduction to Amazon Redshift
Introduction to Amazon Redshift
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech TalksMigrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
 
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech TalksMigrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
Migrating Your Oracle Database to PostgreSQL - AWS Online Tech Talks
 

Más de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

(BDT206) See How Amazon Redshift is Powering Business Intelligence in the Enterprise | AWS re:Invent 2014

  • 1. November 12, 2014 | Las Vegas, NV BDT206See How Amazon Redshift is Powering Business Intelligence in the Enterprise Rahul Pathak, Amazon Redshift Jason Timmes, Nasdaq Kevin Diamond, Hautelook
  • 2.
  • 3. Amazon Redshift Amazon Elastic MapReduce Amazon EC2 Analyze AWS Data Pipeline Amazon Glacier Amazon DynamoDB Store AWS Direct Connect Collect Amazon Kinesis Amazon S3
  • 4. 10 GigE (HPC) Ingestion Backup Restore JDBC/ODBC
  • 5. 10 GigE (HPC) Ingestion Backup Restore Customer VPC InternalVPC JDBC/ODBC
  • 6.
  • 7.
  • 8. Data Source ET Direct Connect Client Forwarder State Management Loader Amazon Redshift Sandbox S3
  • 9.
  • 10.
  • 11. 11 LEADING INDEX PROVIDER WITH 41,000+ INDEXES ACROSS ASSET CLASSES AND GEOGRAPHIES Over 10,000 Corporate Clients in 60 countries Our technology powers over 70 MARKETPLACES, regulators, CSDs and clearing- houses in over 50 COUNTRIES 100+ DATA PRODUCT OFFERINGS supporting 2.5+ million investment professionals and users IN 98 COUNTRIES 26Markets 3 Clearing Houses 5Central Securities Depositories Lists more than 3,500 companies in 35 countries, representing more than $8.8 trillionin total market value
  • 12.
  • 13. Our warehouse can be used to analyze market share, client activity, surveillance, power our billing, and more…
  • 14.
  • 15.
  • 16.
  • 17.
  • 18. •Pay close attention to manifest mandatory flag! –Amazon Redshift UNLOAD always sets this to false!!!
  • 19. •TableIngestStatus –We originally put this table in Amazon Redshift itself –Turns out Amazon Redshift is not efficient on really small data sets –Significantly impacted performance, and increased concurrency contention •Solution: Moved TableIngestStatus to a separate transactional RDBMS (MySQL) –We were already using a MySQL instance to persist workflow states
  • 20. •Direct Connect (private lines) •VPC •Encryption in flight(HTTPS/SSL/TLS on API, JDBC) –Parameter Group: require_ssl = true –Use Amazon Redshift cluster SSL certificate to verify cluster identity •Encryption at rest –AES-256 encrypt files prior to loading to S3 (not using S3 SSE) –Amazon Redshift encryption •Specified at cluster creation, applies to backups/snapshots too
  • 21. •Amazon Redshift will store the cluster key in a singlecustomer premise HSM (or CloudHSM) –SafeNet Luna SA HSM, firmware version should match CloudHSM –Requires certificate exchange between cluster and HSM –Requires cluster have an EIP •On our side, required static 1-to-1 NAT of HSM private IP •VPC Security Groups still apply; can still isolate cluster from others –Encrypted database key decrypted in HSM, passed over encrypted channel to cluster on startup, stored in memory to decrypt data encryption (block) keys –If running an HSM HA group, must synchronize keys after creation
  • 22. •HSM integration was critical to Nasdaq adoption •Monitor cluster access, react to any unauthorized connections –STL_CONNECTION_LOG •Query system table on a timed basis, alert to any unexpected access –CloudTrail to Splunk Amazon Redshift connection & user logs •Captures all API calls, not activity inside Amazon Redshift –STL_DDLTEXT •Audits all schema changes in the cluster •In response to an alert, Amazon Redshift/HSM connectivity is severed, and cluster is immediately shut down
  • 23. •With validation, data integrity, and security requirements met, the challenge remains to optimize ingest •Why? –Concurrency is a huge performance factor; can’t afford to be loading yesterday’s data when clients are running queries
  • 24.
  • 25. - 20 40 60 80 100 120 140 1 2 4 6 8 10 12 14 16 18 Throughput (MB/sec) Concurrent Threads S3 (over HTTPS) Multithreaded Throughput
  • 26.
  • 27. On premises AWS Regional (Multi-AZ) Scope AWS (US-East, primary AZ/VPC) S3 Amazon SNS Redshift Database Cluster HSM Key Appliance Cluster MySQL Redshift Load files/ Manifests Redshift Snapshots/ Backups Data Loaded Topic RMS Input Sources (multiple systems) Data Ingest Process
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. November 12, 2014 | Las Vegas, NV BDT206See How Amazon Redshift is Powering Business Intelligence in the Enterprise Kevin Diamond, Nordstromrack.com | HauteLook
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 47.
  • 48.
  • 49. Staging Prod EMR Data Pipeline Data Pipeline
  • 51.
  • 52.
  • 53. medium speed medium storage $3.7k/month awesome support small storage $3.7k/month awesome support medium concurrency $10k/month awesome support
  • 54.
  • 55.
  • 56.
  • 57. Total Storage Daily Transfer Monthly Growth Monthly Spend Estimated 3yr Savings
  • 58.