SlideShare una empresa de Scribd logo
1 de 30
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
Self-Service Analytics with AWS
Big Data and Tableau
T A D B U H M A N - E X P E D I A
A N N A T E R P - E X P E D I A
N o v e m b e r 2 9 , 2 0 1 7
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AGENDA
WHO AND WHY
PLATFORM
PRODUCT
SELF-SERVICE ANALYTICS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
WHO	AND	WHY?
Expedia is a global leader in
online travel
Expedia does business in 200 countries
$2.96B revenue/millions of transactions Q3 2017
Optimize payment processing
Track high cost of credit card processing
Transaction systems are not built for reporting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
STRATEGIC TOPICS
Think about guiding principles
Embrace the impermanence of the cloud
End product is usable and familiar
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GUIDING PRINCIPLES
Transaction level data not aggregated
Cloud-ready technology
Standard, loosely coupled interfaces
Horizontally scalable
Self-Service analytics
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PRODUCT AND PLATFORM
HIGH LEVEL OVERVIEW
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ENGINEERED FACT TABLES AND
ASSOCIATED VIEWS
MANAGED METRICS MANAGED DATA SOURCES
CORE PRODUCT – ENABLES SELF-SERVICE
orig_payment_instrument_
type
orig_paymen
t_transaction
_code
Success Fail
CreditCard AUTH X X
CreditCard BASV X
GiftCard AUTH X X
InternetBankPayment CAPTURE X X
InternetBankPayment VOID X
PayPal REDIRECT X
PayPal AUTH X X
Points AUTH X X
ELV CAPTURE X X
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PARAMETERIZED REPORTS PIVOT/CHART BUILDERS “START FROM SCRATCH”
EDITABLE TEMPLATES
SELF-SERVICE REPORTING
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PLATFORM ARCHITECTURE
Collection Transformation & Data Store
Orchestration
Information Delivery
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PLATFORM – BUILD STRATEGY
Platform components
Data sources
Transformation/data store
Orchestration
Information delivery
Loosely coupled interfaces
Prototype
Justify complexity
#1 Risk and Priority
#2 Risk and Priority
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
COLLECTION
• Loosely coupled interface
• JSON data
• Physical files
• Wide support
• Self describing
• S3 Landing Zone
• Central location
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TRANSFORMATION – DATA STORE
• Hive on EMR
• OnDemand
• HiveQL
• MySQL Hive Metastore
• Persist schema
• S3 Data Store
• Persist data
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Service Interaction Data lineage & Workflow management
ORCHESTRATION
DynamoDBData Pipeline
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
INFORMATION DELIVERY
Redshift
• Fast analytic queries
• OnDemand reports
• SQL
• Client and Reporting tool support
• Scalable
Data abstraction
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
JUSTIFY COMPLEXITY COMPONENT INTERFACES – IMPERMANENCE
PLATFORM – STRATEGY REVIEW
Amazon S3
Data Lake
Amazon Redshift
Spectrum
Amazon Athena
Amazon EMR
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TEST DATA VOLUME AGGREGATE WHERE NEEDEDUNDERSTAND TABLE USAGE &
BUILD ACCORDINGLY
PLATFORM MEETS PRESENTATION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SELF-SERVICE ANALYTICS
HIGH LEVEL OVERVIEW
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PRESENTATION LAYER IMPLEMENTATION
How we evaluated products in the Cloud
How we maintained familiar concepts
Our Self-Service methodology
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PRESENTATION LAYER CHOICE
Evaluation Best Practices
Test Evaluate
Load Software
Drop Instance
Repeat
Create EC2 Instance
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FAMILIAR CONCEPTS
• Consistent, familiar
names
• Numbering for sorting
• Folders
• Time and effort on
calculated measures
Cube – Legacy Tableau Data Source
CALCULATED MEASURES
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FAMILIAR CONCEPTS – FOLDERS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SELF-SERVICE ANALYTICS
Parameterized Reports
Pivot Builder
Editable Templates
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TABLEAU
Best Practices
Lessons Learned
Optimization
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TABLEAU – BEST PRACTICES
• Abstract tables using Views
• Naming: Generic, descriptive, readable
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TABLEAU – LESSONS LEARNED
• Don’t try to extract 500M rows
• Consider performance – always
• Edit XML to change environment
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TABLEAU – EDIT DATA SOURCE
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TABLEAU – OPTIMIZATION
• Aggregation
• Assume Referential Integrity
• Consider performance when writing reports
• Data Source Optimization
• Create Calculations
• Create Hierarchies
• Hide fields – hide anything empty or not used by anyone, for
improved extract generation performance
• Set default properties (number format, comments)
• Assign Data Types and Geographic Roles
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONCLUSION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TAKEAWAYS
Think about guiding principles
Embrace the impermanence of the cloud
Development doesn’t have to happen in sequence
End product is usable and familiar
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
QUESTIONS

Más contenido relacionado

La actualidad más candente

Building Best Practices and the Right Foundation for your 1st Production Work...
Building Best Practices and the Right Foundation for your 1st Production Work...Building Best Practices and the Right Foundation for your 1st Production Work...
Building Best Practices and the Right Foundation for your 1st Production Work...Amazon Web Services
 
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...Amazon Web Services
 
Getting from Here to There: A Journey from On-premises to Serverless Architec...
Getting from Here to There: A Journey from On-premises to Serverless Architec...Getting from Here to There: A Journey from On-premises to Serverless Architec...
Getting from Here to There: A Journey from On-premises to Serverless Architec...Amazon Web Services
 
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017Amazon Web Services
 
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...Amazon Web Services
 
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...Amazon Web Services
 
STG205_#EarthOnAWS How NASA is Using AWS
STG205_#EarthOnAWS How NASA is Using AWSSTG205_#EarthOnAWS How NASA is Using AWS
STG205_#EarthOnAWS How NASA is Using AWSAmazon Web Services
 
AMF305_Autonomous Driving Algorithm Development on Amazon AI
AMF305_Autonomous Driving Algorithm Development on Amazon AIAMF305_Autonomous Driving Algorithm Development on Amazon AI
AMF305_Autonomous Driving Algorithm Development on Amazon AIAmazon Web Services
 
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetCMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetAmazon Web Services
 
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfMAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfAmazon Web Services
 
GPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsGPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsAmazon Web Services
 
GAM310_Build a Telemetry and Analytics Pipeline for Game Balancing
GAM310_Build a Telemetry and Analytics Pipeline for Game BalancingGAM310_Build a Telemetry and Analytics Pipeline for Game Balancing
GAM310_Build a Telemetry and Analytics Pipeline for Game BalancingAmazon Web Services
 
ARC210_Building Scalable Multi-Tenant Email Sending Programs
ARC210_Building Scalable Multi-Tenant Email Sending ProgramsARC210_Building Scalable Multi-Tenant Email Sending Programs
ARC210_Building Scalable Multi-Tenant Email Sending ProgramsAmazon Web Services
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Amazon Web Services
 
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...Amazon Web Services
 
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdf
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdfAMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdf
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdfAmazon Web Services
 
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...Amazon Web Services
 
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Amazon Web Services
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsAmazon Web Services
 
CON208_Building Microservices on AWS
CON208_Building Microservices on AWSCON208_Building Microservices on AWS
CON208_Building Microservices on AWSAmazon Web Services
 

La actualidad más candente (20)

Building Best Practices and the Right Foundation for your 1st Production Work...
Building Best Practices and the Right Foundation for your 1st Production Work...Building Best Practices and the Right Foundation for your 1st Production Work...
Building Best Practices and the Right Foundation for your 1st Production Work...
 
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
 
Getting from Here to There: A Journey from On-premises to Serverless Architec...
Getting from Here to There: A Journey from On-premises to Serverless Architec...Getting from Here to There: A Journey from On-premises to Serverless Architec...
Getting from Here to There: A Journey from On-premises to Serverless Architec...
 
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017
Disney's Magic The Story of Cloud Transformation - ARC206 - re:Invent 2017
 
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...
Exploring Blockchain Technology, Risks, and Emerging Trends - ARC313 - re:Inv...
 
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...
ABD302_Real-Time Data Exploration and Analytics with Amazon Elasticsearch Ser...
 
STG205_#EarthOnAWS How NASA is Using AWS
STG205_#EarthOnAWS How NASA is Using AWSSTG205_#EarthOnAWS How NASA is Using AWS
STG205_#EarthOnAWS How NASA is Using AWS
 
AMF305_Autonomous Driving Algorithm Development on Amazon AI
AMF305_Autonomous Driving Algorithm Development on Amazon AIAMF305_Autonomous Driving Algorithm Development on Amazon AI
AMF305_Autonomous Driving Algorithm Development on Amazon AI
 
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetCMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
 
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdfMAE402-Media Intelligence for the Cloud with Amazon AI.pdf
MAE402-Media Intelligence for the Cloud with Amazon AI.pdf
 
GPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsGPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital Markets
 
GAM310_Build a Telemetry and Analytics Pipeline for Game Balancing
GAM310_Build a Telemetry and Analytics Pipeline for Game BalancingGAM310_Build a Telemetry and Analytics Pipeline for Game Balancing
GAM310_Build a Telemetry and Analytics Pipeline for Game Balancing
 
ARC210_Building Scalable Multi-Tenant Email Sending Programs
ARC210_Building Scalable Multi-Tenant Email Sending ProgramsARC210_Building Scalable Multi-Tenant Email Sending Programs
ARC210_Building Scalable Multi-Tenant Email Sending Programs
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
 
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...
MAE304-Turners Cloud Archive for CNN's Video Library and Global Multiplatform...
 
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdf
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdfAMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdf
AMF302-Alexa Wheres My Car A Test Drive of the AWS Connected Car Reference.pdf
 
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...
GPSWKS408-GPS Migrate Your Databases with AWS Database Migration Service and ...
 
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data Applications
 
CON208_Building Microservices on AWS
CON208_Building Microservices on AWSCON208_Building Microservices on AWS
CON208_Building Microservices on AWS
 

Similar a Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017

Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseAmazon Web Services
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansAmazon Web Services
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with ZopaAmazon Web Services
 
Fanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSFanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSAmazon Web Services
 
Managing a Database Migration Project Best Practices and Customer References.pdf
Managing a Database Migration Project Best Practices and Customer References.pdfManaging a Database Migration Project Best Practices and Customer References.pdf
Managing a Database Migration Project Best Practices and Customer References.pdfAmazon Web Services
 
Big Data Architecture and Design Patterns
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design PatternsJohn Yeung
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Migrating to Amazon RDS with Database Migration Service:
Migrating to Amazon RDS with Database Migration Service:Migrating to Amazon RDS with Database Migration Service:
Migrating to Amazon RDS with Database Migration Service:Amazon Web Services
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudAmazon Web Services
 
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta pengguna
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta penggunaScale Website dan Mobile Applications Anda di AWS hingga 10 juta pengguna
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta penggunaAmazon Web Services
 
ABD307_Deep Analytics for Global AWS Marketing Organization
ABD307_Deep Analytics for Global AWS Marketing OrganizationABD307_Deep Analytics for Global AWS Marketing Organization
ABD307_Deep Analytics for Global AWS Marketing OrganizationAmazon Web Services
 
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...Amazon Web Services
 
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...Amazon Web Services
 
Modernizing DMS: Database Week SF
Modernizing DMS: Database Week SFModernizing DMS: Database Week SF
Modernizing DMS: Database Week SFAmazon Web Services
 
McGraw-Hill Optimizes Analytics Workloads with Databricks
 McGraw-Hill Optimizes Analytics Workloads with Databricks McGraw-Hill Optimizes Analytics Workloads with Databricks
McGraw-Hill Optimizes Analytics Workloads with DatabricksAmazon Web Services
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...Amazon Web Services
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingAmazon Web Services
 

Similar a Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017 (20)

Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data Oceans
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
 
Fanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSFanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWS
 
Managing a Database Migration Project Best Practices and Customer References.pdf
Managing a Database Migration Project Best Practices and Customer References.pdfManaging a Database Migration Project Best Practices and Customer References.pdf
Managing a Database Migration Project Best Practices and Customer References.pdf
 
Big Data Architecture and Design Patterns
Big Data Architecture and Design PatternsBig Data Architecture and Design Patterns
Big Data Architecture and Design Patterns
 
Deep Dive on Big Data
Deep Dive on Big Data Deep Dive on Big Data
Deep Dive on Big Data
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Migrating to Amazon RDS with Database Migration Service:
Migrating to Amazon RDS with Database Migration Service:Migrating to Amazon RDS with Database Migration Service:
Migrating to Amazon RDS with Database Migration Service:
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the Cloud
 
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta pengguna
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta penggunaScale Website dan Mobile Applications Anda di AWS hingga 10 juta pengguna
Scale Website dan Mobile Applications Anda di AWS hingga 10 juta pengguna
 
ABD307_Deep Analytics for Global AWS Marketing Organization
ABD307_Deep Analytics for Global AWS Marketing OrganizationABD307_Deep Analytics for Global AWS Marketing Organization
ABD307_Deep Analytics for Global AWS Marketing Organization
 
Modernizing Databases with DMS
Modernizing Databases with DMSModernizing Databases with DMS
Modernizing Databases with DMS
 
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
 
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...
Migrating Microsoft SQL Server Databases to AWS – Best Practices and Patterns...
 
Modernizing DMS: Database Week SF
Modernizing DMS: Database Week SFModernizing DMS: Database Week SF
Modernizing DMS: Database Week SF
 
McGraw-Hill Optimizes Analytics Workloads with Databricks
 McGraw-Hill Optimizes Analytics Workloads with Databricks McGraw-Hill Optimizes Analytics Workloads with Databricks
McGraw-Hill Optimizes Analytics Workloads with Databricks
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Self-Service Analytics with AWS Big Data and Tableau - ARC217 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT Self-Service Analytics with AWS Big Data and Tableau T A D B U H M A N - E X P E D I A A N N A T E R P - E X P E D I A N o v e m b e r 2 9 , 2 0 1 7
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AGENDA WHO AND WHY PLATFORM PRODUCT SELF-SERVICE ANALYTICS
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. WHO AND WHY? Expedia is a global leader in online travel Expedia does business in 200 countries $2.96B revenue/millions of transactions Q3 2017 Optimize payment processing Track high cost of credit card processing Transaction systems are not built for reporting
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. STRATEGIC TOPICS Think about guiding principles Embrace the impermanence of the cloud End product is usable and familiar
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GUIDING PRINCIPLES Transaction level data not aggregated Cloud-ready technology Standard, loosely coupled interfaces Horizontally scalable Self-Service analytics
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PRODUCT AND PLATFORM HIGH LEVEL OVERVIEW
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ENGINEERED FACT TABLES AND ASSOCIATED VIEWS MANAGED METRICS MANAGED DATA SOURCES CORE PRODUCT – ENABLES SELF-SERVICE orig_payment_instrument_ type orig_paymen t_transaction _code Success Fail CreditCard AUTH X X CreditCard BASV X GiftCard AUTH X X InternetBankPayment CAPTURE X X InternetBankPayment VOID X PayPal REDIRECT X PayPal AUTH X X Points AUTH X X ELV CAPTURE X X
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PARAMETERIZED REPORTS PIVOT/CHART BUILDERS “START FROM SCRATCH” EDITABLE TEMPLATES SELF-SERVICE REPORTING
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PLATFORM ARCHITECTURE Collection Transformation & Data Store Orchestration Information Delivery
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PLATFORM – BUILD STRATEGY Platform components Data sources Transformation/data store Orchestration Information delivery Loosely coupled interfaces Prototype Justify complexity #1 Risk and Priority #2 Risk and Priority
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. COLLECTION • Loosely coupled interface • JSON data • Physical files • Wide support • Self describing • S3 Landing Zone • Central location
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TRANSFORMATION – DATA STORE • Hive on EMR • OnDemand • HiveQL • MySQL Hive Metastore • Persist schema • S3 Data Store • Persist data
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Service Interaction Data lineage & Workflow management ORCHESTRATION DynamoDBData Pipeline
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. INFORMATION DELIVERY Redshift • Fast analytic queries • OnDemand reports • SQL • Client and Reporting tool support • Scalable Data abstraction
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. JUSTIFY COMPLEXITY COMPONENT INTERFACES – IMPERMANENCE PLATFORM – STRATEGY REVIEW Amazon S3 Data Lake Amazon Redshift Spectrum Amazon Athena Amazon EMR
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TEST DATA VOLUME AGGREGATE WHERE NEEDEDUNDERSTAND TABLE USAGE & BUILD ACCORDINGLY PLATFORM MEETS PRESENTATION
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. SELF-SERVICE ANALYTICS HIGH LEVEL OVERVIEW
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PRESENTATION LAYER IMPLEMENTATION How we evaluated products in the Cloud How we maintained familiar concepts Our Self-Service methodology
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PRESENTATION LAYER CHOICE Evaluation Best Practices Test Evaluate Load Software Drop Instance Repeat Create EC2 Instance
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FAMILIAR CONCEPTS • Consistent, familiar names • Numbering for sorting • Folders • Time and effort on calculated measures Cube – Legacy Tableau Data Source CALCULATED MEASURES
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FAMILIAR CONCEPTS – FOLDERS
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. SELF-SERVICE ANALYTICS Parameterized Reports Pivot Builder Editable Templates
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TABLEAU Best Practices Lessons Learned Optimization
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TABLEAU – BEST PRACTICES • Abstract tables using Views • Naming: Generic, descriptive, readable
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TABLEAU – LESSONS LEARNED • Don’t try to extract 500M rows • Consider performance – always • Edit XML to change environment
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TABLEAU – EDIT DATA SOURCE
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TABLEAU – OPTIMIZATION • Aggregation • Assume Referential Integrity • Consider performance when writing reports • Data Source Optimization • Create Calculations • Create Hierarchies • Hide fields – hide anything empty or not used by anyone, for improved extract generation performance • Set default properties (number format, comments) • Assign Data Types and Geographic Roles
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONCLUSION
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TAKEAWAYS Think about guiding principles Embrace the impermanence of the cloud Development doesn’t have to happen in sequence End product is usable and familiar
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. QUESTIONS