SlideShare una empresa de Scribd logo
1 de 50
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
Serverless Anaylsis & Visualization of
Website Access Logs
J o s e p h P e l l e , A W S S o f t w a r e D e v e l o p m e n t M a n a g e r
R a j e e v S r i n i v a s a n , A W S S o l u t i o n A r c h i t e c t
S a i S r i p a r a s a , A W S B i g D a t a C o n s u l t a n t
C T 4 0 8
N o v e m b e r 3 0 , 2 0 1 7
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What Are We Going to Cover?
• Benefits of Analyzing Amazon CloudFront Access Logs
• Service Introduction
• Amazon CloudFront Reports and Operational Metrics
• Custom Report Generation using Batch Analysis
• Bot detection and mitigation using Batch & Online Analysis
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Increased Service and Application Availability Meet Compliance & Audit Requirements
Reduce Website Latency Content Optimization Bot Detection & Mitigation
Benefits of Analyzing Access Logs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SERVICE INTRODUCTION
A W S L a m b d a
A m a z o n Q u i c k S i g h tA m a z o n K i n e s i s
A m a z o n C l o u d F r o n t
A m a z o n W A F
A m a z o n A t h e n a A W S G L U E
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront
Global Content Delivery Network
with Massive Capacity and Scale
Optimized for
Performance and Scale Built-in Security Features
Deep Integration with Key AWS Services Robust Real-Time Reporting Static and Dynamic Object and Video Delivery
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront
Amazon
CloudFront
Route 53
Amazon S3 Bucket Origin
Amazon S3 Bucket
for Access Logs
Amazon EC2 Origin
Access Logs
Private Datacenter
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront + AWS WAF
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront Access Logs
Amazon CloudFront
Edge Location
Amazon CloudFront
Edge Location
Amazon CloudFront
Edge Location
Amazon CloudFront
Edge Location
Website B
Website A
Log File for
Distribution A
Log File for
Distribution B
Amazon S3 Bucket
for Access Logs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront Access Logs
• S3 access control list (ACL) for the bucket must grant FULL_CONTROL to write log files to the bucket
• File Name Format
bucket-name.s3.amazonaws.com/optional-prefix/distribution-ID.YYYY-MM-DD-HH.unique-ID.gz
• File name includes the date and time of the period in which the requests occurred
• Delivers access logs for a distribution up to several times an hour
• CloudFront can save one or more files for a time period
• Disable logging: Amazon CloudFront doesn't delete the ACLs for both the bucket or the log files
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront: Customer Stories
Social Media, Digital
Advertising, EdTech,
Finance
Enterprise
E-commerce
Media & Entertainment Gaming
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Lambda: Stream ProcessingCapture Data
Streams
IoT Data
Financial
Data
Log Data
No servers to
provision or manage
EVENT SOURCE
Node.js
Python
Java
C#
Process Data Streams
FUNCTION
Clickstream
Data
Output
Data
DATABASE
CLOUD
SERVICES
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Kinesis
Amazon Kinesis
Streams
Amazon Kinesis
Firehose
Amazon Kinesis
Analytics
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Kinesis
Easy administration: Create a stream, set capacity level with shards. Scale to match your data throughput rate and
volume.
Build real-time applications: Process streaming data with Kinesis Client Library (KCL), Apache Spark/Storm, AWS
Lambda...
Low cost: Cost-efficient for workloads of any scale.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Firehose
Capture and submit streaming
data to Firehose
Analyze streaming data
using your favorite BI toolsFirehose loads streaming data continuously into Amazon S3,
Amazon Redshift and Amazon Elasticsearch
Zero administration: Capture and deliver streaming data to Amazon S3, Redshift, Elasticsearch
without writing an app or managing infrastructure
Direct-to-data store integration: Batch, compress, and encrypt streaming data for delivery in as little
as 60 seconds
Seamless elasticity: Seamlessly scales to match data throughput without intervention
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Delivery Format
• Multiple records concatenated based on buffering options
• Object path: app-name/YYYY/MM/DD/HH
• Object name: DeliveryStreamName-DeliveryStreamVersion-YYYY-MM-DD-HH-MM-SS-RandomString
Amazon Firehose
Delivery Stream
• Delivery stream can deliver to single S3 bucket
• Buffer size (1 MB to 128 MB)
• Buffer interval (60 to 900 seconds)
• Condition satisfied first triggers data delivery
Failure and Error Handling
Pause and retry for up to 24 hours
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Kinesis Analytics
Apply SQL on streams: Easily connect to a Kinesis Stream or Firehose Delivery Stream and apply SQL skills
Build real-time applications: Perform continual processing on streaming big data with sub-second processing
latencies
Easy scalability : Elastically scales to match data throughput
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Athena
Start Querying Instantly
Serverless, no ETL
Pay Per Query
Only pay for data scanned
Open, Powerful, Standard
Built on Presto, runs standard SQL.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon QuickSight
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon QuickSight: Data Sources
S P I C E
S P I C E
Amazon
Athena
Amazon
S3
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon QuickSight: Data Sources
Amazon
Redshift
Redshift
Spectrum
S P I C E
S P I C E
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Glue
Integrated
Data Catalog
Automated
Data Discovery
Code
Generation
Developer
Endpoints
Flexible
Job Scheduler
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Glue: Components
Ø Hive Metastore-compatible with enhanced functionality
Ø Crawlers automatically extract metadata and create tables
Ø Integrated with Amazon Athena, Amazon Redshift Spectrum
Ø Run jobs on a serverless Spark platform
Ø Provides flexible scheduling
Ø Handles dependency resolution, monitoring, and alerting
Ø Auto-generates ETL code
Ø Build on open frameworks: Python and Spark
Ø Developer-centric: editing, debugging, sharing
Data Catalog
Job Authoring
Job Execution
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon CloudFront
Reports and Operational Metrics
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• 5 ready-for-you reports
• Provides insight from viewer requests
• Usage, Cache Stats, Popular Objects, Viewer
Data, Top Referrers
• Built on top of Access Logs and Billing Data
• Use for an accurate historical view
• Up to 60-day history
• Data in CSV format
Amazon CloudFront Reports
Amazon
CloudFront
Edge Locations Amazon
CloudFront Console
Amazon S3
Log Bucket
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• 6 Operational Metrics
• Near-real time operational view
• Use to watch what is happening now
• 1 minute, 5 minute, 1 hour, 1 day granularity
• Available in Amazon CloudFront Console and
Amazon CloudWatch
• Can add Amazon CloudWatch Alarms and get SNS
notifications
• Monitor requests, bytes downloaded, bytes
uploaded, total error rate, 4xx Error Rate, 5xx
Error Rate
Amazon CloudFront Metrics
Amazon
CloudFront
Edge Locations
Amazon
CloudFront Console
Amazon
CloudWatch
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Monitoring
• Set alarm on 4xx Error Rate
• Monitor the spike in Amazon CloudFront or
Amazon CloudWatch
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Analysis
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Analysis
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operational Analysis
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Custom Report Generation
Batch Analysis
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Deeper Analysis of Access Logs
#Version: 1.0
#Fields: date time x-edge-location sc-
bytes c-ip cs-method cs(Host) cs-uri-
stem …
2017-01-18 10:00:00 AMS1 110891
194.xxx.xxx.xxx GET
d1234.cloudfront.net
/assets/some-static.jpg 200 …
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Setup the Reporting Pipeline
Amazon Athena
Amazon CloudFront
Access Logs
Amazon S3
AWS GlueAmazon
QuickSight
• Amazon CloudFront Access Logs are
available in S3
• Use AWS Glue to discover your data
sources, including your log bucket
• Build a data catalog of data and make it
available to Amazon Athena
• Use Amazon QuickSight to analyze the
data
• Amazon Athena will query your S3 bucket
• Simple and serverless!
AWS Glue Catalog
Data Source
Discovery
Query Data
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What are the Benefits?
Amazon Athena
Amazon S3
AWS GlueAmazon
QuickSight
• Deeper insight than default reports
• Customized reports
• Drill into 4xx, 5xx
• Show error rates for region
• Look at top X popular objects, not top 50
• Optimize your cache by correlating user
agents and cache behavior
• Edge Location reporting
• Sales and Marketing
• Mine your Viewer and Geo data
• Compare to popular objects
AWS Glue Catalog
Data Source
Discovery
Query Data
Amazon CloudFront
Access Logs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Bot Detection & Mitigation
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Example: Blacklisting Bad Bots
Block unwanted user agent headers and use transforms to stop evasion:
Host: www.example.com
User-Agent: bAdBoT
Accept: image/png,image/*;q=0.8,*/*;q=0.5
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate
Referrer: http://www.InTeRnEtkItTiEs.com/
Connection: keep-alive
AWS
WAF
RAW request headers
CloudFront
Check: Header “User-Agent”
Transform: To lower
Match Type: Contains
Match: “badbot”
Action: BLOCK
Rule
String match condition
Scraper bot
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
AWS WAF
Bot Detection Using Batch Analysis
Amazon
CloudFront
Users
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
AWS WAF
Bot Detection Using Batch Analysis
Amazon S3
Amazon
CloudFront
Users
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS
Lambda
Amazon S3
Amazon
CloudWatch
AWS WAF
Bot Detection Using Batch Analysis
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon S3
Amazon
Kinesis Firehose
AWS
Lambda
Amazon S3
Amazon
CloudWatch
AWS WAF
Bot Detection Using Batch Analysis
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
Athena
Amazon S3AWS
Lambda
Amazon S3
Amazon
CloudWatch
AWS WAF
Bot Detection Using Batch Analysis
Users
Amazon
CloudFront
Amazon
Kinesis Firehose
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon S3 Amazon
Kinesis Firehose
AWS
Lambda
AWS WAF
Bot Detection Using Online Analysis
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon S3 Amazon
Kinesis Firehose
AWS
Lambda
Amazon
Kinesis Analytics
AWS WAF
Bot Detection Using Online Analysis
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon S3 Amazon
Kinesis Firehose
AWS
Lambda
Amazon
Kinesis Analytics
Amazon
Kinesis Streams
AWS
Lambda
AWS WAF
Bot Detection Using Online Analysis
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon S3
Amazon
Kinesis FirehoseAWS
Lambda
Amazon
Kinesis Analytics
Amazon
Kinesis Streams
AWS Lambda
AWS WAF
Bot Detection & Mitigation
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon S3 Amazon
Kinesis Firehose
AWS
Lambda
Amazon Kinesis
Analytics
Amazon Kinesis
Streams
AWS
Lambda
Amazon
Athena
Amazon S3
AWS WAF
Bot Detection & Mitigation
Users Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon
CloudWatch
Amazon
S3
Amazon
Kinesis FirehoseAWS
Lambda
Amazon
Kinesis Analytics
Amazon
Kinesis Streams
AWS
Lambda
Amazon
Athena
Amazon
S3
Amazon
QuickSight
AWS WAF
Using Batch & Online Analysis for
Bot Detection & Mitigation
Users
Amazon
CloudFront
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Best Practices
Partitioning
• Reduce the amount of data scanned files necessary for queries
Compression file sizes
• Splittable files allow Athena’s execution engine to split the reading of a file by
multiple readers to increase parallelism
Columnar formats for analytics
• Optimize column-based reads
• Use Apache Parquet and Apache ORC
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Best Practices
Use environmental variables to pass operational parameters to your function
Control the dependencies in your function’s deployment package
Test your Lambda functions with different batches and record sizes
Use most-restrictive permissions when setting IAM policies
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Best Practices
Set up CloudWatch alarms
• Set up CloudWatch alarms to monitor input/output bytes and input/output records
• Monitor MillisBehindLatest to track how far behind the application is in reading from
the streaming source
Monitor CloudWatch alarms
• Getting the optimal number of Kinesis Analytics applications reading from the
Kinesis Data Streams and Kinesis Firehose Delivery Stream
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!
Pl e ase c o m p l e te yo ur su r v e y

Más contenido relacionado

La actualidad más candente

MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...Amazon Web Services
 
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudAmazon Web Services
 
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsNET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsAmazon Web Services
 
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...Amazon Web Services
 
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017Amazon Web Services
 
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...Amazon Web Services
 
DEV329_Cisco’s Journey from Monolith to Microservices
DEV329_Cisco’s Journey from Monolith to MicroservicesDEV329_Cisco’s Journey from Monolith to Microservices
DEV329_Cisco’s Journey from Monolith to MicroservicesAmazon Web Services
 
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...Amazon Web Services
 
STG203_Get Rid of Tape and Modernize Backup with AWS
STG203_Get Rid of Tape and Modernize Backup with AWSSTG203_Get Rid of Tape and Modernize Backup with AWS
STG203_Get Rid of Tape and Modernize Backup with AWSAmazon Web Services
 
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAmazon Web Services
 
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...Amazon Web Services
 
MSC204_Leverage AWS Marketplace to accelerate production ready workloads
MSC204_Leverage AWS Marketplace to accelerate production ready workloadsMSC204_Leverage AWS Marketplace to accelerate production ready workloads
MSC204_Leverage AWS Marketplace to accelerate production ready workloadsAmazon Web Services
 
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017Amazon Web Services
 
ARC304_From One to Many Evolving VPC Design
ARC304_From One to Many Evolving VPC DesignARC304_From One to Many Evolving VPC Design
ARC304_From One to Many Evolving VPC DesignAmazon Web Services
 
MBL201_Progressive Web Apps in the Real World
MBL201_Progressive Web Apps in the Real WorldMBL201_Progressive Web Apps in the Real World
MBL201_Progressive Web Apps in the Real WorldAmazon Web Services
 
CTD405_Building Serverless Video Workflows
CTD405_Building Serverless Video WorkflowsCTD405_Building Serverless Video Workflows
CTD405_Building Serverless Video WorkflowsAmazon Web Services
 
Storage State of the Union - STG201 - re:Invent 2017
Storage State of the Union - STG201 - re:Invent 2017Storage State of the Union - STG201 - re:Invent 2017
Storage State of the Union - STG201 - re:Invent 2017Amazon Web Services
 
Building Serverless Real-time Data Processing (workshop)
Building Serverless Real-time Data Processing (workshop)Building Serverless Real-time Data Processing (workshop)
Building Serverless Real-time Data Processing (workshop)Amazon Web Services
 
STG305_Deep Dive on Backup to the AWS Cloud
STG305_Deep Dive on Backup to the AWS CloudSTG305_Deep Dive on Backup to the AWS Cloud
STG305_Deep Dive on Backup to the AWS CloudAmazon Web Services
 
NET308_VPC Design Scenarios for Real-Life Use Cases
NET308_VPC Design Scenarios for Real-Life Use CasesNET308_VPC Design Scenarios for Real-Life Use Cases
NET308_VPC Design Scenarios for Real-Life Use CasesAmazon Web Services
 

La actualidad más candente (20)

MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
 
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
 
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsNET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
 
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
 
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
 
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...
Living on the Edge, It’s Safer Than You Think! Building Strong with Amazon Cl...
 
DEV329_Cisco’s Journey from Monolith to Microservices
DEV329_Cisco’s Journey from Monolith to MicroservicesDEV329_Cisco’s Journey from Monolith to Microservices
DEV329_Cisco’s Journey from Monolith to Microservices
 
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
 
STG203_Get Rid of Tape and Modernize Backup with AWS
STG203_Get Rid of Tape and Modernize Backup with AWSSTG203_Get Rid of Tape and Modernize Backup with AWS
STG203_Get Rid of Tape and Modernize Backup with AWS
 
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
 
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
 
MSC204_Leverage AWS Marketplace to accelerate production ready workloads
MSC204_Leverage AWS Marketplace to accelerate production ready workloadsMSC204_Leverage AWS Marketplace to accelerate production ready workloads
MSC204_Leverage AWS Marketplace to accelerate production ready workloads
 
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
 
ARC304_From One to Many Evolving VPC Design
ARC304_From One to Many Evolving VPC DesignARC304_From One to Many Evolving VPC Design
ARC304_From One to Many Evolving VPC Design
 
MBL201_Progressive Web Apps in the Real World
MBL201_Progressive Web Apps in the Real WorldMBL201_Progressive Web Apps in the Real World
MBL201_Progressive Web Apps in the Real World
 
CTD405_Building Serverless Video Workflows
CTD405_Building Serverless Video WorkflowsCTD405_Building Serverless Video Workflows
CTD405_Building Serverless Video Workflows
 
Storage State of the Union - STG201 - re:Invent 2017
Storage State of the Union - STG201 - re:Invent 2017Storage State of the Union - STG201 - re:Invent 2017
Storage State of the Union - STG201 - re:Invent 2017
 
Building Serverless Real-time Data Processing (workshop)
Building Serverless Real-time Data Processing (workshop)Building Serverless Real-time Data Processing (workshop)
Building Serverless Real-time Data Processing (workshop)
 
STG305_Deep Dive on Backup to the AWS Cloud
STG305_Deep Dive on Backup to the AWS CloudSTG305_Deep Dive on Backup to the AWS Cloud
STG305_Deep Dive on Backup to the AWS Cloud
 
NET308_VPC Design Scenarios for Real-Life Use Cases
NET308_VPC Design Scenarios for Real-Life Use CasesNET308_VPC Design Scenarios for Real-Life Use Cases
NET308_VPC Design Scenarios for Real-Life Use Cases
 

Similar a I Want to Analyze and Visualize Website Access Logs, but Why Do I Need Servers to Do That? - CTD408 - re:Invent 2017

How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...
How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...
How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...Amazon Web Services
 
ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317Amazon Web Services
 
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightAmazon Web Services
 
Serverless Architectural Patterns
Serverless Architectural PatternsServerless Architectural Patterns
Serverless Architectural PatternsAmazon Web Services
 
Dive deep into technical enhancements - re:Invent Come to London 2.0
Dive deep into technical enhancements - re:Invent Come to London 2.0Dive deep into technical enhancements - re:Invent Come to London 2.0
Dive deep into technical enhancements - re:Invent Come to London 2.0Amazon Web Services
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with ZopaAmazon Web Services
 
Serverless Architecture Patterns
Serverless Architecture PatternsServerless Architecture Patterns
Serverless Architecture PatternsAmazon Web Services
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Amazon Web Services
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Amazon Web Services
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
Scaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersScaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersAmazon Web Services
 
Journey Towards Scaling Your API to 10 Million Users
Journey Towards Scaling Your API to 10 Million UsersJourney Towards Scaling Your API to 10 Million Users
Journey Towards Scaling Your API to 10 Million UsersAdrian Hornsby
 
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...Amazon Web Services
 
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...RET301-Build Single Customer View across Multiple Retail Channels using AWS S...
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...Amazon Web Services
 
Introduction to AWS for Mobile Developers
Introduction to AWS for Mobile DevelopersIntroduction to AWS for Mobile Developers
Introduction to AWS for Mobile DevelopersAmazon Web Services
 
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...Amazon Web Services
 
Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...Amazon Web Services
 
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsNET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsAmazon Web Services
 

Similar a I Want to Analyze and Visualize Website Access Logs, but Why Do I Need Servers to Do That? - CTD408 - re:Invent 2017 (20)

How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...
How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...
How Nextdoor Built a Scalable, Serverless Data Pipeline for Billions of Event...
 
STG401_This Is My Architecture
STG401_This Is My ArchitectureSTG401_This Is My Architecture
STG401_This Is My Architecture
 
ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317
 
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
 
Serverless Architectural Patterns
Serverless Architectural PatternsServerless Architectural Patterns
Serverless Architectural Patterns
 
Dive deep into technical enhancements - re:Invent Come to London 2.0
Dive deep into technical enhancements - re:Invent Come to London 2.0Dive deep into technical enhancements - re:Invent Come to London 2.0
Dive deep into technical enhancements - re:Invent Come to London 2.0
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
 
Serverless Architecture Patterns
Serverless Architecture PatternsServerless Architecture Patterns
Serverless Architecture Patterns
 
Serverless Developer Experience
Serverless Developer ExperienceServerless Developer Experience
Serverless Developer Experience
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
 
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
Building Serverless Websites with Lambda@Edge - CTD309 - re:Invent 2017
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
Scaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersScaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million Users
 
Journey Towards Scaling Your API to 10 Million Users
Journey Towards Scaling Your API to 10 Million UsersJourney Towards Scaling Your API to 10 Million Users
Journey Towards Scaling Your API to 10 Million Users
 
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...
Manage Infrastructure Securely at Scale and Eliminate Operational Risks - DEV...
 
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...RET301-Build Single Customer View across Multiple Retail Channels using AWS S...
RET301-Build Single Customer View across Multiple Retail Channels using AWS S...
 
Introduction to AWS for Mobile Developers
Introduction to AWS for Mobile DevelopersIntroduction to AWS for Mobile Developers
Introduction to AWS for Mobile Developers
 
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...
End-User Computing on AWS with Amazon WorkSpaces and Amazon AppStream 2.0 - E...
 
Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...Building .NET-based Serverless Architectures and Running .NET Core Microservi...
Building .NET-based Serverless Architectures and Running .NET Core Microservi...
 
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security AnalyticsNET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
NET203_Using Amazon VPC Flow Logs to Do Predictive Security Analytics
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

I Want to Analyze and Visualize Website Access Logs, but Why Do I Need Servers to Do That? - CTD408 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT Serverless Anaylsis & Visualization of Website Access Logs J o s e p h P e l l e , A W S S o f t w a r e D e v e l o p m e n t M a n a g e r R a j e e v S r i n i v a s a n , A W S S o l u t i o n A r c h i t e c t S a i S r i p a r a s a , A W S B i g D a t a C o n s u l t a n t C T 4 0 8 N o v e m b e r 3 0 , 2 0 1 7
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What Are We Going to Cover? • Benefits of Analyzing Amazon CloudFront Access Logs • Service Introduction • Amazon CloudFront Reports and Operational Metrics • Custom Report Generation using Batch Analysis • Bot detection and mitigation using Batch & Online Analysis
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Increased Service and Application Availability Meet Compliance & Audit Requirements Reduce Website Latency Content Optimization Bot Detection & Mitigation Benefits of Analyzing Access Logs
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. SERVICE INTRODUCTION A W S L a m b d a A m a z o n Q u i c k S i g h tA m a z o n K i n e s i s A m a z o n C l o u d F r o n t A m a z o n W A F A m a z o n A t h e n a A W S G L U E
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront Global Content Delivery Network with Massive Capacity and Scale Optimized for Performance and Scale Built-in Security Features Deep Integration with Key AWS Services Robust Real-Time Reporting Static and Dynamic Object and Video Delivery
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront Amazon CloudFront Route 53 Amazon S3 Bucket Origin Amazon S3 Bucket for Access Logs Amazon EC2 Origin Access Logs Private Datacenter
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront + AWS WAF
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront Access Logs Amazon CloudFront Edge Location Amazon CloudFront Edge Location Amazon CloudFront Edge Location Amazon CloudFront Edge Location Website B Website A Log File for Distribution A Log File for Distribution B Amazon S3 Bucket for Access Logs
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront Access Logs • S3 access control list (ACL) for the bucket must grant FULL_CONTROL to write log files to the bucket • File Name Format bucket-name.s3.amazonaws.com/optional-prefix/distribution-ID.YYYY-MM-DD-HH.unique-ID.gz • File name includes the date and time of the period in which the requests occurred • Delivers access logs for a distribution up to several times an hour • CloudFront can save one or more files for a time period • Disable logging: Amazon CloudFront doesn't delete the ACLs for both the bucket or the log files
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront: Customer Stories Social Media, Digital Advertising, EdTech, Finance Enterprise E-commerce Media & Entertainment Gaming
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Lambda: Stream ProcessingCapture Data Streams IoT Data Financial Data Log Data No servers to provision or manage EVENT SOURCE Node.js Python Java C# Process Data Streams FUNCTION Clickstream Data Output Data DATABASE CLOUD SERVICES
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Kinesis Amazon Kinesis Streams Amazon Kinesis Firehose Amazon Kinesis Analytics
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Kinesis Easy administration: Create a stream, set capacity level with shards. Scale to match your data throughput rate and volume. Build real-time applications: Process streaming data with Kinesis Client Library (KCL), Apache Spark/Storm, AWS Lambda... Low cost: Cost-efficient for workloads of any scale.
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Firehose Capture and submit streaming data to Firehose Analyze streaming data using your favorite BI toolsFirehose loads streaming data continuously into Amazon S3, Amazon Redshift and Amazon Elasticsearch Zero administration: Capture and deliver streaming data to Amazon S3, Redshift, Elasticsearch without writing an app or managing infrastructure Direct-to-data store integration: Batch, compress, and encrypt streaming data for delivery in as little as 60 seconds Seamless elasticity: Seamlessly scales to match data throughput without intervention
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Delivery Format • Multiple records concatenated based on buffering options • Object path: app-name/YYYY/MM/DD/HH • Object name: DeliveryStreamName-DeliveryStreamVersion-YYYY-MM-DD-HH-MM-SS-RandomString Amazon Firehose Delivery Stream • Delivery stream can deliver to single S3 bucket • Buffer size (1 MB to 128 MB) • Buffer interval (60 to 900 seconds) • Condition satisfied first triggers data delivery Failure and Error Handling Pause and retry for up to 24 hours
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Kinesis Analytics Apply SQL on streams: Easily connect to a Kinesis Stream or Firehose Delivery Stream and apply SQL skills Build real-time applications: Perform continual processing on streaming big data with sub-second processing latencies Easy scalability : Elastically scales to match data throughput
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Athena Start Querying Instantly Serverless, no ETL Pay Per Query Only pay for data scanned Open, Powerful, Standard Built on Presto, runs standard SQL.
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon QuickSight
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon QuickSight: Data Sources S P I C E S P I C E Amazon Athena Amazon S3
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon QuickSight: Data Sources Amazon Redshift Redshift Spectrum S P I C E S P I C E
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Glue Integrated Data Catalog Automated Data Discovery Code Generation Developer Endpoints Flexible Job Scheduler
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Glue: Components Ø Hive Metastore-compatible with enhanced functionality Ø Crawlers automatically extract metadata and create tables Ø Integrated with Amazon Athena, Amazon Redshift Spectrum Ø Run jobs on a serverless Spark platform Ø Provides flexible scheduling Ø Handles dependency resolution, monitoring, and alerting Ø Auto-generates ETL code Ø Build on open frameworks: Python and Spark Ø Developer-centric: editing, debugging, sharing Data Catalog Job Authoring Job Execution
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudFront Reports and Operational Metrics
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • 5 ready-for-you reports • Provides insight from viewer requests • Usage, Cache Stats, Popular Objects, Viewer Data, Top Referrers • Built on top of Access Logs and Billing Data • Use for an accurate historical view • Up to 60-day history • Data in CSV format Amazon CloudFront Reports Amazon CloudFront Edge Locations Amazon CloudFront Console Amazon S3 Log Bucket
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • 6 Operational Metrics • Near-real time operational view • Use to watch what is happening now • 1 minute, 5 minute, 1 hour, 1 day granularity • Available in Amazon CloudFront Console and Amazon CloudWatch • Can add Amazon CloudWatch Alarms and get SNS notifications • Monitor requests, bytes downloaded, bytes uploaded, total error rate, 4xx Error Rate, 5xx Error Rate Amazon CloudFront Metrics Amazon CloudFront Edge Locations Amazon CloudFront Console Amazon CloudWatch
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Monitoring • Set alarm on 4xx Error Rate • Monitor the spike in Amazon CloudFront or Amazon CloudWatch
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Analysis
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Analysis
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operational Analysis
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Custom Report Generation Batch Analysis
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Deeper Analysis of Access Logs #Version: 1.0 #Fields: date time x-edge-location sc- bytes c-ip cs-method cs(Host) cs-uri- stem … 2017-01-18 10:00:00 AMS1 110891 194.xxx.xxx.xxx GET d1234.cloudfront.net /assets/some-static.jpg 200 …
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Setup the Reporting Pipeline Amazon Athena Amazon CloudFront Access Logs Amazon S3 AWS GlueAmazon QuickSight • Amazon CloudFront Access Logs are available in S3 • Use AWS Glue to discover your data sources, including your log bucket • Build a data catalog of data and make it available to Amazon Athena • Use Amazon QuickSight to analyze the data • Amazon Athena will query your S3 bucket • Simple and serverless! AWS Glue Catalog Data Source Discovery Query Data
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What are the Benefits? Amazon Athena Amazon S3 AWS GlueAmazon QuickSight • Deeper insight than default reports • Customized reports • Drill into 4xx, 5xx • Show error rates for region • Look at top X popular objects, not top 50 • Optimize your cache by correlating user agents and cache behavior • Edge Location reporting • Sales and Marketing • Mine your Viewer and Geo data • Compare to popular objects AWS Glue Catalog Data Source Discovery Query Data Amazon CloudFront Access Logs
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Bot Detection & Mitigation
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Example: Blacklisting Bad Bots Block unwanted user agent headers and use transforms to stop evasion: Host: www.example.com User-Agent: bAdBoT Accept: image/png,image/*;q=0.8,*/*;q=0.5 Accept-Language: en-US,en;q=0.5 Accept-Encoding: gzip, deflate Referrer: http://www.InTeRnEtkItTiEs.com/ Connection: keep-alive AWS WAF RAW request headers CloudFront Check: Header “User-Agent” Transform: To lower Match Type: Contains Match: “badbot” Action: BLOCK Rule String match condition Scraper bot
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch AWS WAF Bot Detection Using Batch Analysis Amazon CloudFront Users
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch AWS WAF Bot Detection Using Batch Analysis Amazon S3 Amazon CloudFront Users
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Lambda Amazon S3 Amazon CloudWatch AWS WAF Bot Detection Using Batch Analysis Users Amazon CloudFront
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon S3 Amazon Kinesis Firehose AWS Lambda Amazon S3 Amazon CloudWatch AWS WAF Bot Detection Using Batch Analysis Users Amazon CloudFront
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Athena Amazon S3AWS Lambda Amazon S3 Amazon CloudWatch AWS WAF Bot Detection Using Batch Analysis Users Amazon CloudFront Amazon Kinesis Firehose
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis Firehose AWS Lambda AWS WAF Bot Detection Using Online Analysis Users Amazon CloudFront
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis Firehose AWS Lambda Amazon Kinesis Analytics AWS WAF Bot Detection Using Online Analysis Users Amazon CloudFront
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis Firehose AWS Lambda Amazon Kinesis Analytics Amazon Kinesis Streams AWS Lambda AWS WAF Bot Detection Using Online Analysis Users Amazon CloudFront
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis FirehoseAWS Lambda Amazon Kinesis Analytics Amazon Kinesis Streams AWS Lambda AWS WAF Bot Detection & Mitigation Users Amazon CloudFront
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis Firehose AWS Lambda Amazon Kinesis Analytics Amazon Kinesis Streams AWS Lambda Amazon Athena Amazon S3 AWS WAF Bot Detection & Mitigation Users Amazon CloudFront
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon CloudWatch Amazon S3 Amazon Kinesis FirehoseAWS Lambda Amazon Kinesis Analytics Amazon Kinesis Streams AWS Lambda Amazon Athena Amazon S3 Amazon QuickSight AWS WAF Using Batch & Online Analysis for Bot Detection & Mitigation Users Amazon CloudFront
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Best Practices Partitioning • Reduce the amount of data scanned files necessary for queries Compression file sizes • Splittable files allow Athena’s execution engine to split the reading of a file by multiple readers to increase parallelism Columnar formats for analytics • Optimize column-based reads • Use Apache Parquet and Apache ORC
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Best Practices Use environmental variables to pass operational parameters to your function Control the dependencies in your function’s deployment package Test your Lambda functions with different batches and record sizes Use most-restrictive permissions when setting IAM policies
  • 49. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Best Practices Set up CloudWatch alarms • Set up CloudWatch alarms to monitor input/output bytes and input/output records • Monitor MillisBehindLatest to track how far behind the application is in reading from the streaming source Monitor CloudWatch alarms • Getting the optimal number of Kinesis Analytics applications reading from the Kinesis Data Streams and Kinesis Firehose Delivery Stream
  • 50. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank you! Pl e ase c o m p l e te yo ur su r v e y