SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Lake Patterns for Voice, Vision, Advanced
Analytics, & ML Using Serverless
Paul Armstrong
Principal Solutions
Architect
Greg Share
Enterprise Solutions
Architect
Jagadeesh Pusapadi
Enterprise Solutions
Architect
A R C 3 2 0
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Session abstract
Industry 4.0 demands greater insight into data to bring people, processes, and
equipment together. This workshop will illustrate how to gain business insights from
video and voice data sources, highlighting the data pipeline. We
will ingest source feeds, efficiently store the data, and perform advanced analytics
using Amazon Machine Learning (Amazon ML) services and analysis tools. Typical
applications include anomaly detection (detecting spills or hazardous objects
and predictive maintenance), voice sentiment analysis (customer service insights). By
the end of the session you will be able to quickly analyze data for uncommon
characteristics, using those detections to initiate a wide variety of actions.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Collect Store
Process/
analyze
Consume
Data Answers
Time to answer (latency)
Throughput
Cost
The data pipeline
Streaming
Collect Store ConsumeProcess/analyze
Apache Kafka
HotHotWarm
Fast
Stream
SQLNoSQLCacheFileStream
Mobile apps
Web apps
Devices
Sensors
IoT platforms
Data centers
Migration
Logging
RECORDS
FILES
STREAMS
Analysis&visualizationDataScience
DataTransport&LoggingIoTApplications
Presto
FastSlow
BatchInteractivePredictive
AmazonAI
Apps
Model
Train/
Eval
Models
Deploy
ETL
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workshop objectives
• Work through the data pipeline
• Focus on voice and video
• Demonstrate how to use the services to solve typical problems and use cases
• Leave with a working end to end solution
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Target audience
• Architects
• Developers/engineers
• Level 300 – good working knowledge of AWS services, CLI
• All code will be provided
• Note ML services will be used and the models shared, not an in-depth ML
session
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What do I need?
• AWS account with admin access
• AWS Command Line Interface (AWS CLI) command tools installed on laptop
• Understanding of AWS CloudFormation
• We will cover a broad spectrum of topics
• Voice/video
• Archive/storage/retrieval
• Analytics and ML
• Notification services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Workshop setup steps
• Login to AWS account
• Create an Amazon Simple Storage Service (Amazon S3) bucket
• Download the zip file from the link provided
• Un-compress and sync with the Amazon S3 bucket
• Open the workbook for detailed instructions
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Video footage and voice ingestion
Industrial/Smart City/Smart Home video feeds
Customer services/operations voice inputs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Objectives
Collect video feed from camera
• CCTV footage
• Production line camera
Collect voice feed using hotline
• Automated hotline
• Analyze contact center recordings
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data collection process
• Call center hotline interaction
to data lake
• Data combined with footage
data
• CCTV footage streamed into
the data lake
• Optional customer producer
interaction with incoming
footage
Collect Store
Process/
analyze
Consume
ML inference
at the edge
(anomaly/face detection)
Proxied video
stream
(optional)
Footage
Customer interaction data
Voice
CCTV
Image
Contact
center
Streaming
ingest
engine
Data
persistence
(data lake)
Footage direct into
streaming ingest engine
Streaming
Ingest Engine
Customer
Producer
Streaming
Ingest Engine
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Collect target architecture
• Phone hotline writing
contact records and voice
recordings to S3
• Shared bucket for images
• Cameras set up in workshop
and shared streams for cross
account access to Amazon
Kinesis Video Streams for
CCTV
• Optional direct access RTSP
stream and use own
producer to Kinesis Video
Streams for CCTV
Collect Store
Process/
analyze
Consume
Producer SDK writes directly to
the video stream
Model inference at the
edge to detect faces
Producer SDK
RTSP stream
read by proxy and
send to stream
Cropped images
from video stream
Contact records and
call recordings
Voice
CCTV
Image
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module activities
• Stream from local web camera or test setup of S3 sample
• Set up connect or use a sample audio file for analysis
Key activities
• Connect to Kinesis Video Streams streams to view content on the stream or
simulate sample frames for analysis
• Optionally configure connect to create voice recordings
• Ensure can access S3 of call recordings
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module outcomes
• Video stream enabled
• [Optional] Camera producer sending content to the stream
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Video footage and voice persistence
Video and voice persistence options for analysis use cases
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data storage process
• Voice/video data combined in
the data lake
• Authenticated access
mechanism for streaming
video footage access –
persisted to the data lake
Collect Store
Process/
analyze
Consume
Voice
CCTV
Video
Get content
from stream
Data
persistence
(data lake)
Serverless
processing
Data
persistence
(data lake)
Serverless ingested
footage
access mechanism
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Store—Target architecture
• Voice/image stored in S3
• CCTV API to request video
from steam and store in S3
with signed URL
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Get content
from stream
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module activities
• Test uploading files to S3 for
• Video analysis
• Voice analysis
• Sample video frame for video object detection
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module outcomes
• Store locations understood and accessible with sample data
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Video and voice analysis
Combined analysis for sentiment/anomaly detection
(Industrial H&S across manufacturing, travel and transport safety, and customer service)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Video and voice data processing
• Voice/image processing >
analysis > indexing
• CCTV run model for object
detection
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Serverless
processing
Serverless
processing
workflow
Video
analysis
Voice to text
conversion
Sentiment
analysis
Analysis
indexing
Machine learning
image
analysis
Serverless
processing
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Process—Target architecture
• Voice/image use AWS managed
services for indexing
• CCTV run model for object
detection
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Deployed
model and
endpoint
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module activities
• Test the ingestion of video or sample frame
• Validate that the image is processed and view the output image with detected
object and bounding box. Explanation of how the lambda parameters are
managed
• Explanation of model and further activities for building and enhancing the
deployed model
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module outcomes
• Solution using Amazon SageMaker to detect objects
• Solution that can analyze voice recordings
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data visualizations
Share CCTV footage with third parties
Real-time dashboards and alerting for both internal and external parties
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Output consumption
• Authenticated access to
Web based visualizations
• Indexed views
• Detected anomaly alerts
and alarm notifications
Collect Store
Process/
analyze
Consume
Voice
CCTV
Video
Get content from stream
Get Content
Notifications
Serverless
processing
Serverless
processing
Serverless
access
endpoint
Data
lake
Alerts and
alarms
Consumption
auth.
mechanism
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Consume—Target architecture
• HTML authenticated viewer
with Amazon Cognito
• View indexes
• Kibana dashboard viewer
• Amazon Simple Notification
Service (Amazon SNS)
notifications of detected
spills
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Get content
from stream
Get content
Stream
processor
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module activities
• View video from web page
• Configure alerts receive alert when an object is detected
• View analytics of voice streams
• View sentiment analysis (voice/video)
[Log in to local example or use AWS deployed solution]
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Module outcomes
• Visualization of the solutions in real time/historic
• Alerting to key events
• Reference architecture to use cases for real time voice/video detection and
alerting
Collect Store
Process/
analyze
Consume
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Voice and video analysis framework
Collect Store
Process/
analyze
Consume
Get content
from stream
Deployed model
and endpoint
Object/
image
detection
Video and customer
interaction processing
Contact
center
Streaming
ingest engine
Customer interaction data
Data
lake
Streaming
ingest engine
Footage direct into
streaming ingest engine
ML inference
at the edge
(anomaly/face detection)
Customer
producer
Proxied video
stream
(optional)
Streaming
ingest engine
Serverless processing
workflow
Video
analysis
Voice to text
conversion
Sentiment
analysis
Analysis
indexing
Serverless
processing
Serverless
access
endpoint
Data
lake
Consumption
auth. mechanism
Get
content
Serverless
processing
Serverless
processing
Machine learning
image
analysis
Notifications
Alerts and
alarms
Overall architecture
Collect Store
Process/
analyze
Consume
Get content
from stream
Get content
Deployed
model and
endpoint
Object/image detection
Producer SDK writes directly to
the video stream
Model inference at the
edge to detect faces
RTSP stream
read by proxy and
send to stream
Cropped images
from video stream
Contact records and
call recordings
Process
media
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Summary
• Ingest data securely and at scale
• Analyze new data sources to augment transactional data sources
• Use data sources for ML training and inference
• Integrate with existing applications and workflows
• Provide visualization on demand
• Real-time alerting
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Behind the Scenes: Exploring the AWS Global Network (NET305) - AWS re:Invent ...
Behind the Scenes: Exploring the AWS Global Network (NET305) - AWS re:Invent ...Behind the Scenes: Exploring the AWS Global Network (NET305) - AWS re:Invent ...
Behind the Scenes: Exploring the AWS Global Network (NET305) - AWS re:Invent ...
 
Amazon S3 & Amazon Glacier - Object Storage Overview
Amazon S3 & Amazon Glacier - Object Storage OverviewAmazon S3 & Amazon Glacier - Object Storage Overview
Amazon S3 & Amazon Glacier - Object Storage Overview
 
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
 
Containers - Amazon EKS
Containers - Amazon EKSContainers - Amazon EKS
Containers - Amazon EKS
 
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
Amazon SageMaker 모델 배포 방법 소개::김대근, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스페셜 웨비나
 
ABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS Glue
 
Deep dive ECS & Fargate Deep Dive
Deep dive ECS & Fargate Deep DiveDeep dive ECS & Fargate Deep Dive
Deep dive ECS & Fargate Deep Dive
 
AWS January 2016 Webinar Series - Managing your Infrastructure as Code
AWS January 2016 Webinar Series - Managing your Infrastructure as CodeAWS January 2016 Webinar Series - Managing your Infrastructure as Code
AWS January 2016 Webinar Series - Managing your Infrastructure as Code
 
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
LG 이노텍 - Amazon Redshift Serverless를 활용한 데이터 분석 플랫폼 혁신 과정 - 발표자: 유재상 선임, LG이노...
 
Introduction to AWS Organizations
Introduction to AWS OrganizationsIntroduction to AWS Organizations
Introduction to AWS Organizations
 
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
Trino: A Ludicrously Fast Query Engine - Pulsar Summit NA 2021
 
Amazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for KubernetesAmazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for Kubernetes
 
Amazon Redshift로 데이터웨어하우스(DW) 구축하기
Amazon Redshift로 데이터웨어하우스(DW) 구축하기Amazon Redshift로 데이터웨어하우스(DW) 구축하기
Amazon Redshift로 데이터웨어하우스(DW) 구축하기
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
Large Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured Streaming
 
서버리스 기반 데이터베이스 모델링 및 운영 노하우 알아보기 - 변규현 SW 엔지니어, 당근마켓 / 김선형 CTO, 티클 :: AWS Sum...
서버리스 기반 데이터베이스 모델링 및 운영 노하우 알아보기 - 변규현 SW 엔지니어, 당근마켓 / 김선형 CTO, 티클 :: AWS Sum...서버리스 기반 데이터베이스 모델링 및 운영 노하우 알아보기 - 변규현 SW 엔지니어, 당근마켓 / 김선형 CTO, 티클 :: AWS Sum...
서버리스 기반 데이터베이스 모델링 및 운영 노하우 알아보기 - 변규현 SW 엔지니어, 당근마켓 / 김선형 CTO, 티클 :: AWS Sum...
 
AWS Elastic Beanstalk
AWS Elastic BeanstalkAWS Elastic Beanstalk
AWS Elastic Beanstalk
 
Best Practices for CI/CD with AWS Lambda and Amazon API Gateway (SRV355-R1) -...
Best Practices for CI/CD with AWS Lambda and Amazon API Gateway (SRV355-R1) -...Best Practices for CI/CD with AWS Lambda and Amazon API Gateway (SRV355-R1) -...
Best Practices for CI/CD with AWS Lambda and Amazon API Gateway (SRV355-R1) -...
 
Using Databricks as an Analysis Platform
Using Databricks as an Analysis PlatformUsing Databricks as an Analysis Platform
Using Databricks as an Analysis Platform
 
Amazon Aurora: Under the Hood
Amazon Aurora: Under the HoodAmazon Aurora: Under the Hood
Amazon Aurora: Under the Hood
 

Similar a Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018

Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Web Services
 

Similar a Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018 (20)

Serverless Video Ingestion & Analytics with Amazon Kinesis Video Streams (ANT...
Serverless Video Ingestion & Analytics with Amazon Kinesis Video Streams (ANT...Serverless Video Ingestion & Analytics with Amazon Kinesis Video Streams (ANT...
Serverless Video Ingestion & Analytics with Amazon Kinesis Video Streams (ANT...
 
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
Running Your SQL Server Database on Amazon RDS (DAT329) - AWS re:Invent 2018
 
Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...Considerations for Building Your First Streaming Application (ANT359) - AWS r...
Considerations for Building Your First Streaming Application (ANT359) - AWS r...
 
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
Build Your Own Log Analytics Solutions on AWS (ANT323-R) - AWS re:Invent 2018
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
 
Analyze Amazon CloudFront and Lambda@Edge Logs to Improve Customer Experience...
Analyze Amazon CloudFront and Lambda@Edge Logs to Improve Customer Experience...Analyze Amazon CloudFront and Lambda@Edge Logs to Improve Customer Experience...
Analyze Amazon CloudFront and Lambda@Edge Logs to Improve Customer Experience...
 
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
 
Building an end to end image recognition service - Tel Aviv Summit 2018
Building an end to end image recognition service - Tel Aviv Summit 2018Building an end to end image recognition service - Tel Aviv Summit 2018
Building an end to end image recognition service - Tel Aviv Summit 2018
 
Increase the Value of Video with ML & Media Services - SRV322 - Anaheim AWS S...
Increase the Value of Video with ML & Media Services - SRV322 - Anaheim AWS S...Increase the Value of Video with ML & Media Services - SRV322 - Anaheim AWS S...
Increase the Value of Video with ML & Media Services - SRV322 - Anaheim AWS S...
 
Increase the Value of Video with Machine Learning & Media Services - SRV322 -...
Increase the Value of Video with Machine Learning & Media Services - SRV322 -...Increase the Value of Video with Machine Learning & Media Services - SRV322 -...
Increase the Value of Video with Machine Learning & Media Services - SRV322 -...
 
[REPEAT 1] Safeguard the Integrity of Your Code for Fast and Secure Deploymen...
[REPEAT 1] Safeguard the Integrity of Your Code for Fast and Secure Deploymen...[REPEAT 1] Safeguard the Integrity of Your Code for Fast and Secure Deploymen...
[REPEAT 1] Safeguard the Integrity of Your Code for Fast and Secure Deploymen...
 
Safeguard the Integrity of Your Code for Fast and Secure Deployments (DEV349-...
Safeguard the Integrity of Your Code for Fast and Secure Deployments (DEV349-...Safeguard the Integrity of Your Code for Fast and Secure Deployments (DEV349-...
Safeguard the Integrity of Your Code for Fast and Secure Deployments (DEV349-...
 
Get Started with Deep Learning and Computer Vision Using AWS DeepLens (AIM316...
Get Started with Deep Learning and Computer Vision Using AWS DeepLens (AIM316...Get Started with Deep Learning and Computer Vision Using AWS DeepLens (AIM316...
Get Started with Deep Learning and Computer Vision Using AWS DeepLens (AIM316...
 
Using AWS to Ingest, Store, Archive, Share and carry out Analysis of Video Co...
Using AWS to Ingest, Store, Archive, Share and carry out Analysis of Video Co...Using AWS to Ingest, Store, Archive, Share and carry out Analysis of Video Co...
Using AWS to Ingest, Store, Archive, Share and carry out Analysis of Video Co...
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
 
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
Need for Speed – Intro To Real-Time Data Streaming Analytics on AWS | AWS Sum...
 
Stream Video, Analyze It in Real Time, and Share It in Real Time (ANT357) - A...
Stream Video, Analyze It in Real Time, and Share It in Real Time (ANT357) - A...Stream Video, Analyze It in Real Time, and Share It in Real Time (ANT357) - A...
Stream Video, Analyze It in Real Time, and Share It in Real Time (ANT357) - A...
 
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
Best Practices to Secure Data Lake on AWS (ANT327) - AWS re:Invent 2018
 
Industrialize Machine Learning Using CI/CD Techniques (FSV304-i) - AWS re:Inv...
Industrialize Machine Learning Using CI/CD Techniques (FSV304-i) - AWS re:Inv...Industrialize Machine Learning Using CI/CD Techniques (FSV304-i) - AWS re:Inv...
Industrialize Machine Learning Using CI/CD Techniques (FSV304-i) - AWS re:Inv...
 
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
 

Más de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless Paul Armstrong Principal Solutions Architect Greg Share Enterprise Solutions Architect Jagadeesh Pusapadi Enterprise Solutions Architect A R C 3 2 0
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Session abstract Industry 4.0 demands greater insight into data to bring people, processes, and equipment together. This workshop will illustrate how to gain business insights from video and voice data sources, highlighting the data pipeline. We will ingest source feeds, efficiently store the data, and perform advanced analytics using Amazon Machine Learning (Amazon ML) services and analysis tools. Typical applications include anomaly detection (detecting spills or hazardous objects and predictive maintenance), voice sentiment analysis (customer service insights). By the end of the session you will be able to quickly analyze data for uncommon characteristics, using those detections to initiate a wide variety of actions.
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Collect Store Process/ analyze Consume Data Answers Time to answer (latency) Throughput Cost The data pipeline
  • 5. Streaming Collect Store ConsumeProcess/analyze Apache Kafka HotHotWarm Fast Stream SQLNoSQLCacheFileStream Mobile apps Web apps Devices Sensors IoT platforms Data centers Migration Logging RECORDS FILES STREAMS Analysis&visualizationDataScience DataTransport&LoggingIoTApplications Presto FastSlow BatchInteractivePredictive AmazonAI Apps Model Train/ Eval Models Deploy ETL
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workshop objectives • Work through the data pipeline • Focus on voice and video • Demonstrate how to use the services to solve typical problems and use cases • Leave with a working end to end solution
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Target audience • Architects • Developers/engineers • Level 300 – good working knowledge of AWS services, CLI • All code will be provided • Note ML services will be used and the models shared, not an in-depth ML session
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What do I need? • AWS account with admin access • AWS Command Line Interface (AWS CLI) command tools installed on laptop • Understanding of AWS CloudFormation • We will cover a broad spectrum of topics • Voice/video • Archive/storage/retrieval • Analytics and ML • Notification services
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Workshop setup steps • Login to AWS account • Create an Amazon Simple Storage Service (Amazon S3) bucket • Download the zip file from the link provided • Un-compress and sync with the Amazon S3 bucket • Open the workbook for detailed instructions
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Video footage and voice ingestion Industrial/Smart City/Smart Home video feeds Customer services/operations voice inputs
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Objectives Collect video feed from camera • CCTV footage • Production line camera Collect voice feed using hotline • Automated hotline • Analyze contact center recordings Collect Store Process/ analyze Consume
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data collection process • Call center hotline interaction to data lake • Data combined with footage data • CCTV footage streamed into the data lake • Optional customer producer interaction with incoming footage Collect Store Process/ analyze Consume ML inference at the edge (anomaly/face detection) Proxied video stream (optional) Footage Customer interaction data Voice CCTV Image Contact center Streaming ingest engine Data persistence (data lake) Footage direct into streaming ingest engine Streaming Ingest Engine Customer Producer Streaming Ingest Engine
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Collect target architecture • Phone hotline writing contact records and voice recordings to S3 • Shared bucket for images • Cameras set up in workshop and shared streams for cross account access to Amazon Kinesis Video Streams for CCTV • Optional direct access RTSP stream and use own producer to Kinesis Video Streams for CCTV Collect Store Process/ analyze Consume Producer SDK writes directly to the video stream Model inference at the edge to detect faces Producer SDK RTSP stream read by proxy and send to stream Cropped images from video stream Contact records and call recordings Voice CCTV Image
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module activities • Stream from local web camera or test setup of S3 sample • Set up connect or use a sample audio file for analysis Key activities • Connect to Kinesis Video Streams streams to view content on the stream or simulate sample frames for analysis • Optionally configure connect to create voice recordings • Ensure can access S3 of call recordings Collect Store Process/ analyze Consume
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module outcomes • Video stream enabled • [Optional] Camera producer sending content to the stream Collect Store Process/ analyze Consume
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Video footage and voice persistence Video and voice persistence options for analysis use cases
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data storage process • Voice/video data combined in the data lake • Authenticated access mechanism for streaming video footage access – persisted to the data lake Collect Store Process/ analyze Consume Voice CCTV Video Get content from stream Data persistence (data lake) Serverless processing Data persistence (data lake) Serverless ingested footage access mechanism
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Store—Target architecture • Voice/image stored in S3 • CCTV API to request video from steam and store in S3 with signed URL Collect Store Process/ analyze Consume Voice CCTV Image Get content from stream
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module activities • Test uploading files to S3 for • Video analysis • Voice analysis • Sample video frame for video object detection Collect Store Process/ analyze Consume
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module outcomes • Store locations understood and accessible with sample data Collect Store Process/ analyze Consume
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Video and voice analysis Combined analysis for sentiment/anomaly detection (Industrial H&S across manufacturing, travel and transport safety, and customer service)
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Video and voice data processing • Voice/image processing > analysis > indexing • CCTV run model for object detection Collect Store Process/ analyze Consume Voice CCTV Image Serverless processing Serverless processing workflow Video analysis Voice to text conversion Sentiment analysis Analysis indexing Machine learning image analysis Serverless processing
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Process—Target architecture • Voice/image use AWS managed services for indexing • CCTV run model for object detection Collect Store Process/ analyze Consume Voice CCTV Image Deployed model and endpoint
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module activities • Test the ingestion of video or sample frame • Validate that the image is processed and view the output image with detected object and bounding box. Explanation of how the lambda parameters are managed • Explanation of model and further activities for building and enhancing the deployed model Collect Store Process/ analyze Consume
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module outcomes • Solution using Amazon SageMaker to detect objects • Solution that can analyze voice recordings Collect Store Process/ analyze Consume
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Data visualizations Share CCTV footage with third parties Real-time dashboards and alerting for both internal and external parties
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Output consumption • Authenticated access to Web based visualizations • Indexed views • Detected anomaly alerts and alarm notifications Collect Store Process/ analyze Consume Voice CCTV Video Get content from stream Get Content Notifications Serverless processing Serverless processing Serverless access endpoint Data lake Alerts and alarms Consumption auth. mechanism
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Consume—Target architecture • HTML authenticated viewer with Amazon Cognito • View indexes • Kibana dashboard viewer • Amazon Simple Notification Service (Amazon SNS) notifications of detected spills Collect Store Process/ analyze Consume Voice CCTV Image Get content from stream Get content Stream processor
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module activities • View video from web page • Configure alerts receive alert when an object is detected • View analytics of voice streams • View sentiment analysis (voice/video) [Log in to local example or use AWS deployed solution] Collect Store Process/ analyze Consume
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Module outcomes • Visualization of the solutions in real time/historic • Alerting to key events • Reference architecture to use cases for real time voice/video detection and alerting Collect Store Process/ analyze Consume
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Voice and video analysis framework Collect Store Process/ analyze Consume Get content from stream Deployed model and endpoint Object/ image detection Video and customer interaction processing Contact center Streaming ingest engine Customer interaction data Data lake Streaming ingest engine Footage direct into streaming ingest engine ML inference at the edge (anomaly/face detection) Customer producer Proxied video stream (optional) Streaming ingest engine Serverless processing workflow Video analysis Voice to text conversion Sentiment analysis Analysis indexing Serverless processing Serverless access endpoint Data lake Consumption auth. mechanism Get content Serverless processing Serverless processing Machine learning image analysis Notifications Alerts and alarms
  • 33. Overall architecture Collect Store Process/ analyze Consume Get content from stream Get content Deployed model and endpoint Object/image detection Producer SDK writes directly to the video stream Model inference at the edge to detect faces RTSP stream read by proxy and send to stream Cropped images from video stream Contact records and call recordings Process media
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Summary • Ingest data securely and at scale • Analyze new data sources to augment transactional data sources • Use data sources for ML training and inference • Integrate with existing applications and workflows • Provide visualization on demand • Real-time alerting
  • 35. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.