Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018

© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Data Lake Patterns for Voice, Vision, Advanced
Analytics, & ML Using Serverless
Paul Armstrong
Principal Solutions
Architect
Greg Share
Enterprise Solutions
Architect
Jagadeesh Pusapadi
Enterprise Solutions
Architect
A R C 3 2 0

Session abstract
Industry 4.0 demands greater insight into data to bring people, processes, and
equipment together. This workshop will illustrate how to gain business insights from
video and voice data sources, highlighting the data pipeline. We
will ingest source feeds, efficiently store the data, and perform advanced analytics
using Amazon Machine Learning (Amazon ML) services and analysis tools. Typical
applications include anomaly detection (detecting spills or hazardous objects
and predictive maintenance), voice sentiment analysis (customer service insights). By
the end of the session you will be able to quickly analyze data for uncommon
characteristics, using those detections to initiate a wide variety of actions.

Collect Store
Process/
analyze
Consume
Data Answers
Time to answer (latency)
Throughput
Cost
The data pipeline

Streaming
Collect Store ConsumeProcess/analyze
Apache Kafka
HotHotWarm
Fast
Stream
SQLNoSQLCacheFileStream
Mobile apps
Web apps
Devices
Sensors
IoT platforms
Data centers
Migration
Logging
RECORDS
FILES
STREAMS
Analysis&visualizationDataScience
DataTransport&LoggingIoTApplications
Presto
FastSlow
BatchInteractivePredictive
AmazonAI
Apps
Model
Train/
Eval
Models
Deploy
ETL

Workshop objectives
• Work through the data pipeline
• Focus on voice and video
• Demonstrate how to use the services to solve typical problems and use cases
• Leave with a working end to end solution

Target audience
• Architects
• Developers/engineers
• Level 300 – good working knowledge of AWS services, CLI
• All code will be provided
• Note ML services will be used and the models shared, not an in-depth ML
session

What do I need?
• AWS account with admin access
• AWS Command Line Interface (AWS CLI) command tools installed on laptop
• Understanding of AWS CloudFormation
• We will cover a broad spectrum of topics
• Voice/video
• Archive/storage/retrieval
• Analytics and ML
• Notification services

Workshop setup steps
• Login to AWS account
• Create an Amazon Simple Storage Service (Amazon S3) bucket
• Download the zip file from the link provided
• Un-compress and sync with the Amazon S3 bucket
• Open the workbook for detailed instructions

Video footage and voice ingestion
Industrial/Smart City/Smart Home video feeds
Customer services/operations voice inputs

Objectives
Collect video feed from camera
• CCTV footage
• Production line camera
Collect voice feed using hotline
• Automated hotline
• Analyze contact center recordings
Collect Store
Process/
analyze
Consume

Data collection process
• Call center hotline interaction
to data lake
• Data combined with footage
data
• CCTV footage streamed into
the data lake
• Optional customer producer
interaction with incoming
footage
Collect Store
Process/
analyze
Consume
ML inference
at the edge
(anomaly/face detection)
Proxied video
stream
(optional)
Footage
Customer interaction data
Voice
CCTV
Image
Contact
center
Streaming
ingest
engine
Data
persistence
(data lake)
Footage direct into
streaming ingest engine
Streaming
Ingest Engine
Customer
Producer
Streaming
Ingest Engine

Collect target architecture
• Phone hotline writing
contact records and voice
recordings to S3
• Shared bucket for images
• Cameras set up in workshop
and shared streams for cross
account access to Amazon
Kinesis Video Streams for
CCTV
• Optional direct access RTSP
stream and use own
producer to Kinesis Video
Streams for CCTV
Collect Store
Process/
analyze
Consume
Producer SDK writes directly to
the video stream
Model inference at the
edge to detect faces
Producer SDK
RTSP stream
read by proxy and
send to stream
Cropped images
from video stream
Contact records and
call recordings
Voice
CCTV
Image

Module activities
• Stream from local web camera or test setup of S3 sample
• Set up connect or use a sample audio file for analysis
Key activities
• Connect to Kinesis Video Streams streams to view content on the stream or
simulate sample frames for analysis
• Optionally configure connect to create voice recordings
• Ensure can access S3 of call recordings
Collect Store
Process/
analyze
Consume

Module outcomes
• Video stream enabled
• [Optional] Camera producer sending content to the stream
Collect Store
Process/
analyze
Consume

Video footage and voice persistence
Video and voice persistence options for analysis use cases

Data storage process
• Voice/video data combined in
the data lake
• Authenticated access
mechanism for streaming
video footage access –
persisted to the data lake
Collect Store
Process/
analyze
Consume
Voice
CCTV
Video
Get content
from stream
Data
persistence
(data lake)
Serverless
processing
Data
persistence
(data lake)
Serverless ingested
footage
access mechanism

Store—Target architecture
• Voice/image stored in S3
• CCTV API to request video
from steam and store in S3
with signed URL
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Get content
from stream

Module activities
• Test uploading files to S3 for
• Video analysis
• Voice analysis
• Sample video frame for video object detection
Collect Store
Process/
analyze
Consume

Module outcomes
• Store locations understood and accessible with sample data
Collect Store
Process/
analyze
Consume

Video and voice analysis
Combined analysis for sentiment/anomaly detection
(Industrial H&S across manufacturing, travel and transport safety, and customer service)

Video and voice data processing
• Voice/image processing >
analysis > indexing
• CCTV run model for object
detection
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Serverless
processing
Serverless
processing
workflow
Video
analysis
Voice to text
conversion
Sentiment
analysis
Analysis
indexing
Machine learning
image
analysis
Serverless
processing

Process—Target architecture
• Voice/image use AWS managed
services for indexing
• CCTV run model for object
detection
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Deployed
model and
endpoint

Module activities
• Test the ingestion of video or sample frame
• Validate that the image is processed and view the output image with detected
object and bounding box. Explanation of how the lambda parameters are
managed
• Explanation of model and further activities for building and enhancing the
deployed model
Collect Store
Process/
analyze
Consume

Module outcomes
• Solution using Amazon SageMaker to detect objects
• Solution that can analyze voice recordings
Collect Store
Process/
analyze
Consume

Data visualizations
Share CCTV footage with third parties
Real-time dashboards and alerting for both internal and external parties

Output consumption
• Authenticated access to
Web based visualizations
• Indexed views
• Detected anomaly alerts
and alarm notifications
Collect Store
Process/
analyze
Consume
Voice
CCTV
Video
Get content from stream
Get Content
Notifications
Serverless
processing
Serverless
processing
Serverless
access
endpoint
Data
lake
Alerts and
alarms
Consumption
auth.
mechanism

Consume—Target architecture
• HTML authenticated viewer
with Amazon Cognito
• View indexes
• Kibana dashboard viewer
• Amazon Simple Notification
Service (Amazon SNS)
notifications of detected
spills
Collect Store
Process/
analyze
Consume
Voice
CCTV
Image
Get content
from stream
Get content
Stream
processor

Module activities
• View video from web page
• Configure alerts receive alert when an object is detected
• View analytics of voice streams
• View sentiment analysis (voice/video)
[Log in to local example or use AWS deployed solution]
Collect Store
Process/
analyze
Consume

Module outcomes
• Visualization of the solutions in real time/historic
• Alerting to key events
• Reference architecture to use cases for real time voice/video detection and
alerting
Collect Store
Process/
analyze
Consume

Voice and video analysis framework
Collect Store
Process/
analyze
Consume
Get content
from stream
Deployed model
and endpoint
Object/
image
detection
Video and customer
interaction processing
Contact
center
Streaming
ingest engine
Customer interaction data
Data
lake
Streaming
ingest engine
Footage direct into
streaming ingest engine
ML inference
at the edge
(anomaly/face detection)
Customer
producer
Proxied video
stream
(optional)
Streaming
ingest engine
Serverless processing
workflow
Video
analysis
Voice to text
conversion
Sentiment
analysis
Analysis
indexing
Serverless
processing
Serverless
access
endpoint
Data
lake
Consumption
auth. mechanism
Get
content
Serverless
processing
Serverless
processing
Machine learning
image
analysis
Notifications
Alerts and
alarms

Overall architecture
Collect Store
Process/
analyze
Consume
Get content
from stream
Get content
Deployed
model and
endpoint
Object/image detection
Producer SDK writes directly to
the video stream
Model inference at the
edge to detect faces
RTSP stream
read by proxy and
send to stream
Cropped images
from video stream
Contact records and
call recordings
Process
media

Summary
• Ingest data securely and at scale
• Analyze new data sources to augment transactional data sources
• Use data sources for ML training and inference
• Integrate with existing applications and workflows
• Provide visualization on demand
• Real-time alerting

Thank you!

Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018

Similar a Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018 (20)

Más de Amazon Web Services

Más de Amazon Web Services (20)

Data Lake Patterns for Voice, Vision, Advanced Analytics, & ML Using Serverless (ARC320-R1) - AWS re:Invent 2018