Hi, my name is Kanchan Waikar. I am a Senior Specialist Architect at AWS, and today I am going to share how you can add differentiating features backed by machine learning to your applications.
First, I will talk about the different types of data that you typically extract insights from. Then I will share how you can use hundreds of pre-trained ML models to build differentiating features. And finally, I will share a sample use case, and Joseph and I will demo how you can integrate ML models into your Kafka applications.
There are typically three types of datasets that you use: in-house data sitting in your data lake in your S3 buckets; clickstream or real-time data, also known as data in motion; and third-party data that you procure from your data vendors.
Amazon S3 is the largest and most performant object storage service for structured and unstructured data, and the storage service of choice for building a data lake. And there are several tools and services that you can use to build your data lake in S3.
For your data in motion, Amazon Kinesis offers a variety of key capabilities: Kinesis Data Streams for your clickstream data, Kinesis Data Firehose for persisting data in S3, Kinesis Data Analytics for real-time analytics, and Kinesis Video Streams for video data. And then there is Amazon MSK, a managed service for Apache Kafka, which you can use to build and run applications that use Apache Kafka to process streaming data.
Many customers have in-house ML capabilities, and these customers need external, real-world, high-quality data. However, they need to worry about moving this data into their AWS cloud, not just once but at regular intervals, since third-party data vendors often produce and share data periodically. And this is exactly the problem AWS Data Exchange solves.
AWS Data Exchange makes it easy for you to procure third-party data from your data vendors. AWS Data Exchange contains over three thousand data products, and once procured, the data can be loaded into your S3 bucket. Once it's in your data lake, you can use whichever tools you wish to perform analytics.
AWS Data Exchange also supports incremental data delivery from sellers: whenever a new revision of a dataset you have subscribed to becomes available, you get a CloudWatch event notification that your application can use to consume the incremental data.
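As a sketch of what reacting to that notification could look like, here is a small handler that pulls the dataset and revision IDs out of the event and kicks off an export of the new revisions into S3. The event shape, the sample IDs, and the Data Exchange API calls are written from memory, so verify them against the service documentation before relying on them.

```python
import json

def revision_ids_from_event(event):
    """Pull the data set ID and new revision IDs out of an AWS Data Exchange
    'Revision Published To Data Set' event (shape assumed; verify against
    your own notifications)."""
    data_set_id = event["resources"][0]
    revision_ids = event["detail"]["RevisionIds"]
    return data_set_id, revision_ids

def export_revisions_to_s3(event, bucket):
    """Start an export job for each newly published revision (sketch;
    not executed here)."""
    import boto3  # lazy import so the parsing helper works without the SDK
    dx = boto3.client("dataexchange")
    data_set_id, revision_ids = revision_ids_from_event(event)
    job = dx.create_job(
        Type="EXPORT_REVISIONS_TO_S3",
        Details={"ExportRevisionsToS3": {
            "DataSetId": data_set_id,
            "RevisionDestinations": [
                {"Bucket": bucket, "RevisionId": rid} for rid in revision_ids
            ],
        }},
    )
    dx.start_job(JobId=job["Id"])

# A sample event in the assumed shape (IDs are made up):
sample = {
    "source": "aws.dataexchange",
    "detail-type": "Revision Published To Data Set",
    "resources": ["example-data-set-id"],
    "detail": {"RevisionIds": ["example-revision-id"]},
}
ds, revs = revision_ids_from_event(sample)
print(ds, revs)
```

Wiring this function to the CloudWatch event rule (for example, as a Lambda target) gives you hands-off consumption of each new revision.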
Once you have data in place, you can perform machine learning and extract insights your business needs.
And Amazon SageMaker can help you with that.
You can use Amazon SageMaker to build, train, and deploy ML models. It provides several features that help you with the end-to-end delivery and management of machine learning models.
If you prefer something out of the box, I recommend checking out the AWS AI services suite.
Amazon Rekognition makes it easy to add image and video analysis to your applications.
Amazon Transcribe provides speech-to-text capability.
Amazon Translate provides high-quality language translation.
Amazon Comprehend is a natural language processing (NLP) service.
Amazon Lex is a service for building conversational chatbots.
Amazon Forecast helps you train forecasting ML models.
And you can use Amazon Personalize to build recommendation systems.
In short, AWS offers a range of AI, ML, and analytics services.
And apart from first-party services, you also get a large selection of third-party AI and ML solutions in AWS Marketplace.
AWS Marketplace is where you find, try, buy, and deploy third-party software. It contains over 10,000 software products across categories such as machine learning, analytics, and data, among several others. They are easy to deploy, and many products are even available as SaaS offerings, so you have nothing to deploy.
And billing for these third-party products is consolidated on your AWS bill.
The AWS Marketplace offers a wide variety of pricing options to fit your specific needs, with many sellers providing free trials.
AWS Marketplace supports standard pricing options such as hourly, monthly, and annual. And if you are migrating to AWS and want to bring some of your existing tools, you can bring your own license as well.
When you have a relationship with a seller or consultant, you can negotiate the price with them, and they can generate a private offer for you via AWS Marketplace.
In fact, the Confluent Cloud SaaS product for Kafka that Joseph spoke about can be procured via AWS Marketplace.
Confluent Cloud manages Apache Kafka, Schema Registry, Connect, and ksqlDB for you, so you can focus on development and delivery for your real-time streaming and analytics use cases.
Now I am going to tell you about pre-trained machine learning models, which you can use to instantly add ML-backed differentiating features to your application.
A pre-trained ML model is an entity that accepts an input payload and returns a prediction. A pre-trained ML model typically solves one specific type of problem.
For example, there is an ML model that accepts a car's picture as input and returns a prediction of the car's make, model, and year.
There is another pre-trained machine learning model that identifies whether a person is wearing a mask or not.
And customers like using pre-trained models because they let you avoid the heavy lifting of hiring ML talent and training and tuning ML models from scratch.
So users look for high-quality pre-trained ML models, typically developed by machine learning vendors. However, it can be tricky to use third-party ML solutions.
There are plenty of high-quality models available from technology companies, but during the initial discussion phase itself, an important question arises: where do you evaluate and qualify the model?
Does it happen in the seller's environment or in the buyer's environment?
The seller wants to protect their IP.
And the buyer wants to protect their data, which is often sensitive to their business.
You also need to learn a seller-specific interface to interact with the model.
We heard all of these challenges from customers and decided to solve them via AWS Marketplace.
You can find hundreds of third-party machine learning models and algorithms that you can try, buy, and deploy in your AWS environment via Amazon SageMaker.
You can try them without having to learn different API interfaces just to evaluate them.
AWS Marketplace contains a large set of models from leading machine learning ISVs (independent software vendors), as well as some open-source frameworks from AWS.
There are two types of Amazon SageMaker-compatible machine learning products.
The first is model packages: an ML model is an entity that accepts an input payload and returns a prediction.
The second is algorithms, which you use to train a custom ML model.
For example, say you have a large dataset you want to train a regression model on. You can use high-performance AutoML algorithms such as AutoGluon-Tabular from AWS Marketplace and train a high-quality ML model without having to learn a whole lot about machine learning.
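As a rough sketch of that workflow, here is how you might prepare train/validation splits and hand them to a subscribed Marketplace algorithm through the SageMaker Python SDK's AlgorithmEstimator. The algorithm ARN, IAM role, instance type, channel names, and S3 paths are placeholders to check against the algorithm's listing, not the exact setup for AutoGluon-Tabular.

```python
import csv, io, random

def split_csv(csv_text, val_fraction=0.2, seed=0):
    """Split a CSV (header + rows) into train/validation strings, since
    training jobs typically take separate train and validation inputs."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    random.Random(seed).shuffle(body)
    cut = int(len(body) * (1 - val_fraction))
    def to_csv(rs):
        buf = io.StringIO()
        w = csv.writer(buf)
        w.writerow(header)
        w.writerows(rs)
        return buf.getvalue()
    return to_csv(body[:cut]), to_csv(body[cut:])

def train_with_marketplace_algorithm(algorithm_arn, role, train_s3, val_s3):
    """Train using a subscribed AWS Marketplace algorithm (sketch only;
    all arguments are placeholders)."""
    from sagemaker.algorithm import AlgorithmEstimator  # lazy: needs the SageMaker SDK
    est = AlgorithmEstimator(
        algorithm_arn=algorithm_arn,
        role=role,
        instance_count=1,
        instance_type="ml.m5.2xlarge",
    )
    est.fit({"training": train_s3, "validation": val_s3})
    return est

data = "y,x1,x2\n1,0.5,3\n0,0.1,7\n1,0.9,2\n0,0.2,8\n1,0.7,1\n"
train_csv, val_csv = split_csv(data, val_fraction=0.4)
print(train_csv.count("\n") - 1, val_csv.count("\n") - 1)  # rows in each split
```

After you upload the two CSVs to S3, `fit` launches the training job; the resulting model can then be deployed like any other SageMaker model.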
The AWS Marketplace for machine learning models works very much like other AWS Marketplace products. You can browse and choose the model you like. Many models even have an option that lets you try a demo with your own data for free, without having to subscribe.
<click 1>
These models are deployed using Amazon SageMaker in your AWS account with network isolation to protect your data.
<click 1>
You can easily perform real-time or batch inference on these models via a REST API.
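For example, a real-time inference call could look roughly like this. The endpoint name and the response schema are assumptions, since each Marketplace model documents its own input and output formats.

```python
import json

def top_prediction(response_body):
    """Pick the highest-confidence prediction from a JSON response.
    The {"predictions": [{"label": ..., "score": ...}]} shape is an
    assumption; check the model's listing for its real format."""
    preds = json.loads(response_body)["predictions"]
    return max(preds, key=lambda p: p["score"])

def classify_image(endpoint_name, image_path):
    """Real-time inference against a deployed model package (sketch)."""
    import boto3  # lazy import so the parsing helper is usable without AWS
    runtime = boto3.client("sagemaker-runtime")
    with open(image_path, "rb") as f:
        resp = runtime.invoke_endpoint(
            EndpointName=endpoint_name,
            ContentType="image/jpeg",
            Body=f.read(),
        )
    return top_prediction(resp["Body"].read())

# Parsing an example response in the assumed shape:
body = '{"predictions": [{"label": "hard_hat", "score": 0.91}, {"label": "no_hard_hat", "score": 0.09}]}'
print(top_prediction(body))
```

For batch workloads, the same model package can instead be used with a SageMaker batch transform job rather than a live endpoint.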
AWS Marketplace enables customers to use third-party models securely via four key features.
When sellers list their models, Amazon SageMaker performs static and dynamic vulnerability scans to help you secure your data.
Amazon SageMaker encrypts algorithm, model, and other system artifacts in transit and at rest, and it isolates the deployed algorithm/model artifacts from internet access, helping you secure your data.
These containers are deployed in an internet-free environment. You can even choose to deploy them in your own private VPC and control access to it. You can also configure and monitor VPC flow logs to see what's going into and coming out of the container.
Requests to the Amazon SageMaker API are made over a secure (SSL) connection.
Amazon SageMaker requires AWS Identity and Access Management (IAM) credentials, via an IAM execution role, to access resources and data in your deployment.
And you can also use Private Marketplace and AWS Service Catalog to further control the procurement and distribution of models.
Developers with little data science expertise, as well as business users, use these models to easily build AI/ML-backed solutions.
You can see multiple sample use cases on the slide. For example, you can use an ML model to perform optical character recognition and extract characters from an image.
Data analysts have to manually identify and use only good-quality data, and this process can often be expedited by using an ML model.
For example, a background noise classifier can be used to identify whether an audio file contains background noise. With ML models you can go one step further and use a source separation model that separates speech from background sounds, and then use the cleaned audio files for training your own ML models.
There are ML models in the image and text categories too.
During the feature engineering step, data analysts and data engineers identify and create the features that will be used.
There are also ML models that you can use to generate additional synthetic features.
For example, there is a model that accepts text and returns an emotion, which can be a powerful synthetic feature for your book-genre classification ML model.
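As a minimal illustration of that idea, here is how an emotion column could be appended to a dataset. The model call is injected as a callable, so a wrapper around a deployed endpoint (or, as here, a simple stub) can supply the predictions; the field names and labels are made up.

```python
def add_emotion_feature(rows, predict_emotion):
    """Append a model-derived 'emotion' column to tabular text data.
    predict_emotion is any callable mapping text -> label, e.g. a
    wrapper around a deployed Marketplace model endpoint."""
    return [dict(row, emotion=predict_emotion(row["text"])) for row in rows]

# Stub standing in for the real emotion-detection endpoint:
def stub_model(text):
    return "joy" if "happy" in text else "neutral"

books = [{"title": "A", "text": "a happy tale"},
         {"title": "B", "text": "a dry manual"}]
enriched = add_emotion_feature(books, stub_model)
print([r["emotion"] for r in enriched])  # → ['joy', 'neutral']
```

The enriched rows can then feed a downstream classifier, with the emotion label acting as the synthetic feature.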
Now let me show you how to explore an ML model, deploy it, and perform inference on it.
Here is a quick summary of what I did during my demo today.
I chose a model, subscribed to it, and then executed a CloudFormation template to deploy the model.
Once the model was deployed, I used the AWS CLI to perform inference.
You can see how application developers without any ML knowledge can integrate ML into their solutions.
Now, before I start talking about how you can integrate ML into your Kafka applications, I want to quickly cover a little more about Amazon MSK, the Amazon Managed Streaming for Apache Kafka service.
Amazon MSK makes it easy for you to build and run production applications on Apache Kafka without needing Apache Kafka infrastructure management expertise. That means you spend less time managing infrastructure and more time building applications.
With a few clicks in the Amazon MSK console you can create highly available Apache Kafka clusters with settings based on best practices. MSK automatically provisions and runs your clusters.
It continuously monitors cluster health and automatically replaces unhealthy nodes with no downtime to your application.
Now let me show you how you can add machine learning to your applications via a hypothetical use case.
Imagine that you work for a company that sells a safety surveillance product for construction sites.
Your customers have installed cameras provided by your company, and your company is supposed to monitor the sites to help them improve compliance with safety standards.
So you need to ensure that onsite workers are wearing personal protective equipment and hard hats, which help avoid serious injuries. You know that you need a solution that helps you identify non-compliance early in the game. You also need a system that scales to accommodate a large number of cameras.
Let me show you what I am talking about via this video. We see that there are two workers, and one of them is not wearing a hard hat, which means a guideline has not been followed.
We would like to identify this non-compliance incident so that it can be fixed, reducing the probability of accidents or head injuries.
Ideally, we want a system that summarizes the actions happening on site via a summary log, and whenever non-compliance happens, it should get detected.
On this slide, you can see, snapshot by snapshot, the progress happening at the worksite.
And around 7.5 seconds into the video, you see the driver handling the excavator getting detected and an alarm being generated, which can potentially prompt the driver to wear PPE.
To build such an architecture, here are the components we are going to need.
Amazon S3 – a scalable object storage service.
Detecting a hard hat or PPE is a machine learning task – you need what we call a machine learning model, which can take an image and return a prediction.
Next, you need Amazon SageMaker – a machine learning platform on which you can build, train, and deploy ML models. To build a solution for the problem I just discussed, we would deploy computer vision models that help us identify whether a person is in the picture and whether he or she is wearing a high-visibility vest and a hard hat.
Whenever we detect non-compliance, we want to notify administrators, and for that we would use SNS – Simple Notification Service – which lets us send email, text message, and other kinds of notifications.
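The notification step can be sketched in a few lines: format a human-readable alert and publish it to an SNS topic. The message fields and the topic ARN are illustrative placeholders, not a fixed schema from the demo.

```python
def format_alarm(camera_id, timestamp, finding):
    """Human-readable non-compliance message for administrators.
    Field names here are illustrative, not a fixed schema."""
    return (f"[PPE ALERT] camera={camera_id} t={timestamp}s: {finding}. "
            "Please review the footage and follow up on site.")

def notify(topic_arn, message):
    """Publish the alert to an SNS topic (sketch; the ARN is a placeholder)."""
    import boto3  # lazy import: formatting stays testable without AWS
    boto3.client("sns").publish(
        TopicArn=topic_arn,
        Subject="PPE non-compliance",
        Message=message,
    )

msg = format_alarm("cam-042", 7.5, "worker detected without hard hat")
print(msg)
```

Email and SMS subscribers on the topic would then receive the alert without any extra delivery code.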
We also need an event streaming platform that helps us scale the analysis of the summary logs generated for hundreds and thousands of cameras – we would use Kafka topics created using Confluent Cloud.
Similarly, we would use ksqlDB to transfer non-compliance messages into a separate topic for further processing.
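To make that ksqlDB step concrete, here is a hedged sketch of the kind of statement involved and how it could be submitted to ksqlDB's REST API. The stream and column names are made up for illustration, and the endpoint and credentials are placeholders for your own Confluent Cloud cluster.

```python
import json
from urllib import request

# The KSQL statement that copies non-compliant events into their own topic.
# Stream/column names (SUMMARY_LOGS, COMPLIANT, ...) are illustrative.
KSQL = """
CREATE STREAM NON_COMPLIANCE AS
  SELECT CAMERA_ID, TS, FINDING
  FROM SUMMARY_LOGS
  WHERE COMPLIANT = false
  EMIT CHANGES;
""".strip()

def submit_ksql(endpoint, statement, basic_auth=None):
    """POST a statement to the ksqlDB REST API's /ksql resource
    (sketch only; not executed here)."""
    payload = json.dumps({"ksql": statement, "streamsProperties": {}}).encode()
    req = request.Request(
        endpoint.rstrip("/") + "/ksql",
        data=payload,
        headers={"Content-Type": "application/vnd.ksql.v1+json"},
    )
    if basic_auth:
        req.add_header("Authorization", "Basic " + basic_auth)
    with request.urlopen(req) as resp:
        return json.loads(resp.read())

print("WHERE COMPLIANT = false" in KSQL)
```

Once the persistent query is created, ksqlDB keeps the derived topic populated continuously as new summary-log events arrive.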
Once stored in S3, we can use tools such as QuickSight, EMR, and Athena to transform, analyze, and visualize the data.
Now let me show you the architecture, as well as a demo of how we implemented the solution.
Here is how the architecture that uses ML models and Confluent Cloud from AWS Marketplace looks.
The feed from the camera can be fed to a Kinesis video stream and then stored in S3. The S3 bucket can be configured to trigger a Lambda function via event notifications; the function performs inference, generates summary logs, and identifies non-compliance alarms.
We want the architecture to be elastic, pay-as-you-go, and able to scale to hundreds and thousands of such cameras. I also didn't want to manage Kafka infrastructure, so I chose to use Confluent Cloud for Apache Kafka from AWS Marketplace. We push the summary logs into a Kafka topic, a ksqlDB query moves the alarm events into another topic, and a Lambda function generates the SNS notification.
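As a rough sketch of the pieces just described, here is what the Lambda function at the center of this pipeline could look like: parse the S3 event, run inference on the frame, and push a summary record to Kafka. Everything here is illustrative; the endpoint name, topic name, bootstrap server, and the prediction schema are assumptions rather than the exact code from the demo.

```python
import json

def keys_from_s3_event(event):
    """Extract (bucket, key) pairs from a standard S3 event notification."""
    return [(r["s3"]["bucket"]["name"], r["s3"]["object"]["key"])
            for r in event["Records"]]

def summarize(predictions):
    """Turn model output into a summary-log record; flags non-compliance
    when a person is detected without a hard hat (schema is illustrative)."""
    missing = [p for p in predictions
               if p["label"] == "person" and not p.get("hard_hat", False)]
    return {"people": sum(p["label"] == "person" for p in predictions),
            "compliant": not missing}

def handler(event, context):
    """Lambda sketch: run inference on each new frame and push the summary
    to a Kafka topic. Names below are placeholders."""
    import boto3                          # lazy imports: the pure helpers
    from confluent_kafka import Producer  # above are testable without them
    runtime = boto3.client("sagemaker-runtime")
    producer = Producer({"bootstrap.servers": "YOUR_CONFLUENT_BOOTSTRAP"})
    for bucket, key in keys_from_s3_event(event):
        frame = boto3.client("s3").get_object(Bucket=bucket, Key=key)["Body"].read()
        resp = runtime.invoke_endpoint(EndpointName="ppe-detector",
                                       ContentType="image/jpeg", Body=frame)
        record = summarize(json.loads(resp["Body"].read())["predictions"])
        producer.produce("summary-logs", json.dumps(record))
    producer.flush()

# Exercising the pure helpers with sample data:
sample_event = {"Records": [{"s3": {"bucket": {"name": "frames"},
                                    "object": {"key": "cam-042/frame-0001.jpg"}}}]}
sample_preds = [{"label": "person", "hard_hat": True},
                {"label": "person", "hard_hat": False},
                {"label": "excavator"}]
print(keys_from_s3_event(sample_event), summarize(sample_preds))
```

From there, the ksqlDB query routes the non-compliant records to the alarm topic, and the SNS notification Lambda takes over.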
Now let us switch to the AWS and Confluent consoles, and show you how this architecture works.
Thanks, Joseph.
To give you a little more insight into the architecture, I have listed some relevant models on this slide.
In today's use case, we used a machine learning model that detects specialized construction machinery, such as a forklift.
We also used a machine learning model that identifies whether a person is wearing a hard hat and a high-visibility protective vest – which are mandated for construction workers under OSHA guidelines to minimize exposure to hazards that cause serious workplace injuries and illnesses.
So as you can see, with the pre-trained ML models and AI services offered by AWS, it becomes really easy to add machine-learning-backed features to your Kafka applications.
I recommend that you explore AWS Marketplace and identify ML models that are suitable for you. And if you need any customizations to an ML model, get in touch with the AWS Marketplace team.
To summarize: do experiment with different tools, such as Confluent Cloud, to see which tools or services can help you scale your architectures; evaluate them and see if they fit your needs.
Use pre-trained machine learning models to make your workflows intelligent and to add differentiating features that help you stand out.
Use managed solutions such as Confluent Cloud from AWS Marketplace.
And most importantly, innovate on behalf of your organization.
Feel free to reach out to me if you have any questions.