2. A Flywheel For Data
Machine Learning
Deep Learning
AI
More Users Better Products
More Data Better Analytics
Object Storage
Databases
Data warehouse
Streaming analytics
BI
Hadoop
Spark/Presto
Elasticsearch
Click stream
User activity
Generated content
Purchases
Clicks
Likes
Sensor data
3. Three pillars of IoT
Things
Sense
& Act
Cloud
Storage
& Compute
Intelligence
Insights &
Logic → Action
AWS Greengrass
4. AWS IoT
DEVICE SDK
Set of client libraries to
connect, authenticate and
exchange messages
DEVICE GATEWAY
Communicate with devices via
MQTT and HTTP
AUTHENTICATION
Secure with mutual
authentication and encryption
RULES ENGINE
Transform messages
based on rules and
route to AWS Services
AWS Services
- - - - -
3P Services
SHADOW
Persistent thing state during
intermittent connections
APPLICATIONS
AWS IoT API
REGISTRY
Identity and Management of
your things
5. Most machine data never reaches the cloud
Medical equipment Industrial machinery Extreme environments
6. Why this problem isn’t going away
Law of physics Law of economics Law of the land
22. STORE CONSUMEPROCESS / ANALYZE
Amazon QuickSight
Apps & Services
Analysis&visualizationNotebooksIDEAPI
Applications & API
Analysis and visualization
Notebooks
IDE
Business
users
Data scientist,
developers
COLLECT ETL
32. Thousands Of Employees Across The Company Focused on AI
Discovery &
Search
Fulfilment &
Logistics
Enhance
Existing Products
Define New
Product Categories
Bring Machine
Learning To All
Artificial Intelligence At Amazon
33. AI Applications on AWS
• Zillow
• Zestimate (using Apache Spark)
• Howard Hughes Corp
• Lead scoring for luxury real estate purchase
predictions
• FINRA
• Anomaly detection, sequence matching,
regression analysis, network/tribe analysis
• Netflix
• Recommendation engine
• Pinterest
• Image recognition search
• Fraud.net
• Detect online payment fraud
• DataXu
• Leverage automated & unattended ML at
large scale (Amazon EMR + Spark)
• Mapillary
• Computer vision for crowd sourced maps
• Hudl
• Predictive analytics on sports plays
• Upserve
• Restaurant table mgmt & POS for forecasting
customer traffic
• TuSimple
• Computer Vision for Autonomous Driving
• Clarifai
• Computer Vision APIs
35. One-Click GPU
Deep Learning
AWS Deep Learning AMI
Up to~40k CUDA cores
MXNet
TensorFlow
Theano
Caffe
Torch
Pre-configured CUDA drivers
Anaconda, Python3
+ CloudFormation template
+ Container Image
36. Can We Help Customers
Put Intelligence At The Heart Of
Every Application & Business?
38. Origin
Destination
Departure Date
Flight Booking
“Book a flight
to London”
Automatic
Speech Recognition
Natural Language
Understanding
Book Flight
London
Utterances
Flight booking
London Heathrow
Intent /
Slot model
London Heathrow
39. Origin
Destination
Departure Date
Flight Booking
“Book a flight
to London”
Automatic
Speech Recognition
Natural Language
Understanding
Book Flight
London
Utterances
Flight booking
London Heathrow
Intent /
Slot model
London Heathrow
LocationLocation
Seattle
40. Origin
Destination
Departure Date
Flight Booking
“Book a flight
to London”
Automatic
Speech Recognition
Natural Language
Understanding
Book Flight
London
Utterances
Flight booking
London Heathrow
Intent /
Slot model
London Heathrow
LocationLocation
Seattle
Prompt
“When would you like to fly?”
“When would you
like to fly?”
Polly
42. Origin
Destination
Departure Date
Flight Booking
“Next Friday”
Automatic
Speech Recognition
Next Friday
Utterances
Natural Language
Understanding
Flight booking
02 / 24 / 2017
Intent /
Slot model
London Heathrow
Seattle
02/24/2017
43. Origin
Destination
Departure Date
Flight Booking
“Next Friday”
Automatic
Speech Recognition
Next Friday
Utterances
Natural Language
Understanding
Flight booking
02 / 24 / 2017
Intent /
Slot model
London Heathrow
Seattle
02/24/2017
Confirmation
“Your flight is booked for next Friday”
“Your flight is booked
for next Friday”
Polly
44. Origin
Destination
Departure Date
Flight Booking
“Next Friday”
Automatic
Speech Recognition
Next Friday
Utterances
Natural Language
Understanding
Flight booking
02 / 24 / 2017
Intent /
Slot model
London Heathrow
Seattle
02/24/2017
Hotel Booking
46. The Philips HealthSuite digital platform
analyzes and stores 15 PB of patient data
gathered from 390 million imaging
studies, medical records, and patient
inputs
Running on AWS provides the reliability,
performance and scalability that Philips
needs to help protect patient data as its
global digital platform grows at the rate of
one petabyte per month.
AWS Customers making an impact with IoT
Video Testimonial
47. The BMW Group
Connected-car application collects sensor
data from BMW 7 Series cars to give drivers
dynamically updated map information.
Built its new car-as-a-sensor (CARASSO)
service in only six months
CARASSO can adapt to rapidly changing
load requirements; can scale up and down
by two orders of magnitude within 24 hours.
By 2018 CARASSO is expected to process
data collected by a fleet of 100,000 vehicles
traveling more than eight billion kilometers.
AWS Customers making an impact with IoT
Video Testimonial
48. “For our market surveillance
systems, we are looking at
about 40% [savings with
AWS], but the real benefits
are the business benefits:
We can do things that we
physically weren’t able to
do before, and that is
priceless.”
- Steve Randich, CIO
Analytics Case Study: Re-architecting Compliance
What FINRA needed
• Infrastructure for its market surveillance platform
• Support of analysis and storage of approximately 75 billion
market events every day
Why they chose AWS
• Fulfillment of FINRA’s security requirements
• Ability to create a flexible platform using dynamic clusters
(Hadoop, Hive, and HBase), Amazon EMR, and Amazon S3
Benefits realized
• Increased agility, speed, and cost savings
• Estimated savings of $10-20m annually by using AWS
49. • Began implementing an S3 data lake on AWS in 2014
• Has been running in production since early 2015
• Now able to integrate all data sets together in one analytics platform, i.e. sales data, marketing
data, manufacturing line data, patient population data, FDA public datasets, etc.
• Can easily share this aggregated data across all business units
• Rapid data experimentation
• Enables new use cases & data innovations not previously possible
• Leverages Amazon EMR and Amazon Redshift for their analytics layer around the lake
• Leverages R-Studio and SAS for data science layer on top of EMR and Redshift
• They use EMR for their ETL layer
• EMR is 50% faster & 30% cheaper than their legacy ETL solution
• Amgen’s AWS S3 data lake won Best Practice Award at Bio-IT World 2016 for ‘Real World Data
Platform & Analytics’
Best Practices
Awards
Bio IT World
2016
Winner