1/ The new Amazon EC2 P3dn instance
2/ With four times the networking bandwidth and twice the GPU memory of the largest P3 instance, P3dn is ideal for large-scale distributed training. No one else has anything close.
3/ P3dn.24xlarge instances offer 96 vCPUs of Intel Skylake processors to reduce the time needed to preprocess data for machine learning training.
4/ The enhanced networking of the P3dn instance allows GPUs to be used more efficiently in multi-node configurations, so training jobs complete faster.
5/ Finally, the extra GPU memory allows developers to easily handle more advanced machine learning models, such as holding and processing multiple batches of 4K images for image classification and object detection systems.
1/ In order to take advantage of all their data to train more accurate, more sophisticated models, or to train their existing models more quickly, customers usually need to scale training across multiple GPUs, not just within a single instance, but across multiple instances.
2/ This can often involve hundreds of GPUs.
3/ Unfortunately, the inner workings of TensorFlow make scaling across GPUs on different instances very inefficient.
4/ For example, at 256 GPUs, TensorFlow is only able to use 65% of the total capacity; that’s incredibly wasteful and expensive. [NOTE: a neural network is made up of hundreds of thousands of weighted connections; these weights are adjusted over and over again, potentially billions of times, during training. When training is distributed across GPUs, you also need to share and update these weights efficiently with all of the GPUs. That's relatively easy on one instance, where you can just store them in memory, but super hard when you get to tens or hundreds of GPUs. Hard problem.]
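[NOTE: a minimal sketch of the weight-sharing problem, using plain NumPy as a stand-in for data-parallel training; this is an illustration only, not TensorFlow's actual internals, and the worker count, gradient function, and learning rate are made up.]

import numpy as np

# Illustration only: data-parallel SGD where each "worker" stands in for one GPU
# holding a full copy of the model weights.
num_workers = 4
num_weights = 10
lr = 0.1

weights = np.zeros(num_weights)
rng = np.random.default_rng(0)

def local_gradient(w, batch):
    # Stand-in for a backward pass on one worker's own mini-batch.
    return 2 * (w - batch.mean(axis=0))

for step in range(100):
    batches = rng.normal(size=(num_workers, 32, num_weights))
    grads = [local_gradient(weights, b) for b in batches]

    # The expensive part at scale: every worker must end up with the same
    # averaged gradient. On one instance this is a memory copy; across
    # hundreds of GPUs it becomes a network-bound all-reduce.
    avg_grad = np.mean(grads, axis=0)
    weights -= lr * avg_grad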
TRANSITION: Customers told us this was becoming a major problem when working with TF, so we…
1/ Today I’m pleased to announce that we have been able to overcome this limitation, significantly improving TensorFlow's scaling efficiency to 90% across 256 GPUs.
2/ That's close to linear scalability across hundreds of GPUs.
3/ We did this by improving the way TensorFlow shares model parameters across multiple instances, making that sharing faster and more efficient.
4/ We got a further improvement from the 100 Gbps networking available on the new P3dn instances, but the majority of the benefit came from the internal architectural changes we made to the framework itself (nine tenths of the benefit came from the TF improvements).
5/ This means that customers can use more GPUs to train on more data, in less time.
TRANSITION: to give you an idea of the impact of this……
1/ Let’s take a look at a common computer vision model for image classification, a deep neural network called ResNet-50 <rez net fifty>, trained on hundreds of thousands of images.
2/ The fastest time to train this model, by a team in Mountain View, was 30 minutes, using a specially built training algorithm which was optimized just for this single neural network, and for specialized hardware which is only available in beta (and not available to most developers). These improvements are locked away from most models, and out of reach of the vast majority of developers.
3/ With the improvements we made in TF, we reduced training time by over 50%, to just 14 minutes. This is the fastest time for training ResNet using TensorFlow, anywhere.
4/ But even more importantly, our optimizations can be applied to multiple different models, including convolutional neural networks (images) and recurrent neural networks (language, recommendation).
5/ It also runs on P3 instances, which are globally available to all developers in 14 regions.
6/ Available in Amazon SageMaker and the AWS Deep Learning AMIs.
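[NOTE: for reference, a sketch of what kicking off a multi-instance training job might look like with the SageMaker Python SDK; the script name, role ARN, S3 path, and version strings are placeholders, not part of the announcement.]

from sagemaker.tensorflow import TensorFlow

# Hypothetical example: scale a TensorFlow training script across several P3dn instances.
estimator = TensorFlow(
    entry_point="train.py",                                # your training script
    role="arn:aws:iam::123456789012:role/SageMakerRole",   # placeholder role
    instance_count=8,                                      # scale out across instances
    instance_type="ml.p3dn.24xlarge",
    framework_version="2.11",
    py_version="py39",
)
estimator.fit({"training": "s3://my-bucket/imagenet/train"})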
1/ Successful models are built on high-quality training data, and collecting and labeling the training dataset at the start of this workflow still involves a lot of time and effort.
2/ For example, building a computer vision system that is reliable enough to identify objects - such as traffic lights, stop signs, or pedestrians - requires thousands of hours of video recordings, consisting of hundreds of millions of video frames.
1/ And each one of these frames must be labeled to build a dataset that can be used for training.
2/ This means human labelers first need to evaluate each frame and label objects, such as traffic signals, pedestrians, other vehicles, and even the road, so that the model can learn to identify these objects on its own.
3/ Today, customers distribute the labeling tasks across as many as thousands of human labelers, adding significant overhead and cost because of the sheer scale and complexity of managing so many people, and even then the process takes months.
4/ Further, if the labelers incorrectly label objects, the system will learn from the bad information and make inaccurate predictions, leading to real-world consequences, such as a car failing to detect a stop sign.
5/ Customers try to filter out errors with audits and redundant reviews, but this increases the time and cost required.
TRANSITION: Increasingly, the time, expense, and complexity required for accurate labeling of large datasets have become so prohibitive that customers abandon their efforts to train new types of models that solve sophisticated problems.
1/ It all starts with data - raw data, which, as yet, does not have the labels needed; it’s just raw text, or raw images, or speech, without details of what is inside the text, what items are inside the images, or what words and context the speech contains.
1/ To get started, Ground Truth selects a small, diverse sample of the data and sends it to humans for annotation;
2/ It can do this through Mechanical Turk, or your own workforce, or a crowdsourcing company.
3/ Ground Truth collects the human-labeled data and builds a special, custom machine learning model. It then starts to run the rest of the raw data through that model.
1/ Where the model has high confidence in the results, based on what it has learned so far, it will apply the annotations to the training data automatically.
1/ Where the model is less confident in the results, it will pass the new data to human annotators to provide labels.
1/ These annotations are validated across multiple human annotators, and then contribute to the training dataset.
2/ Additionally, those new labels are passed back to the custom model to improve automated labels.
3/ This means that over time, Ground Truth can label more data automatically - only a small percentage of new data will need to be sent to human annotators; the rest can be done automatically.
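[NOTE: a conceptual sketch of that loop in Python; this is an illustration of the idea, not Ground Truth's actual algorithm, and the model, confidence threshold, and the "ask_humans" step are all stand-ins.]

import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic unlabeled pool plus a pretend human-labeling oracle, for illustration.
rng = np.random.default_rng(0)
pool = rng.normal(size=(5000, 20))
true_labels = (pool[:, 0] + pool[:, 1] > 0).astype(int)    # hidden ground truth

def ask_humans(indices):
    return true_labels[indices]                            # pretend humans label these items

seed = rng.choice(len(pool), size=100, replace=False)      # small, diverse seed set
labels = {int(i): int(true_labels[i]) for i in seed}       # human-labeled first
CONFIDENCE = 0.95

while len(labels) < len(pool):
    X = pool[list(labels)]
    y = np.array(list(labels.values()))
    model = LogisticRegression(max_iter=1000).fit(X, y)    # the custom model

    unlabeled = [i for i in range(len(pool)) if i not in labels]
    probs = model.predict_proba(pool[unlabeled]).max(axis=1)

    confident = [i for i, p in zip(unlabeled, probs) if p >= CONFIDENCE]
    uncertain = [i for i, p in zip(unlabeled, probs) if p < CONFIDENCE][:200]

    for i in confident:                                    # auto-label high-confidence items
        labels[i] = int(model.predict(pool[[i]])[0])
    for i, y_h in zip(uncertain, ask_humans(uncertain)):   # humans label the rest
        labels[i] = int(y_h)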
1/ Amazon Mechanical Turk, to access a crowdsourced workforce of over 500,000 workers;
2/ A private workforce of your own employees, for data which requires confidentiality, service guarantees, or the special skills of pre-authorized workers;
3/ or specific third-party vendors (such as iMerit, Vivitec, Cogito Tech, CapeStart and iVision), which offer a range of prices and geographic availability.
This lowers the overall cost of creating new datasets and of keeping datasets up to date with newly generated data, and provides a better return on the cost of human annotation, since those annotations automatically contribute to the overall efficiency of the system.
We think Ground Truth will significantly change the economics of generating training data so that more data becomes available for machine learning. Really exciting.
1/ Once deployed in production, SM manages the compute infrastructure on your behalf
2/ Not only does it handle auto scaling, it performs health checks, handles node failures under the covers, applies security patches, and performs other routine maintenance (see the auto scaling sketch after this list)
3/ ALL with CloudWatch monitoring and logging
4/ FINALLY, one other really cool thing with SM is that it’s built in a MODULAR way (you can build and train here and deploy elsewhere, like the edge, OR host models that were trained elsewhere)… Your choice
5/ We couldn’t be more excited about SM and think it’s a huge playing field leveler for everyday developers and scientists
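[NOTE: to make the auto scaling point above concrete, a minimal sketch using boto3 and the Application Auto Scaling API; the endpoint and variant names and the target value are hypothetical.]

import boto3

# Hypothetical endpoint/variant: SageMaker endpoints register with Application Auto Scaling.
autoscaling = boto3.client("application-autoscaling")
resource_id = "endpoint/my-endpoint/variant/AllTraffic"

autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=4,
)

autoscaling.put_scaling_policy(
    PolicyName="invocations-target-tracking",
    ServiceNamespace="sagemaker",
    ResourceId=resource_id,
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 1000.0,   # target invocations per instance
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
    },
)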
1/ From today, developers can access over a hundred algorithms and models, covering a remarkable breadth of capabilities, from our Marketplace sellers. This is just a subset, but it gives you a sense of the capabilities which are now available; from speaker identification and speech recognition, to video classification and handwriting recognition. As a developer, these are all just a few clicks away.
2/ You can just browse or search the AWS Marketplace as normal, select the algorithm or model you want to use,
3/ And subscribe in a single click.
4/ The new algorithm or model is then available in the Amazon SageMaker console, and you can train models using the algorithm, or start running predictions on the pre-trained models immediately.
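[NOTE: a sketch of what that flow might look like from the SageMaker Python SDK once you have subscribed; the ARNs, role, and data paths are placeholders.]

from sagemaker import ModelPackage
from sagemaker.algorithm import AlgorithmEstimator

# Hypothetical ARNs: train with an algorithm subscribed to in AWS Marketplace...
algo = AlgorithmEstimator(
    algorithm_arn="arn:aws:sagemaker:us-east-1:123456789012:algorithm/example-algo",
    role="arn:aws:iam::123456789012:role/SageMakerRole",
    instance_count=1,
    instance_type="ml.c5.xlarge",
)
algo.fit({"training": "s3://my-bucket/training-data"})

# ...or deploy a pre-trained Marketplace model package and start predicting right away.
model = ModelPackage(
    role="arn:aws:iam::123456789012:role/SageMakerRole",
    model_package_arn="arn:aws:sagemaker:us-east-1:123456789012:model-package/example-model",
)
predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")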
These performance and accuracy trade-offs are felt most acutely at the edge.
1/ IoT applications are usually running on devices, out there in the real world. This means that the accuracy of models is felt quickly, and immediately. Consumer IoT applications have a high expectation of accuracy - such as Alexa detecting the wake word reliably - the accuracy of that model really matters to the overall experience. In industrial IoT, devices are often responsible for monitoring and maintaining core manufacturing processes, or for safety. The accuracy of a model here is critical.
2/ Applications running on IoT devices at the edge are commonly very sensitive to latency; it’s part of the reason why customers are running the workload there in the first place, because they can’t afford the round trip to the cloud and back. So any increase in that latency can have a meaningful impact on the success of the device itself.
3/ IoT applications are often incredibly resource constrained, in a way which is much more acute than in the cloud. The devices are smaller, and have less memory and processing power, which is a real problem for machine learning models.
4/ In many cases, IoT applications need to run on very diverse hardware platforms, with a dizzying myriad of processor architectures. To get any sort of performance, developers have to optimize their models for each specific platform by hand.
5/ Finally, one of the key benefits of machine learning can get lost; the ability to continually improve the model. IoT applications are great data generators, and once that data is “ground-truthed”, it can be used to build more sophisticated models. However, if the effort to optimize those improved models for the constraints and diverse hardware at the edge is high, then it’s less likely to happen, and developers are leaving money on the table. A real missed opportunity.
6/ We don’t think that customers should have to choose between accuracy and performance. It’s a false choice, with a high cost.
So I’m excited to announce a new feature of SageMaker…
1/ There are a lot of demands placed on organizations when dealing with documents. What they typically want to be able to do sounds straightforward…
2/ They want to be able to identify documents in any format;
3/ and then extract text from those documents, accurately.
4/ But there are a whole ton of challenges which make this difficult, such as the variety of forms and formats, and the varying quality of the documents themselves.
5/ The way customers try to overcome this complexity today is either by manual review (which is accurate, but time consuming and expensive), or
6/ with simple OCR and/or..
7/ template-based data extraction (which is fast, but tends not to be accurate enough, so they end up sending the documents to manual review or verification anyway).
TRANSITION: we think there is a better way, and that instead of manual reviews, simplistic OCR, and templates, we can replace that heavy lifting with smart, cheap, powerful machine learning…