SlideShare una empresa de Scribd logo
1 de 80
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
Large Scale Deep Learning
with BigDL
N o v e m b e r 3 0 , 2 0 1 7
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Large Scale Deep Learning
with BigDL
T i m F o x | B i g D a t a a n d M a c h i n e L e a r n i n g C o n s u l t a n t | E l e p h a n t S c a l e
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ABOUT ME
T i m F o x ,
P r i n c i p a l @ E l e p h a n t S c a l e
P r a c t i t i o n e r a n d T r a i n e r i n D a t a E n g i n e e r i n g
a n d D a t a S c i e n c e
Author of “Data Science in Python” on LinkedIn Learning
t i m @ e l e p h a n t s c a l e . c o m
L i n k e d i n : t i m - f o x - 0 0 6 3 5 4 1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ABOUT Elephant Scale
• T r a i n i n g i n B i g D a t a a n d A I t e c h n o l o g i e s
• B i g D a t a : S p a r k , H a d o o p , C l o u d , N o S Q L , S t r e a m i n g
• A I : M a c h i n e L e a r n i n g , D e e p L e a r n i n g , B i g D L ,
T e n s o r f l o w
• B i g D L t r a i n i n g a v a i l a b l e !
• P u b l i c a n d P r i v a t e t r a i n i n g s a v a i l a b l e
• B i g D L S a n d b o x : e l e p h a n t s c a l e . c o m / s a n d b o x
E l e p h a n t s c a l e . c o m
i n f o @ e l e p h a n t s c a l e . c o m
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Quick Roundup of AI / Machine
Learning / Deep Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AI / MACHINE LEARNING / DEEP LEARNING
Artificial Intelligence (AI):
Broader concept of machines being able to carry
out 'smart' tasks
Machine Learning:
A type of AI that allows software to learn from
data without explicitly programmed
Deep Learning:
Using Neural Networks to solve some hard
problems
A r t i f i c i a l
I n t e l l i g e n c e
M a c h i n e
L e a r n i n g
D e e p
L e a r n i n g
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEEP LEARNING APPLICATIONS
S e l f D r i v i n g C a r s
• ML system using image recognition
• Where the edge of the road / road sign / car in front
F a c e r e c o g n i t i o n
• Facebook images
• System learns from images manually tagged and then automatically detects faces in
uploaded photos
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEEP LEARNING HISTORY
E a r l y a t t e m p t s a t D e e p L e a r n i n g d i d n o t s u c c e e d .
• Compute Power was insufficient for the time.
• Training Datasets were insufficiently sized for good results.
• We lacked the ability to parallelize our work.
I n t h e m o d e r n e r a , D e e p L e a r n i n g h a s b e e n s u c c e s s f u l .
• 'Big Data' – now we have so much data to train our models
• 'Big Data ecosystem' – excellent big data platforms (Hadoop, Spark, NoSQL)
are available as open source
• 'Big Compute' - cloud platforms significantly lowered the barrier to massive
compute power
• $1 buys you 16 core + 128 G + 10 Gigabit machine for 1 hr on AWS!
• So running a 100 node cluster for 5 hrs  $500
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AI SOFTWARE ECO SYSTEM
M a c h i n e L e a r n i n g D e e p L e a r n i n g
Java
- Weka
- Mahout
- DeepLearning4J
Python
- SciKit
- Tensorflow
- Theano
- Caffe
R
- Many libraries - Deepnet
- Darch
Distributed
- H20
- Spark
- H20
- Spark
- BigDL
Cloud - AWS - AWS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
MACHINE LEARNING AND BIG DATA
Until recently most of the machine learning is done on “single computer” (with lots of
memory–100s of GBs)
Most R/Python/Java libraries are “single node based”
Now Big Data tools make it possible to run machine learning algorithms at massive scale –
distributed across a cluster
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
MODERN DEEP LEARNING FRAMEWORKS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TOOLS FOR SCALABLE MACHINE LEARNING
A p a c h e S p a r k M L
• Runs on top of popular Spark framework
• Massively scalable
• Can use memory (caching) effectively for
iterative algorithms
• Language support: Scala, Java, Python, R
B i g D L
• Built for Apache Spark and Optimized for Intel Xeon
• Language Support: Scala, Java, Python
T e n s o r F l o w
• Based on “data flow graphs”
• Language support: Python, C++
• https://www.tensorflow.org/
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
TOOLS FOR SCALABLE CLOUD
MACHINE LEARNING
A m a z o n M a c h i n e L e a r n i n g
• Ready to go algorithms
• Visualization tools
• Wizards to guide
• Scalable on AWS Cloud
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BigDL
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
WHAT IS BIGDL
A d i s t r i b u t e d d e e p l e a r n i n g l i b r a r y f o r A p a c h e S p a r k
F e a t u r e p a r i t y w i t h p o p u l a r d e e p l e a r n i n g f r a m e w o r k s
• Caffe, Torch, Tensorflow
H i g h P e r f o r m a n c e
• Powered by Intel Math Kernel Library (MKL) and multi threaded programming
C a n s c a l e t o h u g e d a t a s e t s
• Using Apache Spark for scale
O p e n s o u r c e ! ( D e c 2 0 1 6 )
A c t i v e D e v e l o p m e n t
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
PRODUCTION ML/DL SYSTEMS ARE COMPLEX!
A c t u a l M L / D L i s o n l y s m a l l p o r t i o n o f m a s s i v e p r o d u c t i o n
s y s t e m
B i g D L r u n n i n g o n a s c a l a b l e p l a t f o r m l i k e S p a r k h e l p s
s i m p l i f y t h e c o m p l e x i t y
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BIGDL FILLS THE 'GAP' IN BIG DATA +
DEEP LEARNING
F o l l o w s p r o v e n d e s i g n p a t t e r n s f o r d e a l i n g w i t h B i g D a t a
S e n d s ' c o m p u t e t o d a t a ' r a t h e r t h a n r e a d i n g m a s s i v e d a t a
o v e r n e t w o r k .
U s e s ' d a t a l o c a l i t y ' o f H D F S ( H a d o o p F i l e S y s t e m )
U t i l i z e s ' c l u s t e r m a n a g e r s ' l i k e Y A R N / M E S O S
• Automatically handles hardware/software failures
• Elasticity and resource sharing in a cluster
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BIGDL & SPARK
R u n B i g D L a p p l i c a t i o n s a s S p a r k a p p l i c a t i o n s
S c a l a , J a v a , a n d P y t h o n s u p p o r t
U s e o t h e r S p a r k ' s f e a t u r e s
• In memory compute
• Integrate with Spark ML and Streaming
E a s y d e v e l o p m e n t w i t h J u p y t e r N o t e b o o k
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BIGDL VS TENSORFLOW
B i g D L T e n s o r f l o w
Runtime Scala Engine with Python
front-end
C++ Engine with Python
front-end
Hadoop compatibility Can run natively on Spark &
Hadoop
Accesses Hadoop data as a
client only
Distributed Operation Scalable with Apache Spark
for massive scale out of the
box
Does not support massive
distribution out of the box
Runs Tensorflow Models Yes Yes
Acceleration CPU w/MKL CPU/GPU
Summary Excellent for distributing
deep-learning models to
massive scale on big-data.
Great TCO value.
Excellent library for small-
medium scale data, although
GPU hardware costs can be
significant.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BIGDL: BIG COMPUTE PLUS BIG DATA
B i g D L h e l p s u s i n b a l a n c i n g o u r n e e d s
• Big Compute: Fast Linear Algebra, Intel MKL library
• Optimized for Intel Xeon
• Big Data: I/O parallelized to run on many CPUs
B i g D L A l l o w s M a s s i v e S c a l a b i l i t y
• Natively Designed to run on Spark
• Works with Hadoop eco system (via Spark)
Hadoop is THE Big Data platform for on-premise deployments
P l a y s n i c e l y w i t h o t h e r B i g D L f r a m e w o r k s
• Use existing Tensorflow or Caffe at scale in BigDL
• Train new models based on existing TF / Caffe models
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BIGDL USE CASES
F r a u d d e t e c t i o n
S e n t i m e n t a n a l y s i s
I m a g e r e c o g n i t i o n
Find more at: https://github.com/intel-analytics/analytics-zoo/
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GPUs and CPUs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GPUS (GRAPHICS PROCESSING UNITS)
G P U s h a v e a d d r e s s e d p a s t i s s u e s i n t r a i n i n g p e r f o r m a n c e
• Example: Tensorflow - optimized to run well on GPUs.
C P U i n p a s t n o t v e c t o r i z e d f o r p a r a l l e l c o m p u t e
• Meant that GPUs were much faster for deep learning
M o d e r n I n t e l X e o n C P U s h a v e v e c t o r i z e d l i n e a r a l g e b r a
• Properly optimized, approaches speed of GPUs
• CPUs are now a credible alternative to running on GPUs
• Cost Advantage and Scalability
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
INTEL MATH KERNEL LIBRARY (MKL)
F e a t u r e s h i g h l y o p t i m i z e d , t h r e a d e d , a n d v e c t o r i z e d m a t h
f u n c t i o n s t h a t m a x i m i z e p e r f o r m a n c e o n e a c h p r o c e s s o r f a m i l y .
U t i l i z e s i n d u s t r y - s t a n d a r d C a n d F o r t r a n A P I s f o r c o m p a t i b i l i t y
w i t h p o p u l a r B L A S , L A P A C K , a n d F F T W f u n c t i o n s — n o c o d e
c h a n g e s r e q u i r e d .
D i s p a t c h e s o p t i m i z e d c o d e f o r e a c h p r o c e s s o r a u t o m a t i c a l l y
w i t h o u t t h e n e e d t o b r a n c h c o d e .
P r o v i d e s p r i o r i t y s u p p o r t , c o n n e c t i n g y o u d i r e c t l y t o I n t e l
e n g i n e e r s f o r c o n f i d e n t i a l a n s w e r s t o t e c h n i c a l q u e s t i o n s
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
INTEL MKL PERFORMANCE
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CPU VERSUS GPU FOR BIG DATA
C P U o f f e r s h i g h e r s c a l a b i l i t y a t l o w e r c o s t v e r s u s G P U
O p t i m i z e d S o f t w a r e a n d l i b r a r i e s o n C P U a l l o w s i n g l e - n o d e
p e r f o r m a n c e t o a p p r o a c h G P U p e r f o r m a n c e .
G P U p l u s C P U a r c h i t e c t u r e s c a n b e e f f e c t i v e f o r s m a l l e r n u m b e r
o f n o d e s , w h e n c o s t i s n o t a c o n c e r n .
” B i g C o m p u t e ” v e r s u s “ B i g D a t a ”
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Running BigDL
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
RUNNING BIGDL
D e v e l o p i n g :
U s e t h e f o l l o w i n g t o d e v e l o p
y o u r B i g D L a p p s e f f o r t l e s s l y
• Docker
• VM Sandbox
D e p l o y i n g :
C l o u d r e a d y d e p l o y m e n t
• Amazon AMI
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEMO: GETTING STARTED WITH BIGDL
W e w i l l p r o v i d e :
• Docker
• Sandbox VM
• AWS Marketplace AMI
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
BigDL Summary
B i g D L o f f e r s o u t s t a n d i n g s c a l a b i l i t y a n d p e r f o r m a n c e
B i g D L o p t i m i z e s T C O b y r u n n i n g b e i n g t u n e d a n d o p t i m i z e d f o r
I n t e l X e o n P r o c e s s o r s
B i g D L b r i n g s d e e p l e a r n i n g t o S p a r k C l u s t e r s a n d H a d o o p
D a t a s e t s
B i g D L c a n b e u s e d t o d e p l o y T e n s o r f l o w a n d C a f f e m o d e l s t o b i g
d a t a .
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
IMAGE RECOGNITION WITH APACHE
SPARK AND BIGDL
A l e x K a l i n i n | V P , A I / M a c h i n e L e a r n i n g | S i z m e k
ABOUT ME
A l e x K a l i n i n
V P , A I / M a c h i n e L e a r n i n g | S i z m e k
a l e x . k a l i n i n @ s i z m e k . c o m
L i n k e d i n : l i n k e d i n . c o m / i n / a l e x k a l i n i n /
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
https://www.linkedin.com/in/alexkalinin/
alex.kalinin@sizmek.com
1 0 0 , 0 0 0 , 0 0 0 , 0 0 0 r e q u e s t s p e r d a y
P B s o f t r a i n i n g d a t a
AI-POWERED MARKETING AND OPTIMIZATION
7 0 , 0 0 0 , 0 0 0 / m i n u t e
1 , 2 0 0 , 0 0 0 / s e c
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FEED-FORWARD NETWORK
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
43
37
45
40
𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖)
?
?
?
?
?
?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
-0.53
0.01
-0.17
0.70
0.51
?
?
𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖)
43
37
45
40
?
?
?
?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
-0.53
0.01
-0.17
0.70
0.51
?
?
𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖)
43
37
45
40
-1.56
?
?
?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
-0.53
0.01
-0.17
0.70
0.51
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40
-1.56
?
?
?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
-0.53
0.01
-0.17
0.70
0.51
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40
?
?
?
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
43
37
45
40
-0.12
0.13
0.21
-0.07
-0.05
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
?
?
0
11.9
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40
-.11
?
0
11.9
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40 ?
0
11.9
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
?
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40 0.15
0
11.9
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
-0.67
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40 0.15
0
11.9
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
0
?
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40 0.15
0
11.9
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
0
0.52
𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 )
43
37
45
40 0.15
0
11.9
0
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FEED-FORWARD NETWORK
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FULLY CONNECTED
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FULLY CONNECTED
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FULLY CONNECTED
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FULLY CONNECTED
Input Size:
Connections:
40,000
1,600,000,000
200
200
200
200
10 layers: 16 billion
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
INVENTION OF CONVOLUTIONAL
NEURAL NETWORKS
• L e N e t - 5 n e t w o r k d e v e l o p e d i n 1 9 9 8 b y Y a n n L e C u n
• T o r s t e n H u b e l a n d D a v i d W i e s e l
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HUBEL & WIESEL
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1363130/
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HIERARCHICAL & LOCAL VISUAL CORTEX
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HIERARCHICAL & LOCAL VISUAL CORTEX
Lines,
Dots
Orientation,
Movement
High-Level
Shapes
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
KEY FEATURES OF CONVOLUTIONAL NETWORK
• C o n v o l u t i o n
• P o o l i n g
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
FULLY CONNECTED
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
O n l y f o u r we i g h t s
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTION
0.10 -0.06
0.24 0.17
Filter
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
POOLING
Source: https://cs231n.github.io/convolutional -networks/
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CONVOLUTIONAL NETWORK
0
0
0
0
1
0
0
0
0
0
0
1
2
3
4
5
6
7
8
9
Convolution Pooling Convolution PoolingInput FC FC
Source: https://www.clarifai.com/technology
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
http://scs.ryerson.ca/~aharley/vis/conv/flat.html
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEMO – Image Recognition
G i t H u b : h t t p s : / / g i t h u b . c o m / a l e x - k a l i n i n / l e n e t - b i g d l
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Getting Started With BigDL
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ABOUT ME
S u j e e M a n i y a m
F o u n d e r / P r i n c i p a l @ E l e p h a n t S c a l e
P r a c t i t i o n e r a n d T r a i n e r i n D a t a E n g i n e e r i n g
a n d D a t a S c i e n c e
Author
- "Hadoop and Spark" video training on O'Reilly Media
- "HBase Design Patterns"
- "Hadoop illuminated"
s u j e e @ e l e p h a n t s c a l e . c o m
L i n k e d i n : l i n k e d i n . c o m / i n / s u j e e m a n i y a m
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
RUNNING BIGDL
D e v e l o p i n g :
U s e t h e f o l l o w i n g t o d e v e l o p
y o u r B i g D L a p p s e f f o r t l e s s l y
• Docker
• VM Sandbox
• Amazon AMI
D e p l o y i n g :
C l o u d r e a d y d e p l o y m e n t
• Amazon AMI
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GETTING STARTED WITH BIGDL
W e w i l l d e m o n s t r a t e
• Docker
• Sandbox VM
• AWS Marketplace AMI
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Docker
Step 1 : Install Docker on your laptop
Step 2 : get docker image
docker pull elephantscale/bigdl-sandbox
Step 3 : download tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 4 : Launch docker
cd bigdl-tutorials
./run-bigdl-docker.sh elephantscale/bigdl-sandbox
Step 5 : Go to Jupyter notebook
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
VM-Sandbox
Step 1 : Install VMware Player or VirtualBox on your laptop
Step 2 : Download BigDL-Sandbox image
http://elephantscale.com/sandbox/
Step 3 : (In Sandbox) download tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 4 : (In Sandbox) Run BigDL natively
cd bigdl-tutorials
./run-bigdl-native.sh
Step 5 : (In Sandbox) Go to Jupyter notebook
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Docker on AWS
Step 1 : Spin up an AMI (Ubuntu recommended)
Step 2 : Install Docker on the instance
Step 3 : get docker image
docker pull elephantscale/bigdl-sandbox
Step 4 : download tutorials
git clone https://github.com/elephantscale/bigdl-tutorials
Step 5 : Launch docker
cd bigdl-tutorials
./run-bigdl-docker.sh elephantscale/bigdl-sandbox
Step 6 : Go to Jupyter notebook
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AMI on AWS
Step 1 : Spin up BigDL AMI
Step 2 : download tutorials
git clone https://github.com/elephantscale/bigdl-
tutorials
Step 3 : Run BigDL
cd bigdl-tutorials
./run-bigdl-native.sh
Step 4 : Go to Jupyter notebook
Everyone Gets Free Credits on AWS
Ask Dave Nielsen
dave.nielsen@intel.com
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Notebooks and Slides
Tutorials: github.com/dnielsen/bigdl-resources
Slides: bit.ly/bigdlreinvent2017 (as in BigDL re:invent)
Tim Fox Sujee Maniyam Alex Kalinin Dave Nielsen
Elephant Scale Elephant Scale Sizmek Intel Software
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
THANK YOU!

Más contenido relacionado

La actualidad más candente

Optimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL JoinsOptimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL JoinsDatabricks
 
How to Actually Tune Your Spark Jobs So They Work
How to Actually Tune Your Spark Jobs So They WorkHow to Actually Tune Your Spark Jobs So They Work
How to Actually Tune Your Spark Jobs So They WorkIlya Ganelin
 
Image Processing on Delta Lake
Image Processing on Delta LakeImage Processing on Delta Lake
Image Processing on Delta LakeDatabricks
 
Productizing Structured Streaming Jobs
Productizing Structured Streaming JobsProductizing Structured Streaming Jobs
Productizing Structured Streaming JobsDatabricks
 
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...Dremio Corporation
 
Spark Autotuning: Spark Summit East talk by Lawrence Spracklen
Spark Autotuning: Spark Summit East talk by Lawrence SpracklenSpark Autotuning: Spark Summit East talk by Lawrence Spracklen
Spark Autotuning: Spark Summit East talk by Lawrence SpracklenSpark Summit
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...James Serra
 
SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020andyseaborne
 
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query ProcessingApache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query ProcessingHortonworks
 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure DataTaro L. Saito
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemDatabricks
 
Using Apache Spark as ETL engine. Pros and Cons
Using Apache Spark as ETL engine. Pros and Cons          Using Apache Spark as ETL engine. Pros and Cons
Using Apache Spark as ETL engine. Pros and Cons Provectus
 
DASK and Apache Spark
DASK and Apache SparkDASK and Apache Spark
DASK and Apache SparkDatabricks
 
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudAmazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudNoritaka Sekiyama
 
Web ARChive (WARC) File Format
Web ARChive (WARC) File FormatWeb ARChive (WARC) File Format
Web ARChive (WARC) File FormatSawood Alam
 
How to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkHow to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkDatabricks
 
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...Interactive real time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...DataWorks Summit
 
MLflow at Company Scale
MLflow at Company ScaleMLflow at Company Scale
MLflow at Company ScaleDatabricks
 

La actualidad más candente (20)

Optimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL JoinsOptimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL Joins
 
How to Actually Tune Your Spark Jobs So They Work
How to Actually Tune Your Spark Jobs So They WorkHow to Actually Tune Your Spark Jobs So They Work
How to Actually Tune Your Spark Jobs So They Work
 
Image Processing on Delta Lake
Image Processing on Delta LakeImage Processing on Delta Lake
Image Processing on Delta Lake
 
Productizing Structured Streaming Jobs
Productizing Structured Streaming JobsProductizing Structured Streaming Jobs
Productizing Structured Streaming Jobs
 
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
 
Spark Autotuning: Spark Summit East talk by Lawrence Spracklen
Spark Autotuning: Spark Summit East talk by Lawrence SpracklenSpark Autotuning: Spark Summit East talk by Lawrence Spracklen
Spark Autotuning: Spark Summit East talk by Lawrence Spracklen
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
 
SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020SHACL in Apache jena - ApacheCon2020
SHACL in Apache jena - ApacheCon2020
 
Apache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query ProcessingApache Tez: Accelerating Hadoop Query Processing
Apache Tez: Accelerating Hadoop Query Processing
 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure Data
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format Ecosystem
 
Using Apache Spark as ETL engine. Pros and Cons
Using Apache Spark as ETL engine. Pros and Cons          Using Apache Spark as ETL engine. Pros and Cons
Using Apache Spark as ETL engine. Pros and Cons
 
DASK and Apache Spark
DASK and Apache SparkDASK and Apache Spark
DASK and Apache Spark
 
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudAmazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache Spark
 
Web ARChive (WARC) File Format
Web ARChive (WARC) File FormatWeb ARChive (WARC) File Format
Web ARChive (WARC) File Format
 
Machine Learning in the IoT with Apache NiFi
Machine Learning in the IoT with Apache NiFiMachine Learning in the IoT with Apache NiFi
Machine Learning in the IoT with Apache NiFi
 
How to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkHow to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache Spark
 
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...Interactive real time dashboards on data streams using Kafka, Druid, and Supe...
Interactive real time dashboards on data streams using Kafka, Druid, and Supe...
 
MLflow at Company Scale
MLflow at Company ScaleMLflow at Company Scale
MLflow at Company Scale
 

Similar a BigDL: Image Recognition Using Apache Spark with BigDL - MCL358 - re:Invent 2017

BigDL Deep Learning in Apache Spark - AWS re:invent 2017
BigDL Deep Learning in Apache Spark - AWS re:invent 2017BigDL Deep Learning in Apache Spark - AWS re:invent 2017
BigDL Deep Learning in Apache Spark - AWS re:invent 2017Dave Nielsen
 
Data Modelling at Scale
Data Modelling at ScaleData Modelling at Scale
Data Modelling at ScaleDavid Simons
 
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...Amazon Web Services
 
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...Amazon Web Services
 
Open Source at AWS: Code, Contributions, Collaboration, and Communication
Open Source at AWS: Code, Contributions, Collaboration, and CommunicationOpen Source at AWS: Code, Contributions, Collaboration, and Communication
Open Source at AWS: Code, Contributions, Collaboration, and CommunicationAmazon Web Services
 
AWS re:Invent Recap 2016 Taiwan part 2
AWS re:Invent Recap 2016 Taiwan part 2AWS re:Invent Recap 2016 Taiwan part 2
AWS re:Invent Recap 2016 Taiwan part 2Amazon Web Services
 
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...Amazon Web Services
 
Slides: How Automating Data Lineage Improves BI Performance
Slides: How Automating Data Lineage Improves BI PerformanceSlides: How Automating Data Lineage Improves BI Performance
Slides: How Automating Data Lineage Improves BI PerformanceDATAVERSITY
 
Serverless Text Analytics with Amazon Comprehend
Serverless Text Analytics with Amazon ComprehendServerless Text Analytics with Amazon Comprehend
Serverless Text Analytics with Amazon ComprehendDonnie Prakoso
 
What it Means to be a Next-Generation Managed Service Provider
What it Means to be a Next-Generation Managed Service ProviderWhat it Means to be a Next-Generation Managed Service Provider
What it Means to be a Next-Generation Managed Service ProviderDatadog
 
Building AI-powered Apps on AWS
Building AI-powered Apps on AWSBuilding AI-powered Apps on AWS
Building AI-powered Apps on AWSAdrian Hornsby
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIAmazon Web Services
 
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...Amazon Web Services
 
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the Haystack
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the HaystackGPSBUS216-GPS Applying AI-ML to Find Security Needles in the Haystack
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the HaystackAmazon Web Services
 
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdf
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdfGAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdf
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdfAmazon Web Services
 

Similar a BigDL: Image Recognition Using Apache Spark with BigDL - MCL358 - re:Invent 2017 (20)

BigDL Deep Learning in Apache Spark - AWS re:invent 2017
BigDL Deep Learning in Apache Spark - AWS re:invent 2017BigDL Deep Learning in Apache Spark - AWS re:invent 2017
BigDL Deep Learning in Apache Spark - AWS re:invent 2017
 
Database Backup
Database BackupDatabase Backup
Database Backup
 
Data Modelling at Scale
Data Modelling at ScaleData Modelling at Scale
Data Modelling at Scale
 
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...
Create an IoT Gateway and Establish a Data Pipeline to AWS IoT with Intel - I...
 
AWS Storage State of the Union
AWS Storage State of the UnionAWS Storage State of the Union
AWS Storage State of the Union
 
Data Lake na área da saúde- AWS
Data Lake na área da saúde- AWSData Lake na área da saúde- AWS
Data Lake na área da saúde- AWS
 
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...
Building Text Analytics Applications on AWS using Amazon Comprehend - AWS Onl...
 
AWS AI Services - What's new
AWS AI Services - What's newAWS AI Services - What's new
AWS AI Services - What's new
 
Open Source at AWS: Code, Contributions, Collaboration, and Communication
Open Source at AWS: Code, Contributions, Collaboration, and CommunicationOpen Source at AWS: Code, Contributions, Collaboration, and Communication
Open Source at AWS: Code, Contributions, Collaboration, and Communication
 
AWS re:Invent Recap 2016 Taiwan part 2
AWS re:Invent Recap 2016 Taiwan part 2AWS re:Invent Recap 2016 Taiwan part 2
AWS re:Invent Recap 2016 Taiwan part 2
 
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...
Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 ...
 
Slides: How Automating Data Lineage Improves BI Performance
Slides: How Automating Data Lineage Improves BI PerformanceSlides: How Automating Data Lineage Improves BI Performance
Slides: How Automating Data Lineage Improves BI Performance
 
Serverless Text Analytics with Amazon Comprehend
Serverless Text Analytics with Amazon ComprehendServerless Text Analytics with Amazon Comprehend
Serverless Text Analytics with Amazon Comprehend
 
What it Means to be a Next-Generation Managed Service Provider
What it Means to be a Next-Generation Managed Service ProviderWhat it Means to be a Next-Generation Managed Service Provider
What it Means to be a Next-Generation Managed Service Provider
 
Building AI-powered Apps on AWS
Building AI-powered Apps on AWSBuilding AI-powered Apps on AWS
Building AI-powered Apps on AWS
 
Monomobile
MonomobileMonomobile
Monomobile
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...
NEW LAUNCH! Natural Language Processing for Data Analytics - MCL343 - re:Inve...
 
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the Haystack
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the HaystackGPSBUS216-GPS Applying AI-ML to Find Security Needles in the Haystack
GPSBUS216-GPS Applying AI-ML to Find Security Needles in the Haystack
 
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdf
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdfGAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdf
GAM311-How Linden Lab Built a Virtual World on the AWS Cloud.pdf
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

BigDL: Image Recognition Using Apache Spark with BigDL - MCL358 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT Large Scale Deep Learning with BigDL N o v e m b e r 3 0 , 2 0 1 7
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Large Scale Deep Learning with BigDL T i m F o x | B i g D a t a a n d M a c h i n e L e a r n i n g C o n s u l t a n t | E l e p h a n t S c a l e
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ABOUT ME T i m F o x , P r i n c i p a l @ E l e p h a n t S c a l e P r a c t i t i o n e r a n d T r a i n e r i n D a t a E n g i n e e r i n g a n d D a t a S c i e n c e Author of “Data Science in Python” on LinkedIn Learning t i m @ e l e p h a n t s c a l e . c o m L i n k e d i n : t i m - f o x - 0 0 6 3 5 4 1
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ABOUT Elephant Scale • T r a i n i n g i n B i g D a t a a n d A I t e c h n o l o g i e s • B i g D a t a : S p a r k , H a d o o p , C l o u d , N o S Q L , S t r e a m i n g • A I : M a c h i n e L e a r n i n g , D e e p L e a r n i n g , B i g D L , T e n s o r f l o w • B i g D L t r a i n i n g a v a i l a b l e ! • P u b l i c a n d P r i v a t e t r a i n i n g s a v a i l a b l e • B i g D L S a n d b o x : e l e p h a n t s c a l e . c o m / s a n d b o x E l e p h a n t s c a l e . c o m i n f o @ e l e p h a n t s c a l e . c o m
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Quick Roundup of AI / Machine Learning / Deep Learning
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AI / MACHINE LEARNING / DEEP LEARNING Artificial Intelligence (AI): Broader concept of machines being able to carry out 'smart' tasks Machine Learning: A type of AI that allows software to learn from data without explicitly programmed Deep Learning: Using Neural Networks to solve some hard problems A r t i f i c i a l I n t e l l i g e n c e M a c h i n e L e a r n i n g D e e p L e a r n i n g
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEEP LEARNING APPLICATIONS S e l f D r i v i n g C a r s • ML system using image recognition • Where the edge of the road / road sign / car in front F a c e r e c o g n i t i o n • Facebook images • System learns from images manually tagged and then automatically detects faces in uploaded photos
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEEP LEARNING HISTORY E a r l y a t t e m p t s a t D e e p L e a r n i n g d i d n o t s u c c e e d . • Compute Power was insufficient for the time. • Training Datasets were insufficiently sized for good results. • We lacked the ability to parallelize our work. I n t h e m o d e r n e r a , D e e p L e a r n i n g h a s b e e n s u c c e s s f u l . • 'Big Data' – now we have so much data to train our models • 'Big Data ecosystem' – excellent big data platforms (Hadoop, Spark, NoSQL) are available as open source • 'Big Compute' - cloud platforms significantly lowered the barrier to massive compute power • $1 buys you 16 core + 128 G + 10 Gigabit machine for 1 hr on AWS! • So running a 100 node cluster for 5 hrs  $500
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AI SOFTWARE ECO SYSTEM M a c h i n e L e a r n i n g D e e p L e a r n i n g Java - Weka - Mahout - DeepLearning4J Python - SciKit - Tensorflow - Theano - Caffe R - Many libraries - Deepnet - Darch Distributed - H20 - Spark - H20 - Spark - BigDL Cloud - AWS - AWS
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. MACHINE LEARNING AND BIG DATA Until recently most of the machine learning is done on “single computer” (with lots of memory–100s of GBs) Most R/Python/Java libraries are “single node based” Now Big Data tools make it possible to run machine learning algorithms at massive scale – distributed across a cluster
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. MODERN DEEP LEARNING FRAMEWORKS
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TOOLS FOR SCALABLE MACHINE LEARNING A p a c h e S p a r k M L • Runs on top of popular Spark framework • Massively scalable • Can use memory (caching) effectively for iterative algorithms • Language support: Scala, Java, Python, R B i g D L • Built for Apache Spark and Optimized for Intel Xeon • Language Support: Scala, Java, Python T e n s o r F l o w • Based on “data flow graphs” • Language support: Python, C++ • https://www.tensorflow.org/
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. TOOLS FOR SCALABLE CLOUD MACHINE LEARNING A m a z o n M a c h i n e L e a r n i n g • Ready to go algorithms • Visualization tools • Wizards to guide • Scalable on AWS Cloud
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BigDL
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. WHAT IS BIGDL A d i s t r i b u t e d d e e p l e a r n i n g l i b r a r y f o r A p a c h e S p a r k F e a t u r e p a r i t y w i t h p o p u l a r d e e p l e a r n i n g f r a m e w o r k s • Caffe, Torch, Tensorflow H i g h P e r f o r m a n c e • Powered by Intel Math Kernel Library (MKL) and multi threaded programming C a n s c a l e t o h u g e d a t a s e t s • Using Apache Spark for scale O p e n s o u r c e ! ( D e c 2 0 1 6 ) A c t i v e D e v e l o p m e n t
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. PRODUCTION ML/DL SYSTEMS ARE COMPLEX! A c t u a l M L / D L i s o n l y s m a l l p o r t i o n o f m a s s i v e p r o d u c t i o n s y s t e m B i g D L r u n n i n g o n a s c a l a b l e p l a t f o r m l i k e S p a r k h e l p s s i m p l i f y t h e c o m p l e x i t y
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BIGDL FILLS THE 'GAP' IN BIG DATA + DEEP LEARNING F o l l o w s p r o v e n d e s i g n p a t t e r n s f o r d e a l i n g w i t h B i g D a t a S e n d s ' c o m p u t e t o d a t a ' r a t h e r t h a n r e a d i n g m a s s i v e d a t a o v e r n e t w o r k . U s e s ' d a t a l o c a l i t y ' o f H D F S ( H a d o o p F i l e S y s t e m ) U t i l i z e s ' c l u s t e r m a n a g e r s ' l i k e Y A R N / M E S O S • Automatically handles hardware/software failures • Elasticity and resource sharing in a cluster
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BIGDL & SPARK R u n B i g D L a p p l i c a t i o n s a s S p a r k a p p l i c a t i o n s S c a l a , J a v a , a n d P y t h o n s u p p o r t U s e o t h e r S p a r k ' s f e a t u r e s • In memory compute • Integrate with Spark ML and Streaming E a s y d e v e l o p m e n t w i t h J u p y t e r N o t e b o o k
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BIGDL VS TENSORFLOW B i g D L T e n s o r f l o w Runtime Scala Engine with Python front-end C++ Engine with Python front-end Hadoop compatibility Can run natively on Spark & Hadoop Accesses Hadoop data as a client only Distributed Operation Scalable with Apache Spark for massive scale out of the box Does not support massive distribution out of the box Runs Tensorflow Models Yes Yes Acceleration CPU w/MKL CPU/GPU Summary Excellent for distributing deep-learning models to massive scale on big-data. Great TCO value. Excellent library for small- medium scale data, although GPU hardware costs can be significant.
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BIGDL: BIG COMPUTE PLUS BIG DATA B i g D L h e l p s u s i n b a l a n c i n g o u r n e e d s • Big Compute: Fast Linear Algebra, Intel MKL library • Optimized for Intel Xeon • Big Data: I/O parallelized to run on many CPUs B i g D L A l l o w s M a s s i v e S c a l a b i l i t y • Natively Designed to run on Spark • Works with Hadoop eco system (via Spark) Hadoop is THE Big Data platform for on-premise deployments P l a y s n i c e l y w i t h o t h e r B i g D L f r a m e w o r k s • Use existing Tensorflow or Caffe at scale in BigDL • Train new models based on existing TF / Caffe models
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BIGDL USE CASES F r a u d d e t e c t i o n S e n t i m e n t a n a l y s i s I m a g e r e c o g n i t i o n Find more at: https://github.com/intel-analytics/analytics-zoo/
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GPUs and CPUs
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GPUS (GRAPHICS PROCESSING UNITS) G P U s h a v e a d d r e s s e d p a s t i s s u e s i n t r a i n i n g p e r f o r m a n c e • Example: Tensorflow - optimized to run well on GPUs. C P U i n p a s t n o t v e c t o r i z e d f o r p a r a l l e l c o m p u t e • Meant that GPUs were much faster for deep learning M o d e r n I n t e l X e o n C P U s h a v e v e c t o r i z e d l i n e a r a l g e b r a • Properly optimized, approaches speed of GPUs • CPUs are now a credible alternative to running on GPUs • Cost Advantage and Scalability
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. INTEL MATH KERNEL LIBRARY (MKL) F e a t u r e s h i g h l y o p t i m i z e d , t h r e a d e d , a n d v e c t o r i z e d m a t h f u n c t i o n s t h a t m a x i m i z e p e r f o r m a n c e o n e a c h p r o c e s s o r f a m i l y . U t i l i z e s i n d u s t r y - s t a n d a r d C a n d F o r t r a n A P I s f o r c o m p a t i b i l i t y w i t h p o p u l a r B L A S , L A P A C K , a n d F F T W f u n c t i o n s — n o c o d e c h a n g e s r e q u i r e d . D i s p a t c h e s o p t i m i z e d c o d e f o r e a c h p r o c e s s o r a u t o m a t i c a l l y w i t h o u t t h e n e e d t o b r a n c h c o d e . P r o v i d e s p r i o r i t y s u p p o r t , c o n n e c t i n g y o u d i r e c t l y t o I n t e l e n g i n e e r s f o r c o n f i d e n t i a l a n s w e r s t o t e c h n i c a l q u e s t i o n s
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. INTEL MKL PERFORMANCE
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CPU VERSUS GPU FOR BIG DATA C P U o f f e r s h i g h e r s c a l a b i l i t y a t l o w e r c o s t v e r s u s G P U O p t i m i z e d S o f t w a r e a n d l i b r a r i e s o n C P U a l l o w s i n g l e - n o d e p e r f o r m a n c e t o a p p r o a c h G P U p e r f o r m a n c e . G P U p l u s C P U a r c h i t e c t u r e s c a n b e e f f e c t i v e f o r s m a l l e r n u m b e r o f n o d e s , w h e n c o s t i s n o t a c o n c e r n . ” B i g C o m p u t e ” v e r s u s “ B i g D a t a ”
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Running BigDL
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. RUNNING BIGDL D e v e l o p i n g : U s e t h e f o l l o w i n g t o d e v e l o p y o u r B i g D L a p p s e f f o r t l e s s l y • Docker • VM Sandbox D e p l o y i n g : C l o u d r e a d y d e p l o y m e n t • Amazon AMI
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEMO: GETTING STARTED WITH BIGDL W e w i l l p r o v i d e : • Docker • Sandbox VM • AWS Marketplace AMI
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. BigDL Summary B i g D L o f f e r s o u t s t a n d i n g s c a l a b i l i t y a n d p e r f o r m a n c e B i g D L o p t i m i z e s T C O b y r u n n i n g b e i n g t u n e d a n d o p t i m i z e d f o r I n t e l X e o n P r o c e s s o r s B i g D L b r i n g s d e e p l e a r n i n g t o S p a r k C l u s t e r s a n d H a d o o p D a t a s e t s B i g D L c a n b e u s e d t o d e p l o y T e n s o r f l o w a n d C a f f e m o d e l s t o b i g d a t a .
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. IMAGE RECOGNITION WITH APACHE SPARK AND BIGDL A l e x K a l i n i n | V P , A I / M a c h i n e L e a r n i n g | S i z m e k
  • 32. ABOUT ME A l e x K a l i n i n V P , A I / M a c h i n e L e a r n i n g | S i z m e k a l e x . k a l i n i n @ s i z m e k . c o m L i n k e d i n : l i n k e d i n . c o m / i n / a l e x k a l i n i n /
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. https://www.linkedin.com/in/alexkalinin/ alex.kalinin@sizmek.com 1 0 0 , 0 0 0 , 0 0 0 , 0 0 0 r e q u e s t s p e r d a y P B s o f t r a i n i n g d a t a AI-POWERED MARKETING AND OPTIMIZATION 7 0 , 0 0 0 , 0 0 0 / m i n u t e 1 , 2 0 0 , 0 0 0 / s e c
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FEED-FORWARD NETWORK
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 43 37 45 40 𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖) ? ? ? ? ? ?
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖) 43 37 45 40 ? ? ? ?
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = ෍(𝑤𝑖 ∗ 𝑥𝑖) 43 37 45 40 -1.56 ? ? ?
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 -1.56 ? ? ?
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.53 0.01 -0.17 0.70 0.51 ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 ? ? ? 0
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 43 37 45 40 -0.12 0.13 0.21 -0.07 -0.05 ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) ? ? 0 11.9
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 -.11 ? 0 11.9
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 ? 0 11.9 0
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ? ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. -0.67 ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 0 ? 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 0 0.52 𝑦 = 𝑅𝑒𝐿𝑈(෍ 𝑤𝑖 ∗ 𝑥𝑖 ) 43 37 45 40 0.15 0 11.9 0
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FEED-FORWARD NETWORK
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 49. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 50. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 51. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FULLY CONNECTED
  • 52. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FULLY CONNECTED
  • 53. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FULLY CONNECTED
  • 54. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FULLY CONNECTED Input Size: Connections: 40,000 1,600,000,000 200 200 200 200 10 layers: 16 billion
  • 55. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. INVENTION OF CONVOLUTIONAL NEURAL NETWORKS • L e N e t - 5 n e t w o r k d e v e l o p e d i n 1 9 9 8 b y Y a n n L e C u n • T o r s t e n H u b e l a n d D a v i d W i e s e l
  • 56. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HUBEL & WIESEL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1363130/
  • 57. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HIERARCHICAL & LOCAL VISUAL CORTEX
  • 58. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HIERARCHICAL & LOCAL VISUAL CORTEX Lines, Dots Orientation, Movement High-Level Shapes
  • 59. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. KEY FEATURES OF CONVOLUTIONAL NETWORK • C o n v o l u t i o n • P o o l i n g
  • 60. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. FULLY CONNECTED
  • 61. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION
  • 62. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION
  • 63. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION
  • 64. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION
  • 65. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION O n l y f o u r we i g h t s
  • 66. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTION 0.10 -0.06 0.24 0.17 Filter
  • 67. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. POOLING Source: https://cs231n.github.io/convolutional -networks/
  • 68. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CONVOLUTIONAL NETWORK 0 0 0 0 1 0 0 0 0 0 0 1 2 3 4 5 6 7 8 9 Convolution Pooling Convolution PoolingInput FC FC Source: https://www.clarifai.com/technology
  • 69. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. http://scs.ryerson.ca/~aharley/vis/conv/flat.html
  • 70. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEMO – Image Recognition G i t H u b : h t t p s : / / g i t h u b . c o m / a l e x - k a l i n i n / l e n e t - b i g d l
  • 71. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Getting Started With BigDL
  • 72. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ABOUT ME S u j e e M a n i y a m F o u n d e r / P r i n c i p a l @ E l e p h a n t S c a l e P r a c t i t i o n e r a n d T r a i n e r i n D a t a E n g i n e e r i n g a n d D a t a S c i e n c e Author - "Hadoop and Spark" video training on O'Reilly Media - "HBase Design Patterns" - "Hadoop illuminated" s u j e e @ e l e p h a n t s c a l e . c o m L i n k e d i n : l i n k e d i n . c o m / i n / s u j e e m a n i y a m
  • 73. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. RUNNING BIGDL D e v e l o p i n g : U s e t h e f o l l o w i n g t o d e v e l o p y o u r B i g D L a p p s e f f o r t l e s s l y • Docker • VM Sandbox • Amazon AMI D e p l o y i n g : C l o u d r e a d y d e p l o y m e n t • Amazon AMI
  • 74. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GETTING STARTED WITH BIGDL W e w i l l d e m o n s t r a t e • Docker • Sandbox VM • AWS Marketplace AMI
  • 75. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Docker Step 1 : Install Docker on your laptop Step 2 : get docker image docker pull elephantscale/bigdl-sandbox Step 3 : download tutorials git clone https://github.com/elephantscale/bigdl-tutorials Step 4 : Launch docker cd bigdl-tutorials ./run-bigdl-docker.sh elephantscale/bigdl-sandbox Step 5 : Go to Jupyter notebook
  • 76. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. VM-Sandbox Step 1 : Install VMware Player or VirtualBox on your laptop Step 2 : Download BigDL-Sandbox image http://elephantscale.com/sandbox/ Step 3 : (In Sandbox) download tutorials git clone https://github.com/elephantscale/bigdl-tutorials Step 4 : (In Sandbox) Run BigDL natively cd bigdl-tutorials ./run-bigdl-native.sh Step 5 : (In Sandbox) Go to Jupyter notebook
  • 77. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Docker on AWS Step 1 : Spin up an AMI (Ubuntu recommended) Step 2 : Install Docker on the instance Step 3 : get docker image docker pull elephantscale/bigdl-sandbox Step 4 : download tutorials git clone https://github.com/elephantscale/bigdl-tutorials Step 5 : Launch docker cd bigdl-tutorials ./run-bigdl-docker.sh elephantscale/bigdl-sandbox Step 6 : Go to Jupyter notebook
  • 78. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AMI on AWS Step 1 : Spin up BigDL AMI Step 2 : download tutorials git clone https://github.com/elephantscale/bigdl- tutorials Step 3 : Run BigDL cd bigdl-tutorials ./run-bigdl-native.sh Step 4 : Go to Jupyter notebook Everyone Gets Free Credits on AWS Ask Dave Nielsen dave.nielsen@intel.com
  • 79. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Notebooks and Slides Tutorials: github.com/dnielsen/bigdl-resources Slides: bit.ly/bigdlreinvent2017 (as in BigDL re:invent) Tim Fox Sujee Maniyam Alex Kalinin Dave Nielsen Elephant Scale Elephant Scale Sizmek Intel Software
  • 80. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. THANK YOU!