SlideShare una empresa de Scribd logo
1 de 53
Descargar para leer sin conexión
Enabling Ubiquitous Visual Intelligence
Through Deep Learning	

Dr. Ren Wu	

Distinguished Scientist, Baidu	

wuren@baidu.com 	

@韧在百度
Dr. Ren Wu	

•  Distinguished Scientist, Baidu	

•  HSA Chief Software Architect, AMD	

•  PI, HP Labs CUDA Research Center	

•  World Computer Xiangqi Champion	

•  AI expert	

•  Heterogeneous Computing expert	

•  Computational scientist
Eight Years Ago - 05/11/1997
Deep Blue	

A classic example of application-specific system design comprised
of an IBM supercomputer with 480 custom-madeVLSI chess chips, running
massively parallel search algorithm with highly optimized implementation.
Computer Chess and Moore’s Law
Deep Learning Works	

“We deepened our investment in
advanced technologies like Deep
Learning, which is already yielding near
term enhancements in customer ROI
and is expected to drive
transformational change over the
longer term.” 	

	

– Robin Li, Baidu CEO
Amount of data	

Performance	

Deep learning	

Old algorithms	

Deep Learning
Deep Learning vs. Human Brain	

pixels	

edges	

object parts	

(combination 	

of edges)	

object models	

Deep Architecture in the Brain
Retina
Area V1
Area V2
Area V4
pixels
Edge detectors
Primitive shape
detectors
Higher level visual
abstractions
Slide credit: Andrew Ng
Voice	

Text	

 Image	

 User
Deep Convolutional Neural Networks	
* Efficient mapping of the training of Convolutional Neural Networks to a CUDA-based cluster	

courtesy of Jonatan Ward, Sergey Andreev, Francisco Heredia, Bogdan Lazar, Zlatka Manevska
Big Data	
•  >2000PBStorage
•  10-100PB/dayProcessing
•  100b-1000bWebpages
•  100b-1000bIndex
•  1b-10b/dayUpdate
•  100TB~1PB/dayLog
Heterogeneous Computing	

1993 world #1	

Think Machine CM5/1024	

131 GFlops	

2013	

Samsung Note 3 smartphone	

(Qualcomm SnapDragon 800)	

129 Gflops	

2000 world #1 	

ASCI White (IBM RS/6000SP)	

6MW power, 106 tons	

12.3 TFlops	

2013	

Two MacPro workstation	

(dual AMD GPUs each)	

14 TFlops
Deep Learning: Two Step Process	

Supercomputers used for
training	

	

And then deploy the trained
models everywhere!	

Datacenters	

 Tablets, smartphones	

 Wearable devices	

 IoTs
Deep Learning: Training 	

Big data + Deep learning + High performance computing =
Intelligence	

	

Big data + Deep learning + Heterogeneous computing =
Success
Image Recognition
human vs. machine	

http://7-themes.com/6977111-cute-little-girl-play-white-dog.html
ImageNet Classification Challenge
	

 •  ImageNet dataset	

•  More than 15 million images belonging to about 22,000 categories	

•  ILSVRC (ImageNet Large-Scale Visual Recognition Challenge)	

•  Classification task: 1.2 million images contains 1,000 categories	

•  One of the most challenging computer vision benchmarks	

•  Increasing attention both from industry and academic communities	

* Olga Russakovsky et al. ECCV 2014
ImageNet Classification Challenge
	

* courtesy of Feifei Li
ImageNet Classification Challenge
ImageNet Classification 2012-2014 	

Team	

 Year	

 Place	

 Error (top-5)	

 Uses external
data	

SuperVision	

 2012 	

 -	

 16.4%	

 no	

SuperVision	

 2012 	

 1st	

 15.3%	

 ImageNet 22k	

Clarifai	

 2013	

 -	

 11.7%	

 no	

Clarifai	

 2013	

 1st	

 11.2%	

 ImageNet 22k	

MSRA	

 2014 	

 3rd	

 7.35%	

 no	

VGG	

 2014	

 2nd	

 7.32%	

 no	

GoogLeNet	

 2014	

 1st	

 6.67%	

 no	

Slide credit: Yangqing Jia, Google	

 Invincible ?
Latest Results
Latest Results	

Team Date Top-5 test error
GoogLeNet 2014 6.66%
Deep Image 01/12/2015 5.98%
Deep Image 02/05/2015 5.33%
Microsoft 02/05/2015 4.94%
Google 03/02/2015 4.82%
Deep Image 05/10/2015 4.58%
Insights and Inspirations	

多算胜少算不胜	

	

孙⼦子 计篇 (Sun Tzu, 544-496 BC)	

	

More calculations win, few
calculation lose	

元元本本殚⻅见洽闻	

	

班固 ⻄西都赋(Gu Ban, 32-92 AD) 	

	

Meaning the more you see the
more you know	

明⾜足以察秋毫之末	

	

孟⼦子梁惠⺩王上 (Mencius, 372-289 BC)	

	

ability to see very fine details
Project Minwa (百度敏娲)	

•  Minerva + Athena + ⼥女娲	

•  Athena: Goddess of Wisdom,Warfare,
Divine Intelligence,Architecture, and Crafts	

•  Minerva: Goddess of wisdom, magic,
medicine, arts, commerce and defense	

•  ⼥女娲: 抟⼟土造⼈人, 炼⽯石补天, 婚姻, 乐器	

	

World’s Largest Artificial Neural Networks	

	

v Pushing the State-of-the-Art	

v ~ 100x bigger than previous ones	

v New kind of Intelligence?
Hardware/Software Co-design	
•  Stochastic gradient descent (SGD)	

•  High compute density	

•  Scale up, up to 100 nodes	

•  High bandwidth low latency	

•  36 nodes, 144 GPUs, 6.9TB Host, 1.7TB Device	

•  0.6 PFLOPS 	

•  Highly Optimized software stack	

•  RDMA/GPU Direct 	

•  New data partition and communication
strategies	

GPUs	
Infiniband
Minwa
Speedup (wall time for convergence) 	
Validation set accuracy for different numbers of GPUs	
0	
  
0.1	
  
0.2	
  
0.3	
  
0.4	
  
0.5	
  
0.6	
  
0.7	
  
0.8	
  
0.9	
  
0.25	
   0.5	
   1	
   2	
   4	
   8	
   16	
   32	
   64	
   128	
   256	
  
Accuracy
Time (hours)
32 GPU
16 GPU
1 GPU
Accuracy 80%
32 GPU: 8.6 hours
1 GPU: 212 hours
Speedup: 24.7x
Never have enough training
examples!	

	

Key observations 	

•  Invariant to illuminant of the scene	

•  Invariant to observers 	

Augmentation approaches	

•  Color casting	

•  Optical distortion	

•  Rotation and cropping etc	

Data Augmentation	

“⻅见多识⼲⼴广”
And the Color Constancy 	

	

Key observations 	

•  Invariant to illuminant of the scene	

•  Invariant to observers 	

Augmentation approaches	

•  Color casting	

•  Optical distortion	

•  Rotation and cropping etc	

The Color of the Dress	

“Inspired by the color constancy principal.
Essentially, this ‘forces’ our neural network to
develop its own color constancy ability.”
Data Augmentation	

Augmentation The number of possible changes
Color casting 68920
Vignetting 1960
Lens distortion 260
Rotation 20
Flipping 2
Cropping 82944(crop size is 224x224, input image
size is 512x512)
Possible variations
The Deep Image system learned from ~2 billion examples, out
of 90 billion possible candidates.
Data Augmentation vs. Overfitting
Examples	

Bathtub	
 Isopod	
Indian elephant	
 Ice bear	
Some hard cases addressed by adding our data augmentation.
Multi-scale Training	

•  Same crop size, different
resolution	

•  Fixed-size 224*224	

•  Downsized training images	

•  Reduces computational costs	

•  But not for state-of-the-art	

•  Different models trained by
different image sizes	

256*256	
512*512	
•  High-resolution model works	

•  256x256: top-5 7.96%	

•  512x512: top-5 7.42% 	

•  Multi-scale models are
complementary	

•  Fused model: 6.97%	

“明查秋毫”
Multi-scale Training	

Tricycle	
Washer	
Backpack	
Little blue heron
Tricycle
Single Model Performance	

•  One basic configuration has 16 layers	

•  The number of weights in our configuration is 212.7M	

•  About 40% bigger than VGG’s	

Team Top-5 val. error
VGG 8.0%
GoogLeNet 7.89%
BN-Inception 5.82%
MSRA, PReLU-net 5.71%
Deep Image 5.40%
Robustness
Major Differentiators 	

•  Customized built supercomputer dedicated for DL	

•  Simple, scalable algorithm + Fully optimized
software stack	

•  Larger models	

•  More Aggressive data augmentation	

•  Multi-scale, include high-resolution images	

Scalability + Insights 	

	

 	

 	

and push for extreme
Deep Learning: Deployment	

Big data + Deep learning + High performance computing =
Intelligence	

	

Big data + Deep learning + Heterogeneous computing =
Success
Owl of Minwa (百度敏鸮)	

Supercomputers	

 Datacenters	

 Tablets, smartphones	

Models trained by supercomputers	

Trained models will be deployed in many ways	

data centers (cloud), smartphones, and even wearables and IoTs	

d	

	

OpenCL based, light weight and high performance	

	

DNNs everywhere !	

knowledge, wisdom, perspicacity and erudition
DNNs Everywhere	

Supercomputers	

 Datacenters	

 Tablets, smartphones	

 Wearable devices	

IoTs	

1000s GPUs	

 100k-1m servers	

 2b (in China)	

 50b in 2020?	

Supercomputer used for training	

Trained DNNs then deployed to data centers (cloud),
smartphones, and even wearables and IoTs
Offline Mobile DNN App	

•  Image recognition on mobile device	

•  Real time and no connectivity
needed	

•  directly from video stream, what
you point is what you get	

•  Everything is done within the device	

	

•  OpenCL based, highly optimized	

•  Large deep neural network models	

•  Thousands of objects, flowers, dogs,
and bags etc	

•  Unleashed the full potential of the
device hardware	

	

•  Smart phones now, Wearables and
IoTs tomorrow
Cloud Computing: What’s Missing?	

Bandwidth?	

Latency?	

and	

Power consumption?	

*ArtemVasilyev: CNN optimizations for embedded systems and FFT	

Moving data around is expensive, very expensive!
Cloud Computing: What’s Missing?	

How about 	

privacy?
What’s Next?	

Dedicated Hardware + Heterogeneous Computing	

*MarkHorowitz
Heterogeneous Computing	

“Human mind and brain is not a single general-purpose processor
but a collection of highly specialized components, each solving a
different, specific problem and yet collectively making up who we
are as human beings and thinkers. “ - Prof. Nancy Kanwisher
© Copyright Khronos Group 2015 - Page 50
Vision Processing Power Efficiency
• Wearables will need ‘always-on’ vision
-  With smaller thermal limit / battery than phones!
• GPUs have x10 imaging power efficiency over CPU
-  GPUs architected for efficient pixel handling
• Dedicated Hardware/DSPs can be even more efficient
-  With some loss of generality
• Mobile SOCs have space for more transistors
-  But can’t turn on at same time = Dark Silicon
-  Can integrate more gates ‘for free’ if careful
how and when they are used
PowerEfficiency
Computation Flexibility
Dedicated
Hardware
GPU
Compute
Multi-core
CPU
X1
X10
X100
Potential for dedicated sensor/vision silicon to be
integrated into Mobile Processors
But how will they be programmed for
PORTABILITY and POWER EFFICIENCY?
© Copyright Khronos Group 2015 - Page 51
OpenCL Ecosystem
Implementers
Desktop/Mobile/FPGA
Working Group Members
Apps/Tools/Tests/Courseware
Single Source C++ Programming
Portable Kernel Intermediate Language
Core API and Language Specs
Everything
Connected	
Everything
Intelligent
Big data era AI era
I2
oT
Intelligent Internet of Things
Thank you!

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

GTC 2017: The AI Revolution
GTC 2017: The AI RevolutionGTC 2017: The AI Revolution
GTC 2017: The AI Revolution
 
Building New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR WebinarBuilding New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR Webinar
 
Big Data LDN 2017: Machine Learning: What Works And What They Won’t Tell You
Big Data LDN 2017: Machine Learning: What Works And What They Won’t Tell YouBig Data LDN 2017: Machine Learning: What Works And What They Won’t Tell You
Big Data LDN 2017: Machine Learning: What Works And What They Won’t Tell You
 
NVIDIA Testimony at Senate Commerce, Science, and Transportation Committee He...
NVIDIA Testimony at Senate Commerce, Science, and Transportation Committee He...NVIDIA Testimony at Senate Commerce, Science, and Transportation Committee He...
NVIDIA Testimony at Senate Commerce, Science, and Transportation Committee He...
 
GTC 2015 Highlights
GTC 2015 HighlightsGTC 2015 Highlights
GTC 2015 Highlights
 
AI Predictions 2017
AI Predictions 2017AI Predictions 2017
AI Predictions 2017
 
Deep Learning Workflows: Training and Inference
Deep Learning Workflows: Training and InferenceDeep Learning Workflows: Training and Inference
Deep Learning Workflows: Training and Inference
 
AI For Enterprise
AI For EnterpriseAI For Enterprise
AI For Enterprise
 
Driving Computer Vision Research Innovation In Artificial Intelligence
Driving Computer Vision Research Innovation In Artificial IntelligenceDriving Computer Vision Research Innovation In Artificial Intelligence
Driving Computer Vision Research Innovation In Artificial Intelligence
 
Top 5 AI and Deep Learning Stories 4/21
Top 5 AI and Deep Learning Stories 4/21Top 5 AI and Deep Learning Stories 4/21
Top 5 AI and Deep Learning Stories 4/21
 
Inception Awards: The Top Six AI Startups Changing The World
Inception Awards: The Top Six AI Startups Changing The WorldInception Awards: The Top Six AI Startups Changing The World
Inception Awards: The Top Six AI Startups Changing The World
 
The AI Era Ignited by GPU Deep Learning
The AI Era Ignited by GPU Deep Learning The AI Era Ignited by GPU Deep Learning
The AI Era Ignited by GPU Deep Learning
 
NVIDIA 2017 Overview
NVIDIA 2017 OverviewNVIDIA 2017 Overview
NVIDIA 2017 Overview
 
Harness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for BusinessHarness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for Business
 
9/23 Top 5 Deep Learning
9/23 Top 5 Deep Learning9/23 Top 5 Deep Learning
9/23 Top 5 Deep Learning
 
AdTech 2017 Sydney
AdTech 2017 SydneyAdTech 2017 Sydney
AdTech 2017 Sydney
 
Baidu World 2016 With NVIDIA CEO Jen-Hsun Huang
Baidu World 2016 With NVIDIA CEO Jen-Hsun HuangBaidu World 2016 With NVIDIA CEO Jen-Hsun Huang
Baidu World 2016 With NVIDIA CEO Jen-Hsun Huang
 
HPC Top 5 Stories: Dec. 12, 2016
HPC Top 5 Stories: Dec. 12, 2016HPC Top 5 Stories: Dec. 12, 2016
HPC Top 5 Stories: Dec. 12, 2016
 
HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016
 
HPC Top 5 Stories: March 29, 2017
HPC Top 5 Stories: March 29, 2017HPC Top 5 Stories: March 29, 2017
HPC Top 5 Stories: March 29, 2017
 

Destacado

04 history of cv computer vision, neural networks and pattern recognition - ...
04  history of cv computer vision, neural networks and pattern recognition - ...04  history of cv computer vision, neural networks and pattern recognition - ...
04 history of cv computer vision, neural networks and pattern recognition - ...
zukun
 
P03 neural networks cvpr2012 deep learning methods for vision
P03 neural networks cvpr2012 deep learning methods for visionP03 neural networks cvpr2012 deep learning methods for vision
P03 neural networks cvpr2012 deep learning methods for vision
zukun
 

Destacado (20)

04 history of cv computer vision, neural networks and pattern recognition - ...
04  history of cv computer vision, neural networks and pattern recognition - ...04  history of cv computer vision, neural networks and pattern recognition - ...
04 history of cv computer vision, neural networks and pattern recognition - ...
 
P03 neural networks cvpr2012 deep learning methods for vision
P03 neural networks cvpr2012 deep learning methods for visionP03 neural networks cvpr2012 deep learning methods for vision
P03 neural networks cvpr2012 deep learning methods for vision
 
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani Withanawasam
 
Deep Learning - A Literature survey
Deep Learning - A Literature surveyDeep Learning - A Literature survey
Deep Learning - A Literature survey
 
Multiple intelligence
Multiple intelligenceMultiple intelligence
Multiple intelligence
 
Word Embeddings - Introduction
Word Embeddings - IntroductionWord Embeddings - Introduction
Word Embeddings - Introduction
 
Predictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial IntelligencePredictive Analytics - Big Data & Artificial Intelligence
Predictive Analytics - Big Data & Artificial Intelligence
 
An Introduction to Computer Vision
An Introduction to Computer VisionAn Introduction to Computer Vision
An Introduction to Computer Vision
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
 
Computer Vision Introduction
Computer Vision IntroductionComputer Vision Introduction
Computer Vision Introduction
 
Andrew Ng, Chief Scientist at Baidu
Andrew Ng, Chief Scientist at BaiduAndrew Ng, Chief Scientist at Baidu
Andrew Ng, Chief Scientist at Baidu
 
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
 
用十分鐘理解 《神經網路發展史》
用十分鐘理解 《神經網路發展史》用十分鐘理解 《神經網路發展史》
用十分鐘理解 《神經網路發展史》
 
以深度學習加速語音及影像辨識應用發展
以深度學習加速語音及影像辨識應用發展以深度學習加速語音及影像辨識應用發展
以深度學習加速語音及影像辨識應用發展
 
Deep Learning - Convolutional Neural Networks - Architectural Zoo
Deep Learning - Convolutional Neural Networks - Architectural ZooDeep Learning - Convolutional Neural Networks - Architectural Zoo
Deep Learning - Convolutional Neural Networks - Architectural Zoo
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
AWS re:Invent 2016: Deep Learning at Cloud Scale: Improving Video Discoverabi...
AWS re:Invent 2016: Deep Learning at Cloud Scale: Improving Video Discoverabi...AWS re:Invent 2016: Deep Learning at Cloud Scale: Improving Video Discoverabi...
AWS re:Invent 2016: Deep Learning at Cloud Scale: Improving Video Discoverabi...
 
Deep Learning Use Cases - Data Science Pop-up Seattle
Deep Learning Use Cases - Data Science Pop-up SeattleDeep Learning Use Cases - Data Science Pop-up Seattle
Deep Learning Use Cases - Data Science Pop-up Seattle
 

Similar a "Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Presentation from Baidu

DeepImage_EmTech-public-small
DeepImage_EmTech-public-smallDeepImage_EmTech-public-small
DeepImage_EmTech-public-small
Ren Wu
 
DeepImage_GTC15_public
DeepImage_GTC15_publicDeepImage_GTC15_public
DeepImage_GTC15_public
Ren Wu
 

Similar a "Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Presentation from Baidu (20)

AIDC India - AI Vision Slides
AIDC India - AI Vision SlidesAIDC India - AI Vision Slides
AIDC India - AI Vision Slides
 
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & FlinkCognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
 
Correlation Analysis Modeling Use Case - IBM Power Systems
Correlation Analysis Modeling Use Case - IBM Power Systems Correlation Analysis Modeling Use Case - IBM Power Systems
Correlation Analysis Modeling Use Case - IBM Power Systems
 
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningMakine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
 
Machine Learning for Smarter Apps - Jacksonville Meetup
Machine Learning for Smarter Apps - Jacksonville MeetupMachine Learning for Smarter Apps - Jacksonville Meetup
Machine Learning for Smarter Apps - Jacksonville Meetup
 
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)
AWS re:Invent 2016: Bringing Deep Learning to the Cloud with Amazon EC2 (CMP314)
 
DeepImage_EmTech-public-small
DeepImage_EmTech-public-smallDeepImage_EmTech-public-small
DeepImage_EmTech-public-small
 
DeepImage_GTC15_public
DeepImage_GTC15_publicDeepImage_GTC15_public
DeepImage_GTC15_public
 
OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar OpenPOWER/POWER9 AI webinar
OpenPOWER/POWER9 AI webinar
 
Deep Learning on Qubole Data Platform
Deep Learning on Qubole Data PlatformDeep Learning on Qubole Data Platform
Deep Learning on Qubole Data Platform
 
Chris Nicholson, CEO Skymind at The AI Conference
Chris Nicholson, CEO Skymind at The AI Conference Chris Nicholson, CEO Skymind at The AI Conference
Chris Nicholson, CEO Skymind at The AI Conference
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
 
Promises of Deep Learning
Promises of Deep LearningPromises of Deep Learning
Promises of Deep Learning
 
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
 
Deep learning on mobile
Deep learning on mobileDeep learning on mobile
Deep learning on mobile
 
The Frontier of Deep Learning in 2020 and Beyond
The Frontier of Deep Learning in 2020 and BeyondThe Frontier of Deep Learning in 2020 and Beyond
The Frontier of Deep Learning in 2020 and Beyond
 
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
 
"Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde...
"Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde..."Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde...
"Deep Learning Beyond Cats and Cars: Developing a Real-life DNN-based Embedde...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
"The Vision AI Start-ups That Matter Most," a Presentation from Cognite Ventures
"The Vision AI Start-ups That Matter Most," a Presentation from Cognite Ventures"The Vision AI Start-ups That Matter Most," a Presentation from Cognite Ventures
"The Vision AI Start-ups That Matter Most," a Presentation from Cognite Ventures
 

Más de Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
Edge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
Edge AI and Vision Alliance
 

Más de Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Presentation from Baidu

  • 1. Enabling Ubiquitous Visual Intelligence Through Deep Learning Dr. Ren Wu Distinguished Scientist, Baidu wuren@baidu.com @韧在百度
  • 2. Dr. Ren Wu •  Distinguished Scientist, Baidu •  HSA Chief Software Architect, AMD •  PI, HP Labs CUDA Research Center •  World Computer Xiangqi Champion •  AI expert •  Heterogeneous Computing expert •  Computational scientist
  • 3. Eight Years Ago - 05/11/1997
  • 4. Deep Blue A classic example of application-specific system design comprised of an IBM supercomputer with 480 custom-madeVLSI chess chips, running massively parallel search algorithm with highly optimized implementation.
  • 5. Computer Chess and Moore’s Law
  • 6. Deep Learning Works “We deepened our investment in advanced technologies like Deep Learning, which is already yielding near term enhancements in customer ROI and is expected to drive transformational change over the longer term.” – Robin Li, Baidu CEO
  • 7. Amount of data Performance Deep learning Old algorithms Deep Learning
  • 8. Deep Learning vs. Human Brain pixels edges object parts (combination of edges) object models Deep Architecture in the Brain Retina Area V1 Area V2 Area V4 pixels Edge detectors Primitive shape detectors Higher level visual abstractions Slide credit: Andrew Ng Voice Text Image User
  • 9. Deep Convolutional Neural Networks * Efficient mapping of the training of Convolutional Neural Networks to a CUDA-based cluster courtesy of Jonatan Ward, Sergey Andreev, Francisco Heredia, Bogdan Lazar, Zlatka Manevska
  • 10. Big Data •  >2000PBStorage •  10-100PB/dayProcessing •  100b-1000bWebpages •  100b-1000bIndex •  1b-10b/dayUpdate •  100TB~1PB/dayLog
  • 11. Heterogeneous Computing 1993 world #1 Think Machine CM5/1024 131 GFlops 2013 Samsung Note 3 smartphone (Qualcomm SnapDragon 800) 129 Gflops 2000 world #1 ASCI White (IBM RS/6000SP) 6MW power, 106 tons 12.3 TFlops 2013 Two MacPro workstation (dual AMD GPUs each) 14 TFlops
  • 12. Deep Learning: Two Step Process Supercomputers used for training And then deploy the trained models everywhere! Datacenters Tablets, smartphones Wearable devices IoTs
  • 13. Deep Learning: Training Big data + Deep learning + High performance computing = Intelligence Big data + Deep learning + Heterogeneous computing = Success
  • 14. Image Recognition human vs. machine http://7-themes.com/6977111-cute-little-girl-play-white-dog.html
  • 15. ImageNet Classification Challenge •  ImageNet dataset •  More than 15 million images belonging to about 22,000 categories •  ILSVRC (ImageNet Large-Scale Visual Recognition Challenge) •  Classification task: 1.2 million images contains 1,000 categories •  One of the most challenging computer vision benchmarks •  Increasing attention both from industry and academic communities * Olga Russakovsky et al. ECCV 2014
  • 16. ImageNet Classification Challenge * courtesy of Feifei Li
  • 18. ImageNet Classification 2012-2014 Team Year Place Error (top-5) Uses external data SuperVision 2012 - 16.4% no SuperVision 2012 1st 15.3% ImageNet 22k Clarifai 2013 - 11.7% no Clarifai 2013 1st 11.2% ImageNet 22k MSRA 2014 3rd 7.35% no VGG 2014 2nd 7.32% no GoogLeNet 2014 1st 6.67% no Slide credit: Yangqing Jia, Google Invincible ?
  • 20. Latest Results Team Date Top-5 test error GoogLeNet 2014 6.66% Deep Image 01/12/2015 5.98% Deep Image 02/05/2015 5.33% Microsoft 02/05/2015 4.94% Google 03/02/2015 4.82% Deep Image 05/10/2015 4.58%
  • 21. Insights and Inspirations 多算胜少算不胜 孙⼦子 计篇 (Sun Tzu, 544-496 BC) More calculations win, few calculation lose 元元本本殚⻅见洽闻 班固 ⻄西都赋(Gu Ban, 32-92 AD) Meaning the more you see the more you know 明⾜足以察秋毫之末 孟⼦子梁惠⺩王上 (Mencius, 372-289 BC) ability to see very fine details
  • 22. Project Minwa (百度敏娲) •  Minerva + Athena + ⼥女娲 •  Athena: Goddess of Wisdom,Warfare, Divine Intelligence,Architecture, and Crafts •  Minerva: Goddess of wisdom, magic, medicine, arts, commerce and defense •  ⼥女娲: 抟⼟土造⼈人, 炼⽯石补天, 婚姻, 乐器 World’s Largest Artificial Neural Networks v Pushing the State-of-the-Art v ~ 100x bigger than previous ones v New kind of Intelligence?
  • 23. Hardware/Software Co-design •  Stochastic gradient descent (SGD) •  High compute density •  Scale up, up to 100 nodes •  High bandwidth low latency •  36 nodes, 144 GPUs, 6.9TB Host, 1.7TB Device •  0.6 PFLOPS •  Highly Optimized software stack •  RDMA/GPU Direct •  New data partition and communication strategies GPUs Infiniband
  • 24. Minwa
  • 25. Speedup (wall time for convergence) Validation set accuracy for different numbers of GPUs 0   0.1   0.2   0.3   0.4   0.5   0.6   0.7   0.8   0.9   0.25   0.5   1   2   4   8   16   32   64   128   256   Accuracy Time (hours) 32 GPU 16 GPU 1 GPU Accuracy 80% 32 GPU: 8.6 hours 1 GPU: 212 hours Speedup: 24.7x
  • 26. Never have enough training examples! Key observations •  Invariant to illuminant of the scene •  Invariant to observers Augmentation approaches •  Color casting •  Optical distortion •  Rotation and cropping etc Data Augmentation “⻅见多识⼲⼴广”
  • 27. And the Color Constancy Key observations •  Invariant to illuminant of the scene •  Invariant to observers Augmentation approaches •  Color casting •  Optical distortion •  Rotation and cropping etc The Color of the Dress “Inspired by the color constancy principal. Essentially, this ‘forces’ our neural network to develop its own color constancy ability.”
  • 28. Data Augmentation Augmentation The number of possible changes Color casting 68920 Vignetting 1960 Lens distortion 260 Rotation 20 Flipping 2 Cropping 82944(crop size is 224x224, input image size is 512x512) Possible variations The Deep Image system learned from ~2 billion examples, out of 90 billion possible candidates.
  • 29. Data Augmentation vs. Overfitting
  • 30. Examples Bathtub Isopod Indian elephant Ice bear Some hard cases addressed by adding our data augmentation.
  • 31. Multi-scale Training •  Same crop size, different resolution •  Fixed-size 224*224 •  Downsized training images •  Reduces computational costs •  But not for state-of-the-art •  Different models trained by different image sizes 256*256 512*512 •  High-resolution model works •  256x256: top-5 7.96% •  512x512: top-5 7.42% •  Multi-scale models are complementary •  Fused model: 6.97% “明查秋毫”
  • 34. Single Model Performance •  One basic configuration has 16 layers •  The number of weights in our configuration is 212.7M •  About 40% bigger than VGG’s Team Top-5 val. error VGG 8.0% GoogLeNet 7.89% BN-Inception 5.82% MSRA, PReLU-net 5.71% Deep Image 5.40%
  • 36.
  • 37.
  • 38.
  • 39.
  • 40. Major Differentiators •  Customized built supercomputer dedicated for DL •  Simple, scalable algorithm + Fully optimized software stack •  Larger models •  More Aggressive data augmentation •  Multi-scale, include high-resolution images Scalability + Insights and push for extreme
  • 41. Deep Learning: Deployment Big data + Deep learning + High performance computing = Intelligence Big data + Deep learning + Heterogeneous computing = Success
  • 42. Owl of Minwa (百度敏鸮) Supercomputers Datacenters Tablets, smartphones Models trained by supercomputers Trained models will be deployed in many ways data centers (cloud), smartphones, and even wearables and IoTs d OpenCL based, light weight and high performance DNNs everywhere ! knowledge, wisdom, perspicacity and erudition
  • 43. DNNs Everywhere Supercomputers Datacenters Tablets, smartphones Wearable devices IoTs 1000s GPUs 100k-1m servers 2b (in China) 50b in 2020? Supercomputer used for training Trained DNNs then deployed to data centers (cloud), smartphones, and even wearables and IoTs
  • 44. Offline Mobile DNN App •  Image recognition on mobile device •  Real time and no connectivity needed •  directly from video stream, what you point is what you get •  Everything is done within the device •  OpenCL based, highly optimized •  Large deep neural network models •  Thousands of objects, flowers, dogs, and bags etc •  Unleashed the full potential of the device hardware •  Smart phones now, Wearables and IoTs tomorrow
  • 45.
  • 46. Cloud Computing: What’s Missing? Bandwidth? Latency? and Power consumption? *ArtemVasilyev: CNN optimizations for embedded systems and FFT Moving data around is expensive, very expensive!
  • 47. Cloud Computing: What’s Missing? How about privacy?
  • 48. What’s Next? Dedicated Hardware + Heterogeneous Computing *MarkHorowitz
  • 49. Heterogeneous Computing “Human mind and brain is not a single general-purpose processor but a collection of highly specialized components, each solving a different, specific problem and yet collectively making up who we are as human beings and thinkers. “ - Prof. Nancy Kanwisher
  • 50. © Copyright Khronos Group 2015 - Page 50 Vision Processing Power Efficiency • Wearables will need ‘always-on’ vision -  With smaller thermal limit / battery than phones! • GPUs have x10 imaging power efficiency over CPU -  GPUs architected for efficient pixel handling • Dedicated Hardware/DSPs can be even more efficient -  With some loss of generality • Mobile SOCs have space for more transistors -  But can’t turn on at same time = Dark Silicon -  Can integrate more gates ‘for free’ if careful how and when they are used PowerEfficiency Computation Flexibility Dedicated Hardware GPU Compute Multi-core CPU X1 X10 X100 Potential for dedicated sensor/vision silicon to be integrated into Mobile Processors But how will they be programmed for PORTABILITY and POWER EFFICIENCY?
  • 51. © Copyright Khronos Group 2015 - Page 51 OpenCL Ecosystem Implementers Desktop/Mobile/FPGA Working Group Members Apps/Tools/Tests/Courseware Single Source C++ Programming Portable Kernel Intermediate Language Core API and Language Specs
  • 52. Everything Connected Everything Intelligent Big data era AI era I2 oT Intelligent Internet of Things