Quoc Le, Stanford & Google - Tera Scale Deep Learning

•

2 recomendaciones•2,713 vistas

Kun Le

Tecnología

Tera-scale deep learning
Quoc
V.
Le

Stanford
University
and
Google

Joint
work
with

Kai
Chen
Greg
Corrado
Jeﬀ
Dean
MaAhieu
Devin

Rajat
Monga
Andrew
Ng
Marc Aurelio
Paul
Tucker
Ke
Yang

Ranzato

Machine
Learning
successes

Face
recogniLon
OCR
Autonomous
car

Email
classiﬁcaLon

RecommendaLon
systems
Web
page
ranking

Quoc
Le

The
role
of
Feature
ExtracLon

in
PaAern
RecogniLon

Classiﬁer

Feature
extracLon

(Mostly
hand-‐craWed
features)

Quoc
Le

Hand-‐CraWed
Features

Computer
vision:

…

SIFT/HOG
SURF

Speech
RecogniLon:

…

MFCC
Spectrogram
ZCR

Quoc
Le

New
feature-‐designing
paradigm

Unsupervised
Feature
Learning
/
Deep
Learning

Show
promises
for
small
datasets

Expensive
and
typically
applied
to
small
problems

Quoc
Le

Brain
SimulaLon

Autoencoder
Watching
10
million
YouTube
video
frames

Train
on
2000
machines
(16000
cores)
for
1
week

Autoencoder
1.15
billion
parameters

-‐  100x
larger
than
previously
reported

-‐  Small
compared
to
visual
cortex

Autoencoder

Image

Le,
et
al.,
Building
high-‐level
features
using
large-‐scale
unsupervised
learning.
ICML
2012

Key
results

Face
detector
Human
body
detector
Cat
detector

Totally
unsupervised!

~85%

correct
in

classifying

face
vs
no
face

Le,
et
al.,
Building
high-‐level
features
using
large-‐scale
unsupervised
learning.
ICML
2012

ImageNet
classiﬁcaLon

0.005%
9.5%
15.8%

Random
guess
State-‐of-‐the-‐art
Feature
learning

(Weston,
Bengio
‘11)
From
raw
pixels

ImageNet
2009
(10k
categories):
Best
published
result:
17%

(Sanchez
&
Perronnin
‘11
),

Our
method:
20%

Using
only
1000
categories,
our
method
>
50%

Quoc
Le

Scaling
up
Deep
Learning

Prior
art
Our
work

#
Examples
100,000
10,000,000

#
Dimensions
1,000
10,000

#
Parameters
10,000,000
1,000,000,000

Data
set
size
Gbytes
Tbytes

Edge
ﬁlters

High-‐level
features

Learned
features
from
Images
Face,
cat
detectors

Quoc
Le

Summary
of
Scaling
up

-‐  Local
connecLvity
(Model
Parallelism)

-‐  Asynchronous
SGDs
(Clever
opLmizaLon
/
Data
parallelism)

-‐  RPCs

-‐  Prefetching

-‐  Single

-‐  Removing
slow
machines

-‐  Lots
of
opLmizaLon

Quoc
Le

Locally
connected
networks

Machine
#1
Machine
#2
Machine
#3
Machine
#4

Features

Image

Quoc
Le

Asynchronous
Parallel
SGDs
(Alex
Smola’s
talk)

Parameter
server

Quoc
Le

Conclusions

•  Scale
deep
learning
100x
larger
using
distributed
training
on
1000

machines

•  Brain
simulaLon
-‐>
Cat
neuron

•  State-‐of-‐the-‐art
performances
on

–  Object
recogniLon
(ImageNet)

–  AcLon
RecogniLon

–  Cancer
image
classiﬁcaLon

•  Other
applicaLons

–  Speech
recogniLon

–  Machine
TranslaLon

ImageNet

0.005%
9.5%
15.8%

Best
published
result

Model

Random
guess
Our
method

Parallelism

Data
Parameter
server

Parallelism

Cat
neuron
Face
neuron

References

•  Q.V.
Le,
M.A.
Ranzato,
R.
Monga,
M.
Devin,
G.
Corrado,
K.
Chen,
J.
Dean,
A.Y.

Ng.
Building
high-‐level
features
using
large-‐scale
unsupervised
learning.

ICML,
2012.

•  Q.V.
Le,
J.
Ngiam,
Z.
Chen,
D.
Chia,
P.
Koh,
A.Y.
Ng.
Tiled
Convolu7onal
Neural

Networks.
NIPS,
2010.

•  Q.V.
Le,
W.Y.
Zou,
S.Y.
Yeung,
A.Y.
Ng.
Learning
hierarchical
spa7o-‐temporal

features
for
ac7on
recogni7on
with
independent
subspace
analysis.
CVPR,

2011.

•  Q.V.
Le,
J.
Ngiam,
A.
Coates,
A.
Lahiri,
B.
Prochnow,
A.Y.
Ng.

On
op7miza7on
methods
for
deep
learning.
ICML,
2011.

•  Q.V.
Le,
A.
Karpenko,
J.
Ngiam,
A.Y.
Ng.

ICA
with
Reconstruc7on
Cost
for

Eﬃcient
Overcomplete
Feature
Learning.
NIPS,
2011.

•  Q.V.
Le,
J.
Han,
J.
Gray,
P.
Spellman,
A.
Borowsky,
B.
Parvin.
Learning
Invariant

Features
for
Tumor
Signatures.
ISBI,
2012.

•  I.J.
Goodfellow,
Q.V.
Le,
A.M.
Saxe,
H.
Lee,
A.Y.
Ng,

Measuring
invariances
in

deep
networks.
NIPS,
2009.

hAp://ai.stanford.edu/~quocle

Más contenido relacionado

Similar a Quoc Le, Stanford & Google - Tera Scale Deep Learning

What's Wrong With Deep Learning?Philip Zheng

Strata London - Deep Learning 05-2015Turi, Inc.

2008 brokerage 04 smart vision system [compatibility mode]imec.archive

Yann le cunYandex

Deep Learning Hardware: Past, Present, & FutureRouyun Pan

Convolutional Neural Network (CNN)Muhammad Haroon

The Forces Driving JavaSteve Elliott

Framework Engineering_FinalYoungSu Son

introduction to deeplearningEyad Alshami

Anomaly Detection with Azure and .NETMarco Parenzan

Lecture24Albert Orriols-Puig

Evolving Web: Drupal 7 in Higher Education Case Study dergachev

An Introduction to Face DetectionLivares Technologies Pvt Ltd

Gesture Based Interactionlanesk8er

426 lecture2: AR TechnologyMark Billinghurst

Approximate Semantic Matching of Heterogeneous EventsEdward Curry

Anomaly Detection with Azure and .netMarco Parenzan

Microsoft HPC User Group sjwoodman

Dubbawala _ Ebay Virtual Courier AggregatorManish Kanojia

Similar a Quoc Le, Stanford & Google - Tera Scale Deep Learning (20)

What's Wrong With Deep Learning?

Strata London - Deep Learning 05-2015

2008 brokerage 04 smart vision system [compatibility mode]

Yann le cun

Deep Learning Hardware: Past, Present, & Future

Convolutional Neural Network (CNN)

The Forces Driving Java

Framework Engineering_Final

introduction to deeplearning

Anomaly Detection with Azure and .NET

Lecture24

Evolving Web: Drupal 7 in Higher Education Case Study

An Introduction to Face Detection

Gesture Based Interaction

426 lecture2: AR Technology

Approximate Semantic Matching of Heterogeneous Events

Anomaly Detection with Azure and .net

Microsoft HPC User Group

Dubbawala _ Ebay Virtual Courier Aggregator

Más de Kun Le

Ibm big data and analytics for a holistic customer journeyKun Le

Lessons and Challenges from Mining Retail E-Commerce DataKun Le

University of Washington Computer Science & Engineering CSE 403: Software Eng...Kun Le

Business Intelligence and RetailKun Le

The “Big Data” Ecosystem at LinkedInKun Le

Marketo - Definitive guide to marketing metrics marketing analyticsKun Le

Best practices for building and deploying predictive models over big data pre...Kun Le

Architecting a-big-data-platform-for-analytics 24606569Kun Le

Under the hood_of_mobile_marketingKun Le

IBM-Infoworld Big Data deep diveKun Le

Smarter analytics for retailers Delivering insight to enable business successKun Le

IBM - Using Predictive Analytics to Segment, Target and Optimize MarketingKun Le

IBM-Why Big Data?Kun Le

Big data that drives marketing roi across all channels & campaignsKun Le

Más de Kun Le (14)

Ibm big data and analytics for a holistic customer journey

Lessons and Challenges from Mining Retail E-Commerce Data

University of Washington Computer Science & Engineering CSE 403: Software Eng...

Business Intelligence and Retail

The “Big Data” Ecosystem at LinkedIn

Marketo - Definitive guide to marketing metrics marketing analytics

Best practices for building and deploying predictive models over big data pre...

Architecting a-big-data-platform-for-analytics 24606569

Under the hood_of_mobile_marketing

IBM-Infoworld Big Data deep dive

Smarter analytics for retailers Delivering insight to enable business success

IBM - Using Predictive Analytics to Segment, Target and Optimize Marketing

IBM-Why Big Data?

Big data that drives marketing roi across all channels & campaigns

Último

Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz

AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash

DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity

Why Teams call analytics are critical to your entire businesspanagenda

Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays

MINDCTI Revenue Release Quarter One 2024MIND CTI

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Understanding the FAA Part 107 License ..Christopher Logan Kennedy

FWD Group - Insurer Innovation Award 2024The Digital Insurer

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37

WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

Corporate and higher education May webinar.pptxRustici Software

[BuildWithAI] Introduction to Gemini.pdfSandro Moreira

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays

Quoc Le, Stanford & Google - Tera Scale Deep Learning

1. Tera-scale deep learning Quoc V. Le Stanford University and Google Joint work with Kai Chen Greg Corrado Jeﬀ Dean MaAhieu Devin Rajat Monga Andrew Ng Marc Aurelio Paul Tucker Ke Yang Ranzato

2. Machine Learning successes Face recogniLon OCR Autonomous car Email classiﬁcaLon RecommendaLon systems Web page ranking Quoc Le

3. The role of Feature ExtracLon in PaAern RecogniLon Classiﬁer Feature extracLon (Mostly hand-‐craWed features) Quoc Le

4. Hand-‐CraWed Features Computer vision: … SIFT/HOG SURF Speech RecogniLon: … MFCC Spectrogram ZCR Quoc Le

5. New feature-‐designing paradigm Unsupervised Feature Learning / Deep Learning Show promises for small datasets Expensive and typically applied to small problems Quoc Le

6. The Trend of BigData Quoc Le

7. Brain SimulaLon Autoencoder Watching 10 million YouTube video frames Train on 2000 machines (16000 cores) for 1 week Autoencoder 1.15 billion parameters -‐  100x larger than previously reported -‐  Small compared to visual cortex Autoencoder Image Le, et al., Building high-‐level features using large-‐scale unsupervised learning. ICML 2012

8. Key results Face detector Human body detector Cat detector Totally unsupervised! ~85% correct in classifying face vs no face Le, et al., Building high-‐level features using large-‐scale unsupervised learning. ICML 2012

9. ImageNet classiﬁcaLon 0.005% 9.5% 15.8% Random guess State-‐of-‐the-‐art Feature learning (Weston, Bengio ‘11) From raw pixels ImageNet 2009 (10k categories): Best published result: 17% (Sanchez & Perronnin ‘11 ), Our method: 20% Using only 1000 categories, our method > 50% Quoc Le

10. Scaling up Deep Learning Prior art Our work # Examples 100,000 10,000,000 # Dimensions 1,000 10,000 # Parameters 10,000,000 1,000,000,000 Data set size Gbytes Tbytes Edge ﬁlters High-‐level features Learned features from Images Face, cat detectors Quoc Le

11. Summary of Scaling up -‐  Local connecLvity (Model Parallelism) -‐  Asynchronous SGDs (Clever opLmizaLon / Data parallelism) -‐  RPCs -‐  Prefetching -‐  Single -‐  Removing slow machines -‐  Lots of opLmizaLon Quoc Le

12. Locally connected networks Machine #1 Machine #2 Machine #3 Machine #4 Features Image Quoc Le

13. Asynchronous Parallel SGDs (Alex Smola’s talk) Parameter server Quoc Le

14. Conclusions •  Scale deep learning 100x larger using distributed training on 1000 machines •  Brain simulaLon -‐> Cat neuron •  State-‐of-‐the-‐art performances on –  Object recogniLon (ImageNet) –  AcLon RecogniLon –  Cancer image classiﬁcaLon •  Other applicaLons –  Speech recogniLon –  Machine TranslaLon ImageNet 0.005% 9.5% 15.8% Best published result Model Random guess Our method Parallelism Data Parameter server Parallelism Cat neuron Face neuron

15. References •  Q.V. Le, M.A. Ranzato, R. Monga, M. Devin, G. Corrado, K. Chen, J. Dean, A.Y. Ng. Building high-‐level features using large-‐scale unsupervised learning. ICML, 2012. •  Q.V. Le, J. Ngiam, Z. Chen, D. Chia, P. Koh, A.Y. Ng. Tiled Convolu7onal Neural Networks. NIPS, 2010. •  Q.V. Le, W.Y. Zou, S.Y. Yeung, A.Y. Ng. Learning hierarchical spa7o-‐temporal features for ac7on recogni7on with independent subspace analysis. CVPR, 2011. •  Q.V. Le, J. Ngiam, A. Coates, A. Lahiri, B. Prochnow, A.Y. Ng. On op7miza7on methods for deep learning. ICML, 2011. •  Q.V. Le, A. Karpenko, J. Ngiam, A.Y. Ng. ICA with Reconstruc7on Cost for Eﬃcient Overcomplete Feature Learning. NIPS, 2011. •  Q.V. Le, J. Han, J. Gray, P. Spellman, A. Borowsky, B. Parvin. Learning Invariant Features for Tumor Signatures. ISBI, 2012. •  I.J. Goodfellow, Q.V. Le, A.M. Saxe, H. Lee, A.Y. Ng, Measuring invariances in deep networks. NIPS, 2009. hAp://ai.stanford.edu/~quocle

Quoc Le, Stanford & Google - Tera Scale Deep Learning

Recomendados

Recomendados

Más contenido relacionado

Similar a Quoc Le, Stanford & Google - Tera Scale Deep Learning

Similar a Quoc Le, Stanford & Google - Tera Scale Deep Learning (20)

Más de Kun Le

Más de Kun Le (14)

Último

Último (20)

Quoc Le, Stanford & Google - Tera Scale Deep Learning