SlideShare una empresa de Scribd logo
1 de 76
Descargar para leer sin conexión
Animesh Prasad
Muthu Kumar Chandrasekaran 

Basics of Deep Learning
Workshop on Educational and Social Science Technologies (WESST 2017, 19—21 July)
Machine Learning Primer
2
Machine Learning Primer
3
4
Machine Learning Primer
5
http://playground.tensorflow.org
Machine Learning, AI
• Four key ingredients for ML towards AI
1. Lots & lots of data
2. Very flexible models
3.
 
Enough computing power
4.  Powerful priors that can defeat the curse of
dimensionality 

6
Bypassing the curse of dimensionality
We need to build compositionality into our ML models
Just as human languages exploit compositionality to give representations and
meanings to complex ideas
Exploiting compositionality gives an exponential gain in representational power
(1) Distributed representations / embeddings: feature learning
(2) Deep architecture: multiple levels of feature learning
Additional prior: compositionality is useful to describe the
world around us efficiently


7
Each feature can be discovered without the need for
seeing the exponentially large number of
configurations of the other features
• Consider a network whose hidden units discover the following features:
• Person wears glasses
• Person is female

• Person is a child

• Etc.
If each of n feature requires O(k) parameters, need O(nk) examples Non-
parametric methods would require O(nd
) examples


8


9
Hidden Units Discover Semantically Meaningful
Concepts 

• Network trained to recognize places, not objects


10
Why does it work?
•
 
It only works because we are making some assumptions about 

the data generating distribution 

•
 
Worse-case distributions still require exponential data 

•
 
But the world has structure and we can get an exponential gain by
exploiting some of it 



11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
Recurrent Neural Networks
(RNN)
Recurrent Neural Networks
• Selectively summarize an input sequence in a fixed-size state vector via a
recursive update
26
Recurrent Neural Networks
• Can produce an output at each time step: unfolding the graph tells us how to back-prop
through time.
27
Recurrent Neural Networks
• The RNN gradient is a product of Jacobian matrices, each associated with a step in the
forward computation.
28
29
RNN Tricks
30
Gated Recurrent Units & LSTM
Create a path where gradients can
flow for longer with self-loop
•  Corresponds to an eigenvalue of
Jacobian slightly less than 1
•  LSTM is heavily used
• GRU light-weight version
31
In Practice
32
33
34
35
36
37
Saddle Points During Training
38
Piecewise Linear Nonlinearity
Absolute value rectification works better than tanh
in lower layers of convnet.
Using a rectifier non- linearity (ReLU) instead of
tanh of softplus allows for the first time to train
very deep supervised networks without the need
for unsupervised pre-training; was biologically
motivated
Rectifiers one of the crucial ingredients in
ImageNet breakthrough
39
Dropout Regularizer: Super-
Efficient Bagging
•  Dropouts trick: during training
multiply neuron output by random bit
(p=0.5), during test by 0.5
•  Used in deep supervised networks
•  Similar to denoising auto-encoder,
but corrupting every layer
•  Works better with some non-
linearities (rectifiers, maxout)
40
Batch Normalization
•  Regulaizes & helps to train
41
Early Stopping
•  Beautiful FREE LUNCH (no need to launch many different
training runs for each value of hyper-parameter for #iterations)
•  Monitor validation error during training (after visiting # of
training examples = a multiple of validation set size)
•  Keep track of parameters with best validation error and report
them at the end
•  If error does not improve enough (with some patience), stop. 

42
Sampling Hyperparameters
•  Common approach: manual +
grid search
•  Grid search over
hyperparameters: simple &
wasteful
•  Random search: simple &
efficient 

• Independently sample each HP,

• More convenient: ok to early-
stop, continue further, etc. 

43
Speech
44
Natural Language
Representations
45
46
Analogical Representations for
Free Text
• Semantic relations appear
as linear relationships in the
space of learned
representations
• King – Queen ≈ Man –
Woman
• Paris – France + Italy ≈
Rome
47
•  Intermediate representation of meaning = ‘universal representation’
•  Encoder: from word sequence to sentence representation
•  Decoder: from representation to word sequence distribution
Encoder-Decoder Framework
48
Attention Mechanism for Deep
Learning
•  Consider an input (or intermediate) sequence or image
•  Consider an upper level representation, which can choose « where to
look », by assigning a weight or probability to each input position, as
produced by an MLP, applied at each position
49
Image-to-Text: Caption Generation
with Attention
50
Image-to-Text: Caption Generation
with Attention
51
52
53
54
Unsupervised Representation
Learning
55
Autoencoders
•  RBM 

•  (Denoising) Autoencoder
56
57
58
Insights
•  The big payoff of deep learning is to allow learning
higher levels of abstraction 

•  Higher-level abstractions disentangle the factors of
variation, which allows much easier generalization and
transfer 

59
Hands On
60
Hands On
• Login to NUS SoC VPN with your NUSNET Account
• ssh wesst@kpe.d1.comp.nus.edu.sg
Password: kerasuser
• copy folder demo to a new folder rename demo_<yourname>
61
CPU or GPU?
62
Frameworks
63
64
65
66
67
68
69
70
71
72
73
74
75
19 July 2017 76
References:
NIPS DL Tutorial 2015
CS231n: Convolutional Neural
Networks for Visual Recognition

Más contenido relacionado

La actualidad más candente

Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learningAmr Rashed
 
Digit recognition
Digit recognitionDigit recognition
Digit recognitionbtandale
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networknainabhatt2
 
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...COMPEGENCE
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learningRADO7900
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning TutorialAmr Rashed
 
Neural networks of artificial intelligence
Neural networks of artificial  intelligenceNeural networks of artificial  intelligence
Neural networks of artificial intelligencealldesign
 
Soft Computing-173101
Soft Computing-173101Soft Computing-173101
Soft Computing-173101AMIT KUMAR
 
Neural networks.ppt
Neural networks.pptNeural networks.ppt
Neural networks.pptSrinivashR3
 
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic)  : Dr. Purnima PanditSoft computing (ANN and Fuzzy Logic)  : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima PanditPurnima Pandit
 
Neural Networks Ver1
Neural  Networks  Ver1Neural  Networks  Ver1
Neural Networks Ver1ncct
 
Neural Netwrok
Neural NetwrokNeural Netwrok
Neural NetwrokRabin BK
 
Handwritten Digit Recognition
Handwritten Digit RecognitionHandwritten Digit Recognition
Handwritten Digit Recognitionijtsrd
 
neural networks
 neural networks neural networks
neural networksjoshiblog
 
Soft Computing
Soft ComputingSoft Computing
Soft ComputingMANISH T I
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptxRADO7900
 
Artificial neural network for machine learning
Artificial neural network for machine learningArtificial neural network for machine learning
Artificial neural network for machine learninggrinu
 

La actualidad más candente (20)

Neural networks
Neural networksNeural networks
Neural networks
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Digit recognition
Digit recognitionDigit recognition
Digit recognition
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
Compegence: Dr. Rajaram Kudli - An Introduction to Artificial Neural Network ...
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
 
Neutral Network
Neutral NetworkNeutral Network
Neutral Network
 
Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
Neural networks of artificial intelligence
Neural networks of artificial  intelligenceNeural networks of artificial  intelligence
Neural networks of artificial intelligence
 
Soft Computing-173101
Soft Computing-173101Soft Computing-173101
Soft Computing-173101
 
Neural networks.ppt
Neural networks.pptNeural networks.ppt
Neural networks.ppt
 
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic)  : Dr. Purnima PanditSoft computing (ANN and Fuzzy Logic)  : Dr. Purnima Pandit
Soft computing (ANN and Fuzzy Logic) : Dr. Purnima Pandit
 
Neural Networks Ver1
Neural  Networks  Ver1Neural  Networks  Ver1
Neural Networks Ver1
 
Neural Netwrok
Neural NetwrokNeural Netwrok
Neural Netwrok
 
Handwritten Digit Recognition
Handwritten Digit RecognitionHandwritten Digit Recognition
Handwritten Digit Recognition
 
neural networks
 neural networks neural networks
neural networks
 
Soft Computing
Soft ComputingSoft Computing
Soft Computing
 
Emotion detection using cnn.pptx
Emotion detection using cnn.pptxEmotion detection using cnn.pptx
Emotion detection using cnn.pptx
 
test
testtest
test
 
Artificial neural network for machine learning
Artificial neural network for machine learningArtificial neural network for machine learning
Artificial neural network for machine learning
 

Destacado

Destacado (18)

Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience APIXiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
Xiangen Hu - WESST Keynote - Conversational Tutors and the Experience API
 
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a  ...
Jennifer Pei-Ling Tan - WESST - Fostering 21st Century Literacies through a ...
 
NUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and LearningNUS's Digital Learning Journey in Enhancing Teaching and Learning
NUS's Digital Learning Journey in Enhancing Teaching and Learning
 
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
 Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat... Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
Introducing ALSET & SoC, Opportunities at NUS for Data and Intervention, Dat...
 
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
John Stamper - WESST Keynote - Continuous Improvement of Educational Technolo...
 
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
Xiangen Hu - WESST - AutoTutor, an implementation of Conversation-Based Intel...
 
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning EnvironmentRavi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
Ravi Chandran - WESST - NUS’ Technology Enhanced Learning Environment
 
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
Carolyn Rosé - WESST Keynote - Technology Support for Effective Team-Based Le...
 
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social SciencesTuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
Tuan Q. Phan - WESST - Getting Started on the Computational Social Sciences
 
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
Dragan Gašević - WESST keynote - Critical Directions for Learning Analytics...
 
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
Elizabeth Koh - WESST - Leveraging Learning Analytics to Formatively Assess T...
 
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
Joseph Jay Williams - WESST - Bridging Research and Practice via MOOClets & C...
 
Opening for WESST 2017
Opening for WESST 2017Opening for WESST 2017
Opening for WESST 2017
 
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized InterventionsBibhas Chakraborty - WESST - Dynamically Personalized Interventions
Bibhas Chakraborty - WESST - Dynamically Personalized Interventions
 
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
Carolyn Rosé - WESST - From Data to Design of Dynamic Support for Collaborati...
 
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal InferencePrasanta Bhattacharya - WESST - Social Networks and Causal Inference
Prasanta Bhattacharya - WESST - Social Networks and Causal Inference
 
Jo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce DevelopmentJo Fang - WESST - Applied Research in Workforce Development
Jo Fang - WESST - Applied Research in Workforce Development
 
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
Dragan Gašević - WESST - Learning Analytics Data Policy and Adoption Strate...
 

Similar a Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning

Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsClarence Chio
 
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Gaurav Mittal
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101Felipe Prado
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Impetus Technologies
 
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...gabrielesisinna
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspectiveAnirban Santara
 
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A SurveyNatural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A SurveyRimzim Thube
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Yu Huang
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningHPCC Systems
 
ML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptxML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptxDebabrataPain1
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.pptyang947066
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningPramit Choudhary
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learningPoo Kuan Hoong
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsJon Lederman
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learningViet-Trung TRAN
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRUananth
 
Lecture on Deep Learning
Lecture on Deep LearningLecture on Deep Learning
Lecture on Deep LearningYasas Senarath
 

Similar a Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning (20)

Machine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning SystemsMachine Duping 101: Pwning Deep Learning Systems
Machine Duping 101: Pwning Deep Learning Systems
 
Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
 
DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101DEF CON 24 - Clarence Chio - machine duping 101
DEF CON 24 - Clarence Chio - machine duping 101
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
Deep Learning: Evolution of ML from Statistical to Brain-like Computing- Data...
 
Deep learning
Deep learningDeep learning
Deep learning
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
 
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
Deep Learning in Robotics: Robot gains Social Intelligence through Multimodal...
 
Deep learning from a novice perspective
Deep learning from a novice perspectiveDeep learning from a novice perspective
Deep learning from a novice perspective
 
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A SurveyNatural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
 
Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling Deep learning for 3-D Scene Reconstruction and Modeling
Deep learning for 3-D Scene Reconstruction and Modeling
 
Advancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine LearningAdvancements in HPCC Systems Machine Learning
Advancements in HPCC Systems Machine Learning
 
ML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptxML Module 3 Non Linear Learning.pptx
ML Module 3 Non Linear Learning.pptx
 
deepnet-lourentzou.ppt
deepnet-lourentzou.pptdeepnet-lourentzou.ppt
deepnet-lourentzou.ppt
 
Model Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep LearningModel Evaluation in the land of Deep Learning
Model Evaluation in the land of Deep Learning
 
DSRLab seminar Introduction to deep learning
DSRLab seminar   Introduction to deep learningDSRLab seminar   Introduction to deep learning
DSRLab seminar Introduction to deep learning
 
Neural Networks and Deep Learning Basics
Neural Networks and Deep Learning BasicsNeural Networks and Deep Learning Basics
Neural Networks and Deep Learning Basics
 
Recent progress on distributing deep learning
Recent progress on distributing deep learningRecent progress on distributing deep learning
Recent progress on distributing deep learning
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
 
Lecture on Deep Learning
Lecture on Deep LearningLecture on Deep Learning
Lecture on Deep Learning
 

Último

General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfDr Vijay Vishwakarma
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsKarakKing
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 

Último (20)

General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 

Animesh Prasad and Muthu Kumar Chandrasekaran - WESST - Basics of Deep Learning

  • 1. Animesh Prasad Muthu Kumar Chandrasekaran 
 Basics of Deep Learning Workshop on Educational and Social Science Technologies (WESST 2017, 19—21 July)
  • 6. Machine Learning, AI • Four key ingredients for ML towards AI 1. Lots & lots of data 2. Very flexible models 3.   Enough computing power 4.  Powerful priors that can defeat the curse of dimensionality 
 6
  • 7. Bypassing the curse of dimensionality We need to build compositionality into our ML models Just as human languages exploit compositionality to give representations and meanings to complex ideas Exploiting compositionality gives an exponential gain in representational power (1) Distributed representations / embeddings: feature learning (2) Deep architecture: multiple levels of feature learning Additional prior: compositionality is useful to describe the world around us efficiently 
 7
  • 8. Each feature can be discovered without the need for seeing the exponentially large number of configurations of the other features • Consider a network whose hidden units discover the following features: • Person wears glasses • Person is female
 • Person is a child
 • Etc. If each of n feature requires O(k) parameters, need O(nk) examples Non- parametric methods would require O(nd ) examples 
 8
  • 10. Hidden Units Discover Semantically Meaningful Concepts 
 • Network trained to recognize places, not objects 
 10
  • 11. Why does it work? •   It only works because we are making some assumptions about 
 the data generating distribution 
 •   Worse-case distributions still require exponential data 
 •   But the world has structure and we can get an exponential gain by exploiting some of it 
 
 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16
  • 17. 17
  • 18. 18
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. 22
  • 23. 23
  • 24. 24
  • 26. Recurrent Neural Networks • Selectively summarize an input sequence in a fixed-size state vector via a recursive update 26
  • 27. Recurrent Neural Networks • Can produce an output at each time step: unfolding the graph tells us how to back-prop through time. 27
  • 28. Recurrent Neural Networks • The RNN gradient is a product of Jacobian matrices, each associated with a step in the forward computation. 28
  • 30. 30 Gated Recurrent Units & LSTM Create a path where gradients can flow for longer with self-loop •  Corresponds to an eigenvalue of Jacobian slightly less than 1 •  LSTM is heavily used • GRU light-weight version
  • 32. 32
  • 33. 33
  • 34. 34
  • 35. 35
  • 36. 36
  • 38. 38 Piecewise Linear Nonlinearity Absolute value rectification works better than tanh in lower layers of convnet. Using a rectifier non- linearity (ReLU) instead of tanh of softplus allows for the first time to train very deep supervised networks without the need for unsupervised pre-training; was biologically motivated Rectifiers one of the crucial ingredients in ImageNet breakthrough
  • 39. 39 Dropout Regularizer: Super- Efficient Bagging •  Dropouts trick: during training multiply neuron output by random bit (p=0.5), during test by 0.5 •  Used in deep supervised networks •  Similar to denoising auto-encoder, but corrupting every layer •  Works better with some non- linearities (rectifiers, maxout)
  • 41. 41 Early Stopping •  Beautiful FREE LUNCH (no need to launch many different training runs for each value of hyper-parameter for #iterations) •  Monitor validation error during training (after visiting # of training examples = a multiple of validation set size) •  Keep track of parameters with best validation error and report them at the end •  If error does not improve enough (with some patience), stop. 

  • 42. 42 Sampling Hyperparameters •  Common approach: manual + grid search •  Grid search over hyperparameters: simple & wasteful •  Random search: simple & efficient 
 • Independently sample each HP,
 • More convenient: ok to early- stop, continue further, etc. 

  • 45. 45
  • 46. 46 Analogical Representations for Free Text • Semantic relations appear as linear relationships in the space of learned representations • King – Queen ≈ Man – Woman • Paris – France + Italy ≈ Rome
  • 47. 47 •  Intermediate representation of meaning = ‘universal representation’ •  Encoder: from word sequence to sentence representation •  Decoder: from representation to word sequence distribution Encoder-Decoder Framework
  • 48. 48 Attention Mechanism for Deep Learning •  Consider an input (or intermediate) sequence or image •  Consider an upper level representation, which can choose « where to look », by assigning a weight or probability to each input position, as produced by an MLP, applied at each position
  • 51. 51
  • 52. 52
  • 53. 53
  • 55. 55 Autoencoders •  RBM 
 •  (Denoising) Autoencoder
  • 56. 56
  • 57. 57
  • 58. 58 Insights •  The big payoff of deep learning is to allow learning higher levels of abstraction 
 •  Higher-level abstractions disentangle the factors of variation, which allows much easier generalization and transfer 

  • 60. 60 Hands On • Login to NUS SoC VPN with your NUSNET Account • ssh wesst@kpe.d1.comp.nus.edu.sg Password: kerasuser • copy folder demo to a new folder rename demo_<yourname>
  • 63. 63
  • 64. 64
  • 65. 65
  • 66. 66
  • 67. 67
  • 68. 68
  • 69. 69
  • 70. 70
  • 71. 71
  • 72. 72
  • 73. 73
  • 74. 74
  • 75. 75
  • 76. 19 July 2017 76 References: NIPS DL Tutorial 2015 CS231n: Convolutional Neural Networks for Visual Recognition