2. Discriminative vs. Generative Learning
Discriminative Learning Generative Learning
Learn 𝑝(𝑦|𝑥) directly Model 𝑝 𝑦 , 𝑝 𝑥 𝑦 first,
Then derive the posterior distribution:
𝑝 𝑦 𝑥 =
𝑝 𝑥 𝑦 𝑝(𝑦)
𝑝(𝑥)
2
3. Undirected Graph vs. Directed Graph
Undirected Directed
• Boltzmann Machines
• Restricted Boltzmann Machines
• Deep Boltzmann Machines
• Sigmoid Belief Networks
• Variational Autoencoders (VAE)
• Generative Adversarial Networks
(GAN)
3
a b
c
a b
c
Deep Belief
Networks
4. Boltzmann Machines
• Stochastic Recurrent Neural Network and Markov Random
Field invented by Hinton and Sejnowski in 1985
• 𝑷 𝒙 =
𝐞𝐱𝐩(−𝑬 𝒙 )
𝒁
> E(x): Energy function
> Z: partition function where σ 𝑥 𝑃 𝑥 = 1
• Energy-based model: positive values all the time
• Single visible layer and single hidden layer
• Fully connected: not practical to implement
4
5. Restricted Boltzmann Machines
• Dimensionality reduction, classification, regression,
collaborative filtering, feature learning and topic modeling
• 𝑷 𝐯 = 𝒗, 𝐡 = 𝒉 =
𝟏
𝒁
𝐞𝐱𝐩(−𝑬 𝒗, 𝒉 )
• Two layers like BMs
• Building blocks of deep probabilistic models
• Gibbs sampling with Contrastive Divergence (CD) or Persistent
CD
5
9. Limitations of DBN (By Ruslan Salakhutdinov)
• Explaining away
• Greedy layer-wise pre-training
> no optimization over all layers
• Approximation inference is feed-forward
> no bottom-up and top-down
9
http://www.slideshare.net/zukun/p05-deep-boltzmann-machines-cvpr2012-deep-learning-methods-for-vision