Machine learning and deep learning excel at pattern recognition and provide state-of-the-art models for computer vision, NLP, predictive applications, and many other artificial intelligence tasks. Neural networks are a mathematical model of how scientists think the brain performs certain computations, but for decades they lacked the memory and context-recall characteristics of the brain. Although recurrent neural networks (like LSTMs) handle memory and context, their memory size does not scale, which prevents them from storing rich, meaningful information over time and across contexts.
To solve this, researchers at Google's DeepMind recently proposed a model that combines a neural network with an external memory store, designed to simulate the way neuroscientists think the brain stores and retrieves memories. The system is fully differentiable, which means that, through calculus-based optimization, it can learn to manage its own inner workings and memory I/O from scratch using training data.
The authors demonstrated use cases where the model was trained on graph data, such as family trees or the London Underground transport network, and the system learned to use its memory to answer questions about navigation routes and family relations hidden in the data. The model shows great potential for new AI applications that learn on their own how to work with complex data structures (like graphs) without explicit programming. However, the scarce information available is neuroscience-oriented, which makes it difficult to grasp for software engineers, developers, and data scientists.
We'll demystify this fascinating architecture using analogies and examples familiar to software engineers, developers, and data scientists, and provide intuitions that make the model easier to understand and adopt, unlocking a completely new type of AI application.
4. DNC basic idea
•Memory-augmented neural network
•Neural network with I/O access to external memory
•I/O operations are learned instead of programmed
5. DNC basic idea
•Von Neumann computer architecture:
•CPU: in the DNC, the CPU is a neural network (the "controller")
•Memory: a separate external memory bank accessed by the CPU via read/write operations (a minimal sketch follows below)
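A minimal NumPy sketch of this split, assuming illustrative sizes and the erase-then-add write rule described in the Nature paper (the class and parameter names are my own, not the paper's):

```python
import numpy as np

class ExternalMemory:
    """Sketch of the DNC's memory bank: the controller never indexes rows
    directly; it reads and writes through soft weightings over locations."""

    def __init__(self, n_locations=16, word_size=8):
        self.M = np.zeros((n_locations, word_size))

    def read(self, w):
        # w: soft weighting over locations (sums to 1); returns a blend of rows
        return w @ self.M

    def write(self, w, erase, add):
        # differentiable erase-then-add update of the weighted locations
        self.M = self.M * (1 - np.outer(w, erase)) + np.outer(w, add)
```

Because both operations are plain matrix arithmetic, gradients can flow through them, which is what makes the whole system trainable end to end.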
6. Neuroscience meets AI and CS
•The basic architecture and memory allocation (assign and release) are based on computer science.
•Memory access (read) and retrieval are based on neuroscience (the hippocampus).
7. High level architecture
•A neural network called the controller (the CPU) performs computation on input data
•Read/write heads perform I/O from and to memory
•The controller interacts with the read/write heads to use "memories" for computation
8. DNC vs Neural Network
•Neural networks excel at pattern recognition, perception tasks, sensory recognition and reactive decision making (mapping inputs X to outputs Y), but they can't be used for:
•Planning and reasoning tasks
•Using "memories" and facts from previous events
•Storing useful information for future use
•Generalizing knowledge to new tasks (AGI)
•Working with complex data structures, like associative ones (graphs or trees)
9. DNC vs Neural Network
•The DNC tries to solve this by mixing the best of both worlds (a memory-based architecture and machine learning):
•Perception and pattern recognition capabilities from machine learning
•Planning and reasoning based on previous memories and knowledge
•Usage of complex associative data structures
•Like a computer, it can organize knowledge, data and facts, as well as links between them, but like a neural network it needs no explicit programming, because it can learn to do so from examples (data).
10. Knowledge retrieval
•The DNC decides which "memories" to retrieve based on "attention mechanisms", which can be described from both computational and neuroscience perspectives, especially hippocampal synapses.
•Foundations of Human Memory, by Michael Kahana, provides key human-memory concepts with which the DNC has analogies.
Computational: which external memory locations to read and write?
Neuroscience: how does the brain retrieve and relate stored "memories"?
11. Memory(attribute vectors)
•The external memory is a real-number matrix (N×W).
•Attribute theory: every human memory is represented by a list of attributes which describe the memory itself and its context.
Computational: RAM with N positions and word size W.
Neuroscience: human memories are represented as a list of W attributes.
13. Content-based (similarity) access
•The controller (CPU) can emit a key vector and read from (or write to) the memory locations that best match the key (a sketch follows the comparison below).
•Neuroscience proposes a model where we remember events when exposed to a similar experience.
Computational: retrieve a weighted sum of memory values, weighted by similarity to a specific key; the similarity can be cosine similarity.
Neuroscience: we recall (or reinforce) past experiences when exposed to similar ones.
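A minimal NumPy sketch of content-based addressing, following the softmax-over-cosine-similarity scheme from the paper (the sizes and the key-strength value are illustrative):

```python
import numpy as np

def content_addressing(memory, key, beta):
    """Soft content-based read: weight each memory row by its cosine
    similarity to the key, sharpened by the key strength beta."""
    norms = np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8
    similarity = memory @ key / norms        # cosine similarity per row
    weights = np.exp(beta * similarity)      # softmax keeps it differentiable
    return weights / weights.sum()

# usage: retrieve a blend of the memories most similar to the key
memory = np.random.randn(8, 4)               # (N, W) memory matrix
key = memory[3] + 0.1 * np.random.randn(4)   # noisy copy of row 3
w = content_addressing(memory, key, beta=5.0)
read_vector = w @ memory                      # weighted sum of memory rows
```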
15. Time-ordered access (temporal links)
•The system records the order in which memory locations are written (a sketch follows the comparison below).
•Temporal Context Model: it is easier for us to remember and recall events in the order they occurred (try saying the alphabet in random order vs. in order).
Computational: a linked list of the memory positions written, ordered by time.
Neuroscience: recall/retrieve memories in the order they occurred.
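A simplified sketch of the temporal link update (the variable names are mine; the update mirrors the link-matrix equations in the Nature paper):

```python
import numpy as np

def update_temporal_link(link, precedence, write_w):
    """link[i, j] ~ 'location i was written right after location j'."""
    # decay links involving freshly written locations, then add new ones
    link = (1 - write_w[:, None] - write_w[None, :]) * link \
           + np.outer(write_w, precedence)
    np.fill_diagonal(link, 0.0)              # no self-links
    # precedence: soft pointer to the most recently written location(s)
    precedence = (1 - write_w.sum()) * precedence + write_w
    return link, precedence

# reading "forward" or "backward" in time then follows the write order:
#   forward_w  = link   @ last_read_w
#   backward_w = link.T @ last_read_w
```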
17. Short-term and Long-term Memory
•Although not mandatory, the controller can be an LSTM (long short-term memory) neural network, which provides short-term memory.
•Search of Associative Memory (SAM): the SAM model proposes that our memory is a dual store: a short-term store and a long-term store.
Computational: short-term memory provided by the LSTM neural network controller.
Neuroscience: the SAM model of dual memory stores.
19. Dynamic Memory Allocation
•In addition to writing by content, the DNC can allocate and release memory as a computer does, based on memory usage and read orderings (a sketch follows the comparison below).
•The DNC can choose to write to new locations, update existing ones (reinforcing memories), or not write at all.
Computational: dynamic memory administration.
Neuroscience: add new memories or reinforce existing ones.
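A sketch of the allocation weighting: the least-used locations receive the most allocation weight, computed so the whole operation stays differentiable (the usage values below are illustrative):

```python
import numpy as np

def allocation_weighting(usage):
    """Prefer writing to the least-used memory locations (DNC-style)."""
    order = np.argsort(usage)            # least-used locations first
    alloc = np.zeros_like(usage)
    cumprod = 1.0
    for idx in order:
        alloc[idx] = (1 - usage[idx]) * cumprod
        cumprod *= usage[idx]
    return alloc

# usage: one value per location in [0, 1]; freed locations get low usage
usage = np.array([0.9, 0.1, 0.5, 1.0])
print(allocation_weighting(usage))       # highest weight at the least-used slot
```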
23. Complete architecture
At each time step (clock cycle) the DNC:
•Receives an input and computes an output that is a weighted sum of its input and the "memories" retrieved from memory.
•Decides how to interact with the memory (where and what to read and write) via an "interface vector".
•Passes the "memories" it read on to the next time step (see the sketch below).
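A toy, single-read-head version of one time step, with random stand-ins for the learned weight matrices; writing is omitted for brevity, and all names and sizes here are illustrative, not the paper's:

```python
import numpy as np

rng = np.random.default_rng(0)
N, W, X = 16, 8, 4                      # memory slots, word size, input size

Wc = rng.normal(size=(X + W, 4))        # controller weights (toy stand-in)
Wk = rng.normal(size=(X + W, W))        # produces the read key ("interface")
Wr = rng.normal(size=(W, 4))            # mixes the read vector into the output

def dnc_step(memory, prev_read, x):
    state = np.concatenate([x, prev_read])       # input + last step's read
    key = state @ Wk                             # part of the interface vector
    sim = memory @ key / (np.linalg.norm(memory, axis=1)
                          * np.linalg.norm(key) + 1e-8)
    w = np.exp(sim); w /= w.sum()                # soft read weighting
    read = w @ memory                            # retrieved "memory"
    y = state @ Wc + read @ Wr                   # output mixes both paths
    return y, read                               # read feeds the next step

y, read = dnc_step(rng.normal(size=(N, W)), np.zeros(W), rng.normal(size=X))
```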
25. How the DNC decides I/O
How does the DNC learn and decide how to interact with memory?
•This is the differentiable part of the DNC.
•Every component of the system uses weights similar to those of a neural network.
•Thus it can be trained via gradient descent and multivariate calculus optimization.
•Using samples (data), the system learns how to behave optimally (see the toy example below).
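A toy end-to-end illustration of that differentiability: fitting a read key by plain gradient descent so the soft read returns a chosen memory row. Finite differences stand in here for the autodiff a real framework provides; everything else is illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
memory = rng.normal(size=(8, 4))
target = memory[5]                        # we want the read to return row 5
key = rng.normal(size=4)                  # the parameter being learned

def read(key):
    sim = memory @ key / (np.linalg.norm(memory, axis=1)
                          * np.linalg.norm(key) + 1e-8)
    w = np.exp(5.0 * sim); w /= w.sum()   # soft, differentiable addressing
    return w @ memory

def loss(key):
    return np.sum((read(key) - target) ** 2)

for step in range(300):                   # plain gradient descent on the key
    grad = np.array([(loss(key + 1e-5 * e) - loss(key - 1e-5 * e)) / 2e-5
                     for e in np.eye(4)])
    key -= 0.1 * grad

print(loss(key))                          # typically far smaller than at start
```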
28. Potential applications
•Graph reasoning problems.
•DeepMind trained the DNC on many random graphs:
•It learned to use its memory to navigate through the graphs.
•Then two specific graphs were fed to it:
- The London Underground graph
- A family tree
•Surprisingly, it was able to generalize without retraining (AGI?)
29. Potential applications
•Reinforcement learning
•It was tested on a grid game where:
•The player (agent) is given a set of goals and constraints per goal.
•It is then asked to satisfy a single goal.
•It has to plan and reason about how to achieve the goal.
•It stored the goals and constraints in memory.
30. Thanks for your attention
•My contact:
-LinkedIn: https://www.linkedin.com/in/luis-fernando-leal-hernandez-9a736276/
-Email: wichofer89@gmail.com
-Github: https://github.com/llealgt/DNC/
•References and illustrations thanks to:
•"Hybrid computing using a neural network with dynamic external memory", Nature 538, 471–476 (October 2016), doi:10.1038/nature20101.
•"Implementation and Optimization of Differentiable Neural Computers", Carol Hsin, Stanford University.
•"Differentiable memory and the brain", Sam Greydanus, https://greydanus.github.io/2017/02/27/differentiable-memory-and-the-brain/