Deep Learning Notes

Deep Learning en acción
Jose María Alvarez | Assoc. Prof. UC3M | josemaria.alvarez@uc3m.es

2Cátedra RTVE-UC3M
Agenda
03
02
01 Resumen de arquitectura y configuración
Visión general de Deep Learning
Keras
Entorno tecnológico
Ejemplos y casos de uso
Resolución de ejemplos

3Cátedra RTVE-UC3M


4Cátedra RTVE-UC3M
En vision por computador…
¿Qué se ve?

5Cátedra RTVE-UC3M
En vision por computador…
¿Qué se ve?

6Cátedra RTVE-UC3M
Visión general
Preguntas iniciales
¿Qué es un sistema de
Deep Learning?
¿Tipología de problemas?
¿Qué son las capas?
¿Criterios para selección del
número de capas?
¿Cuántos nodos por capas?
¿Cómo funciona un sistema de
Deep Learning de forma
general?
¿Qué es una función de activación?
¿Ejemplos?
¿Criterios de selección?
¿Qué es una función de calculo
de pérdida?
¿Ejemplos?
¿Criterios de selección?
¿Cómo se mide el
rendimiento de un Sistema de
Deep Learning?
¿Medidas?

7Cátedra RTVE-UC3M
Visión general
Contexto
Deep Learning by Adam Gibson; Josh PattersonPublished by O'Reilly Media, Inc., 2017

8Cátedra RTVE-UC3M
Visión general
Multilayer neural network topology

9Cátedra RTVE-UC3M
Visión general
Perceptron-One single layer perceptron

10Cátedra RTVE-UC3M
Visión general
Neurona artificial

Visión general
Neurona artificial

Visión general
Elementos básicos
Connection weights
“Weights on connections in a
neural network are coefficients
that scale (amplify or
minimize) the input signal to a
given neuron in the network. In
common representations of
neural networks, these are the
lines/arrows going from one
point to”
25%
Activation functions
“The functions that govern the artificial
neuron’s behavior are called activation
functions. The transmission of that input is
known as forward propagation. Activation
functions transform the combination of
inputs, weights, and biases..”
25%
Biases “Biases are scalar values added to the
input to ensure that at least a few
nodes per layer are activated
regardless of signal strength. Biases
allow learning to happen by giving the
network action in the event of low
signal. They allow the network to try
new interpretations or behaviors.
Biases are generally notated b, and,
like weights, biases are modified
throughout the learning process.”
25%
Loss functions
“Loss functions quantify how close
a given neural network is to the
ideal toward which it is training.
The idea is simple. We calculate a
metric based on the error we observe
in the network’s predictions”
25%

• Text-to-speech synthesis (Fan et al., Microsoft,
Interspeech 2014)
• Language identification (Gonzalez-Dominguez et al.,
Google, Interspeech 2014)
• Large vocabulary speech recognition (Sak et al., Google,
Interspeech 2014)
• Prosody contour prediction (Fernandez et al., IBM,
Interspeech 2014)
• Medium vocabulary speech recognition (Geiger et al.,
Interspeech 2014)
• English-to-French translation (Sutskever et al., Google,
NIPS 2014)
• Audio onset detection (Marchi et al., ICASSP 2014)
• Social signal classification (Brueckner & Schulter,
ICASSP 2014)
• Arabic handwriting recognition (Bluche et al., DAS 2014)
• TIMIT phoneme recognition (Graves et al., ICASSP
2013)
• Optical character recognition (Breuel et al., ICDAR 2013)
• Image caption generation (Vinyals et al., Google, 2014)
• Video-to-textual description (Donahue et al., 2014)
• Syntactic parsing for natural language processing
(Vinyals et al., Google, 2014)
• Photo-real talking heads (Soong and Wang, Microsoft,
2014)
• Automated image sharpening
• Automating image upscaling
• WaveNet: generating human
speech that can imitate
anyone’s voice
• WaveNet: generating
believable classical music
• Speech reconstruction from
silent video
• Generating fonts
• Image autofill for missing
regions
• Automated image
captioning (see
also: https://github.com/karpath
y/neuraltalk2)
• Turning hand-drawn doodles
into stylized artwork
Visión general
Algunos casos de uso…

Arquitectura y
configuración
Visión general de Deep Learning

Arquitectura y configuración
Tipología de redes profundas
Unsupervised
Pretrained
Networks
Convolutional
Neural Networks
Recurrent Neural
Networks
Recursive Neural
Networks

Arquitecturas de redes neuronales

Avances
DBNs, CNN, RBMs, etc.
Tipos de capas
Datos e imágenes
Arquitecturas híbridas
RNN, LSTM, GRU, etc.
Tipos de neuronas
IA como servicio
Tecnología

Arquitectura y
configuración
Funciones de
activación

Funciones de pérdida
Clasificación
Regresión
logística

Funciones de coste/pérdida
Mean absolute error loss (L1)
Mean squared log error loss
Mean Squared Error Loss (L2)
Mean absolute percentage error
Hinge Loss
Logistic Loss
Referencia: https://heartbeat.fritz.ai/5-regression-loss-functions-all-machine-learners-should-know-4fb140e9d4b0

Arquitectura de un sistema de Deep Learning
• Nombre: AlexNet
• Autor:
• Geoffrey Hinton
• 1980
• Aplicación:
• Tareas particulares (visión por
computador)
• Características:
• Convolutional y pooling layers
• Capas totalmente conectadas
• Referencia:
• https://papers.nips.cc/paper/4
824-imagenet-classification-
with-deep-convolutional-
neural-networks.pdf
• Código en Keras:
• https://gist.github.com/JBed/c
2fb3ce8ed299f197eff

• Nombre: VGG Net
• Autor:
• Visual graphics group (Oxford)
• 2014
• Aplicación:
computador)
• Convolutional y pooling layers
(19)
• Entrenamiento lento
• Referencia:
• https://arxiv.org/abs/1409.1556
• https://github.com/keras-
team/keras/blob/master/keras/a
pplications/vgg16.py

• Nombre: GoogleNet (Inception
Network)
• Autor:
• Google
• 2014
• Aplicación:
computador)
• Convolutional y pooling layers (22)
• No secuencialrendimiento
• Entrenamiento más rápido que
VGG
• Referencia:
team/keras/blob/master/keras/ap
plications/inception_v3.py

• Nombre: ResNet
• Residual Networks
• Autor:
• FIXME
• Aplicación:
• Tareas generales (visión por
computador)
• Procesamiento en lotes de la
entrada
• Referencia:
5
team/keras/blob/master/keras/a
pplications/resnet50.py

• Nombre: ResNetX
• Autor:
• Google
• 2014
• Aplicación:
computador)
• Procesamiento en lotes de la
entrada
• Referencia:
• https://arxiv.org/pdf/1611.05431.pdf
• https://github.com/titu1994/Ker
as-ResNeXt

YOLO GAN
(Generative Adversarial Network)

Otras mejoras
Fuente: https://www.datasciencecentral.com/profiles/blogs/24-neural-network-adjustements

Entorno
tecnológico
Tecnología

29Deep Learning en acción
Tecnología disponible
Fuente: https://towardsdatascience.com/deep-learning-framework-power-scores-2018-23607ddf297a
Otra comparación: https://www.kdnuggets.com/2018/03/deep-learning-frameworks.html

Deep Learning as a Service
Fuente: https://www.ibm.com/blogs/research/2018/03/deep-learning-advances/

Infraestructura
Fuente: https://medium.com/work-bench/todays-ai-software-infrastructure-landscape-and-trends-shaping-the-market-
460d0c1c26d2

Ej: Google Cloud Platform

Ej: Amazon
•
Fuente: https://www.morpheusdata.com/blog/2017-09-14-cloud-and-ai-a-work-in-progress-with-unlimited-upside

Ej: AzureML

Ej: Algorithmia
• https://algorithmia.com/

El framework KERAS

Principios de diseño
-API para humanos
Usabilidad
-Fácil de crear nuevas funciones
Extensible
-Tensorflow
-Theano
-CNTKN
Diferentes motores
-Secuencia o grafo.
-Elementos ortogonales: capas, funciones, etc.
Modularidad
-Reutilización de bibliotecas existentes e
integración: pandas, scikit-learn, matplotlib,
numpy, etc.
Trabajo con Python
01
02
03
04
05

Elementos del API
Models
• Secuencia
• API functional
Layers Preprocessing Metrics
Activation
functions
Loss functions Optimizers Callbacks
Utils Datasets Visualization

30 segundos para empezar

Configuración de entorno en Anaconda (Windows)
conda install python=3.6
conda create --name PythonCPU
activate PythonCPU
conda deactivate
conda install -c anaconda keras
conda install spyder
conda install -c anaconda pandas
conda install -c anaconda seaborn
conda install -c anaconda scikit-learn
…

Ejemplos paso a pso

Metodología de trabajo
Métodolo
gía
Tareas y referencias
Descubrimiento de
funcionalidades
Paso 6: Persistencia
Guardar el modelo
Otras operaciones
Paso 5: Test y predicción
Validación del modelo
Predicción …
Paso 4: Entrenamiento
Aprendizaje
Python notebooks
Online: Google colab
Off-line: Jupyter
Definición del problema
Entender el problema y los
datos disponibles.
Paso 1: Gestión de datos
Preparación de los datos para
su entrenamiento y prueba
Paso 2: Arquitectura e implementación
Configuración de la red: capas,
funciones de activación,
pérdida, etc.

Listado de ejemplos y casos de uso
1. Ejemplo 1: Aproximación de una función con regresión lineal
2. Ejemplo 2: El dataset MINST, reconocimiento de caracteres
3. Ejemplo 3: El dataset IRIS, clasificación de flores
4. Ejemplo 4: Predicción de la inversión en un coche
5. Ejemplo 5: Predicción de cáncer de pecho
6. Ejemplo 6: Predicción de spam
7. Ejemplo 7: Predicción de precios de las casas
8. Ejemplo 8: Introducción a RNN
9. Ejemplo 9: Series temporales con Keras
10. Ejemplo 10: Predicción de consumo de energía en casas
11. Ejemplo 11: Mantenimiento predictivo: clasificación
12. Ejemplo 12: Mantenimiento predictivo: regresión

1. http://d2l.ai/ Dive into Deep Learning (Libro)
2. https://blog.slavv.com/37-reasons-why-your-neural-network-is-not-working-
4020854bd607
3. https://github.com/fchollet/keras-resources
Enlaces relevantes

Deep Learning Notes

Recomendados

Recomendados

Más contenido relacionado

Similar a Deep Learning Notes

Similar a Deep Learning Notes (20)

Más de CARLOS III UNIVERSITY OF MADRID

Más de CARLOS III UNIVERSITY OF MADRID (20)

Último

Último (20)

Deep Learning Notes

Notas del editor