7 steps for highly effective deep neural networks


Natalino will provide a fast-paced lecture on recognizing handwritten numbers with ANNs. In recent years, much progress has been made on image recognition: multi-layer perceptrons, inception networks, skip and residual nets, LSTMs, attention networks. In this talk, Natalino will go through 7 different architectures for classifying handwritten numbers using Keras and TensorFlow. For each neural network architecture, he will spend some time on the theory, the coding, and the quality of the results. Code and fun included! Original file: https://drive.google.com/file/d/0BwNrPuGaMi8PbVhUYUVKWUhGRjQ

Natalino Busa - Head of Data Science

Natalino Busa - @natbusa
O’Reilly Author and Speaker
Teradata EMEA Practice Lead on Open Source Technologies
Teradata Principal Data Scientist
ING Group Enterprise Architect: Cybersecurity, Marketing, Fintech
Cognitive Finance Group Advisory Board Member
Philips Senior Researcher, Data Architect
LinkedIn and Twitter: @natbusa

Linear Regression:
How to best fit a line to some data
X: independent variable
Y: dependent variable
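Fitting a line to data can be sketched in a few lines of plain Python using ordinary least squares. The function name `fit_line` and the sample points are illustrative assumptions, not taken from the slides.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope is covariance(x, y) divided by variance(x)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on y = 2x + 1
slope, intercept = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```

Here X is the independent variable and Y the dependent one; the fitted slope and intercept are the two parameters of the model.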

Classification: Handwritten digits
28x28 pixels
input: 784 numbers
output: 10 classes
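The 784 inputs and 10 outputs follow directly from the image size and the number of digit classes. A minimal sketch, assuming the image is a nested list of pixel values and the label is encoded one-hot (the helper names `flatten` and `one_hot` are hypothetical):

```python
def flatten(image):
    """Turn a 28x28 grid of pixel values into a flat list of 784 numbers."""
    return [pixel for row in image for pixel in row]


def one_hot(digit, num_classes=10):
    """Encode a digit label as a vector with a single 1 at the digit's index."""
    return [1.0 if i == digit else 0.0 for i in range(num_classes)]


# Hypothetical blank image and label
image = [[0.0] * 28 for _ in range(28)]
x = flatten(image)
y = one_hot(3)
```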

Sharing the (Not so) Secret Lore:
Keras & TensorFlow
Some smaller projects:
TFLearn, TensorLayer
http://keras.io/

1: Single Layer Perceptron
“dendrites”
Axon’s response
Activation function

More activation functions:
https://en.wikipedia.org/wiki/Activation_function
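A few of the common activation functions, written out in plain Python as a rough sketch. The selection (sigmoid, tanh, ReLU, ELU) is an assumption based on the functions named elsewhere in this deck.

```python
import math


def sigmoid(z):
    """Squash z into (0, 1); written to avoid overflow for large negative z."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def tanh(z):
    """Squash z into (-1, 1)."""
    return math.tanh(z)


def relu(z):
    """Pass positive values through, clamp negative values to 0."""
    return max(0.0, z)


def elu(z, alpha=1.0):
    """Like ReLU for z > 0, but decays smoothly towards -alpha for z < 0."""
    return z if z > 0 else alpha * (math.exp(z) - 1.0)
```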

1: Single Layer Perceptron (binary classifier)
Single Layer Neural Network
Takes n input features and maps them to a soft “binary” space
Diagram: x1, x2, …, xn → ∑ → f
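The diagram (inputs x1…xn, a sum ∑, then an activation f) can be sketched as a single function. This assumes a sigmoid for f; the weights and inputs below are made up for illustration.

```python
import math


def perceptron(inputs, weights, bias):
    """Weighted sum of the inputs followed by a sigmoid activation f."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1.0 / (1.0 + math.exp(-total))


# Hypothetical weights for a 3-input neuron; the weighted sum here is 0
p = perceptron([1.0, 0.0, 1.0], [0.5, -0.3, -0.5], 0.0)
```

The output lies strictly between 0 and 1, which is the soft "binary" space the slide refers to.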

1: Single Layer Perceptron (multi-class classifier)
From soft binary space to predicting probabilities:
Take n inputs, divide by the sum of the predicted values
- Values between 0 and 1
- Sum of all outcomes = 1
It produces an estimate of a probability!
Diagram: x1, x2, …, xn → ∑ → f → ∑ f (softmax)
Example output: ‘1’: 95%, ‘8’: 5%
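A minimal softmax sketch in plain Python. Note that the standard softmax exponentiates each score before dividing by the sum, which is what guarantees positive values; the max-subtraction is a common numerical-stability trick and the example scores are hypothetical.

```python
import math


def softmax(scores):
    """Map raw class scores to values in (0, 1) that sum to 1."""
    shift = max(scores)  # subtracting the max does not change the result
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three classes
probs = softmax([2.0, 1.0, 0.1])
```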

1: Single Layer Perceptron
Minimize costs:
The cost function depends on:
- Parameters of the model
- How the model “composes”
Goal: reduce the mean probability error
Modify the parameters to reduce the error! Vintage math from last century.
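"Modify the parameters to reduce the error" is gradient descent. The sketch below assumes a single sigmoid neuron and a mean cross-entropy cost; the toy data, the learning rate, and the function names are hypothetical.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def cost(weights, bias, samples):
    """Mean cross-entropy between predictions and 0/1 labels."""
    total = 0.0
    for x, y in samples:
        p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(samples)


def gradient_step(weights, bias, samples, learning_rate=0.5):
    """One gradient-descent update of the weights and bias."""
    grad_w = [0.0] * len(weights)
    grad_b = 0.0
    for x, y in samples:
        p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
        error = p - y  # derivative of the cost w.r.t. the weighted sum
        for i, xi in enumerate(x):
            grad_w[i] += error * xi
        grad_b += error
    n = len(samples)
    new_weights = [w - learning_rate * g / n for w, g in zip(weights, grad_w)]
    new_bias = bias - learning_rate * grad_b / n
    return new_weights, new_bias


# Hypothetical toy data: the label is 1 when the first feature is larger
samples = [([1.0, 0.0], 1), ([0.0, 1.0], 0), ([0.9, 0.2], 1), ([0.1, 0.8], 0)]
weights, bias = [0.0, 0.0], 0.0
before = cost(weights, bias, samples)
for _ in range(100):
    weights, bias = gradient_step(weights, bias, samples)
after = cost(weights, bias, samples)
```

With all-zero parameters every prediction is 0.5, so the starting cost is log 2; each step moves the parameters against the gradient and the cost drops.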

Supervised Learning
Stack layers of perceptrons
- Feed Forward Network
- Scoring goes from input to output
- Back propagate the error from output to input
Diagram labels: input parameters → feed-forward functions → SOFTMAX → classes (estimated probabilities) → cost function (supervised: actual output) → back-propagate errors
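The feed-forward and back-propagation passes can be sketched for a tiny network with one sigmoid hidden layer and a softmax output. Everything specific here (layer sizes, weight values, the absence of bias terms, the function names) is an assumption made to keep the example short.

```python
import math


def forward(x, W1, W2):
    """Feed-forward pass: input -> sigmoid hidden layer -> softmax output."""
    hidden = [1.0 / (1.0 + math.exp(-sum(w * xi for w, xi in zip(row, x))))
              for row in W1]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in W2]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    probs = [e / total for e in exps]
    return hidden, probs


def loss(x, target, W1, W2):
    """Cross-entropy cost against the supervised (actual) class index."""
    _, probs = forward(x, W1, W2)
    return -math.log(probs[target])


def backward(x, target, W1, W2):
    """Back-propagate the error from output to input; return both gradients."""
    hidden, probs = forward(x, W1, W2)
    # Error at the softmax output: predicted probabilities minus one-hot target
    d_scores = [p - (1.0 if k == target else 0.0) for k, p in enumerate(probs)]
    grad_W2 = [[d * h for h in hidden] for d in d_scores]
    # Push the error back through W2 and the sigmoid derivative h * (1 - h)
    d_hidden = [sum(d_scores[k] * W2[k][j] for k in range(len(W2)))
                * hidden[j] * (1.0 - hidden[j])
                for j in range(len(hidden))]
    grad_W1 = [[d * xi for xi in x] for d in d_hidden]
    return grad_W1, grad_W2


# Hypothetical tiny network: 3 inputs, 2 hidden units, 2 classes
x, target = [0.5, -1.0, 2.0], 1
W1 = [[0.1, -0.2, 0.3], [0.4, 0.0, -0.1]]
W2 = [[0.2, -0.3], [-0.5, 0.6]]
grad_W1, grad_W2 = backward(x, target, W1, W2)
```

A useful sanity check for any hand-written backward pass is to compare a gradient entry with a finite-difference estimate of the loss.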

Let’s go!

1. Single Layer Perceptron

TensorBoard
2. Multi Layer Perceptron
3. Convolution
From Krizhevsky et al. (2012)

3. Convolution
Diagrams:
By Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45659236
By Aphex34 - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=45673581
CC0, https://en.wikipedia.org/w/index.php?curid=48817276
Convolution, max pooling, RELU / ELU
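The three building blocks named on this slide (convolution, ReLU, max pooling) can be sketched on a small grid in plain Python. The 5x5 image and 2x2 kernel are hypothetical, and the "convolution" is the unflipped sliding dot product that ConvNet libraries typically compute.

```python
def conv2d(image, kernel):
    """'Valid' 2D convolution: slide the kernel and take dot products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]


def relu(feature_map):
    """Clamp negative responses to 0."""
    return [[max(0, v) for v in row] for row in feature_map]


def max_pool(feature_map, size=2):
    """Non-overlapping max pooling with a size x size window."""
    return [[max(feature_map[r + i][c + j]
                 for i in range(size) for j in range(size))
             for c in range(0, len(feature_map[0]) - size + 1, size)]
            for r in range(0, len(feature_map) - size + 1, size)]


# Hypothetical 5x5 image and 2x2 diagonal-difference kernel
image = [[1, 2, 3, 4, 5],
         [5, 4, 3, 2, 1],
         [1, 2, 3, 4, 5],
         [5, 4, 3, 2, 1],
         [1, 2, 3, 4, 5]]
kernel = [[1, 0],
          [0, -1]]
convolved = conv2d(image, kernel)           # 4x4 feature map
features = max_pool(relu(convolved))        # 2x2 after pooling
```

The 5x5 input shrinks to 4x4 after the 2x2 convolution and to 2x2 after pooling, which is how these layers reduce spatial size while keeping the strongest responses.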

3. Batch Normalization

3. Batch Normalization (example for MLP)
Activation function: SIGMOID

3. Batch Normalization (example for MLP)
Activation function: RELU
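What batch normalization computes can be sketched as follows. This covers only the training-time statistics of one mini-batch, and it simplifies the learned per-feature scale and shift to scalar `gamma` and `beta`; those simplifications and the sample values are assumptions.

```python
import math


def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature over the mini-batch, then scale and shift."""
    n = len(batch)
    num_features = len(batch[0])
    means = [sum(row[j] for row in batch) / n for j in range(num_features)]
    variances = [sum((row[j] - means[j]) ** 2 for row in batch) / n
                 for j in range(num_features)]
    return [[gamma * (row[j] - means[j]) / math.sqrt(variances[j] + eps) + beta
             for j in range(num_features)]
            for row in batch]


# Hypothetical mini-batch: 4 samples, 2 features on very different scales
batch = [[1.0, 100.0], [2.0, 200.0], [3.0, 300.0], [4.0, 400.0]]
normalized = batch_norm(batch)
```

After normalization both features have roughly zero mean and unit variance, so a following sigmoid or ReLU layer sees inputs on a comparable scale.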

4. Regularization: Prevent overfitting in ANNs
- Batch Normalization
- RELU/ELU
- RESIDUAL / SKIP Networks
- DROP LAYER
- REDUCE PRECISION (HUFFMAN ENCODING)
In general, ANNs are parameter-rich; constraining the parameter space usually produces better results and speeds up learning.
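As one illustration of the list above, here is a sketch of dropout, assuming that "DROP LAYER" refers to a dropout layer (the slides do not say so explicitly). It uses the common "inverted" variant, which rescales the surviving units so their expected sum is unchanged; the function name and values are hypothetical.

```python
import random


def dropout(activations, rate, rng):
    """Zero each unit with probability `rate` and rescale the survivors."""
    if rate == 0.0:
        return list(activations)
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
hidden = [1.0] * 10
dropped = dropout(hidden, rate=0.5, rng=rng)
```

Dropout is applied during training only; at prediction time the layer is usually skipped, which corresponds to `rate=0.0` here.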

5. Inception architectures
Cannot be stacked!

5. Inception architectures
Compression: avoid dimension explosion
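A rough worked example of why compression matters, assuming it refers to the 1x1 convolutions that Inception modules place before larger filters to reduce the number of channels. The channel counts below are hypothetical and biases are ignored.

```python
def conv_weights(kernel_size, in_channels, out_channels):
    """Number of weights in a square convolution layer (biases ignored)."""
    return kernel_size * kernel_size * in_channels * out_channels


# A 5x5 convolution applied directly to 256 input channels
direct = conv_weights(5, 256, 64)
# The same 5x5 convolution, preceded by a 1x1 "compression" down to 32 channels
compressed = conv_weights(1, 256, 32) + conv_weights(5, 32, 64)
```

With these numbers the direct layer needs 409,600 weights and the compressed pair 59,392, roughly a sevenfold reduction.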

5. Inception architectures (top architecture)

6. Residual Networks
https://culurciello.github.io/tech/2016/06/04/nets.html
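The core idea of a residual (skip) connection is that a block outputs its input plus a learned correction, y = x + F(x). A minimal sketch, where a small dense layer with ReLU stands in for F; the layer choice and weights are assumptions for illustration.

```python
def dense_relu(x, weights):
    """A small fully connected layer followed by ReLU; stands in for F(x)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in weights]


def residual_block(x, weights):
    """Add the layer's output to its own input: y = x + F(x)."""
    fx = dense_relu(x, weights)
    return [xi + fi for xi, fi in zip(x, fx)]


x = [1.0, -2.0]
# With all-zero weights F(x) is 0, so the block passes x through unchanged
identity = residual_block(x, [[0.0, 0.0], [0.0, 0.0]])
# Hypothetical non-zero weights: F(x) = [1.0, 0.0]
y = residual_block(x, [[1.0, 0.0], [0.0, 1.0]])
```

Because the identity mapping is the default behaviour, stacking many such blocks does not make the signal harder to propagate, which is the usual argument for why very deep residual networks train well.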

6. Residual + Inception Networks

7. LSTM on Images
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
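One way to run an LSTM on an image is to treat it as a sequence, for example feeding a 28x28 digit row by row as 28 time steps of 28 pixels. The slides do not spell out the exact scheme, so the row-by-row choice, the parameter layout, and all names below are assumptions. The cell itself follows the standard LSTM equations described in the linked article.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step. `params` maps each gate name to (W, U, b)."""
    def gate(name, activation):
        W, U, b = params[name]
        return [activation(sum(w * xi for w, xi in zip(W[k], x))
                           + sum(u * hi for u, hi in zip(U[k], h_prev))
                           + b[k])
                for k in range(len(b))]

    forget = gate("forget", sigmoid)        # how much old cell state to keep
    write = gate("input", sigmoid)          # how much new candidate to add
    candidate = gate("candidate", math.tanh)
    output = gate("output", sigmoid)        # how much cell state to expose
    c = [f * cp + i * g
         for f, cp, i, g in zip(forget, c_prev, write, candidate)]
    h = [o * math.tanh(ci) for o, ci in zip(output, c)]
    return h, c


def run_lstm_over_rows(image, params, hidden_size):
    """Feed an image row by row, one row per time step; return the last h."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for row in image:
        h, c = lstm_step(row, h, c, params)
    return h


def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


# Hypothetical all-zero parameters for 28-pixel rows and 4 hidden units
hidden_size, row_size = 4, 28
params = {name: (zeros(hidden_size, row_size),
                 zeros(hidden_size, hidden_size),
                 [0.0] * hidden_size)
          for name in ("forget", "input", "candidate", "output")}
image = [[0.5] * row_size for _ in range(28)]
h, c = lstm_step(image[0], [0.0] * hidden_size, [1.0] * hidden_size, params)
last_h = run_lstm_over_rows(image, params, hidden_size)
```

In a classifier, the final hidden state would typically be passed to a softmax layer over the 10 digit classes.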

7. LSTM on ConvNets (bonus slide)

All the codez :)
https://github.com/natbusa/deepnumbers

Meta-References
… just a few articles, but extremely dense in content.
A must-read!
https://keras.io/
http://karpathy.github.io/neuralnets/
https://culurciello.github.io/tech/2016/06/04/nets.html
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
https://gab41.lab41.org/batch-normalization-what-the-hey-d480039a9e3b
