Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016)
@DocXavi
Deep Learning for Computer Vision
Beyond vision
5 May 2016
Xavier Giró-i-Nieto
Master en
Creació Multimedia
When robots open their eyes...
...is because they have learned to see.
Learning only to see ?
Learning only to see ?
Nexi, from the MIT Media Lab (Photo: Spencer Lowel)
Big data
Internet of things - IoT
Learning only to see ?
Personal data
Big data
Learning only to see ?
Atlas, from Boston Dynamics
Robust manipulation and motion
Learning only to see ?
Playing with other computers
Learning only to see ?
Mnih, Volodymyr, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. "Playing Atari with deep reinforcement learning." arXiv preprint arXiv:1312.5602 (2013).
DeepMind
Playing with humans
AlphaGo (Google DeepMind)
Learning only to see ?
Autonomous Driving
Google Self-driving car
Learning only to see ?
Visual arts
Google Research, “Going deeper into neural networks” - DeepDream (2015)
Learning only to see ?
Google Research, “Going deeper into neural networks” - DeepDream (2015)
Visual arts
Only open their eyes ?
http://turing.deepart.io/
Visual arts
Learning only to see ?
Music composition
Manuel Araoz, “Training a Recurrent Neural Network to Compose Music” (2016).
Learning only to see ?
Poetry
Ross Goodwin, Neuralsnap (2016).
Learning only to see ?
“Scripts” (!?)
Darknet
JON
He leaned close and onions, barefoot from his shoulder. "I am not a purple
girl," he said as he stood over him. "The sight of you sell your father with you a
little choice."
"I say to swear up his sea or a boy of stone and heart, down," Lord Tywin
said. "I love your word or her to me."
Learning only to see ?
Public Health
Announcement of Google DeepMind Health (24/02/2016)
Learning only to see ?
Nacho Hernandez, “Why artificial intelligence will democratize healthcare” (TEDx Talk, 2014)
Public health
Learning only to see ?
Nancy Lublin, “The heartbreaking text that inspired a crisis helpline” (TED Talk 2015)
Mental health
Learning only to see ?
Psychological support and counseling ?
Learning only to see ?
Affective computing
Rana el Kaliouby, “This app knows how you feel, from the look on your face”, TEDTalks 2015.
Learning only to see ?
Nexi Project, from MIT Media Lab (Photos: Spencer Lowel)
[video]
Affective computing
Learning only to see ?
“Google’s chairman (Eric Schmidt) thinks artificial intelligence will let scientists solve some of the world’s "hard problems," like population growth, climate change, human development, and education.” (Bloomberg Business, 11/01/2016)
[+info @ MIT Technology Review]
Artificial intelligence
Google’s CEO Sundar Pichai: “Era Of Computers Will End Very Soon, AI Will Rule” (Fossbytes, 03/05/2016)
Artificial intelligence
Jeremy Howard, “The wonderful and terrifying implications of computers that can learn”, TEDTalks 2014.
Artificial intelligence
Artificial intelligence
Stephen Hawking, “Artificial intelligence could spell the end of the human race.” (2014)
Artificial intelligence
Elon Musk (Tesla), one of OpenAI's promoters
Artificial intelligence
Neil Lawrence, “OpenAI won’t benefit humanity without open data sharing” (The Guardian, 14/12/2015)
PhD Comics: Who owns your data? (Hint: it is not you)
Artificial intelligence
Xavier Sala-i-Martin (Columbia University), “Les conclusions del Fòrum de Davos” (TV3, 03/02/2016) - in Catalan
Carles Boix (Princeton University), “La quarta revolució industrial” (Diari Ara, 08/02/2016) - in Catalan
Artificial intelligence
Artificial intelligence
Open question: in which jobs will robots replace humans first ?
Source: 25 Best jobs in America (Glassdoor)
Data scientist.
The best job in the world ?
The best job in the world ?
Summer internships for PhD students related to Data Analytics.
The Economist, “Million-dollar babies” (02/04/2016)
The best job in the world ?
Nature, “AI talent grab sparks excitement and concern” (26/04/2016)
The best job in the world ?
Learn more with Nat & Lo 20% Google Project:
Learn more
Learn more
● Friendly slides for dissemination (family & friends).
[Available on slideshare.net]
Learn more
● Friendly presentation for family and friends (Catalan version).
[Available on slideshare.net]
Learn more
Keras http://keras.io/
TensorFlow https://www.tensorflow.org/
Caffe http://caffe.berkeleyvision.org/
Torch (Overfeat) http://torch.ch/
Theano http://deeplearning.net/software/theano/
MatConvNet (VLFeat) http://www.vlfeat.org/matconvnet/
CNTK (Microsoft) http://www.cntk.ai/
MXNet https://github.com/dmlc/mxnet
Learn more
Source: @fchollet
?
Learn more
Learn more
http://cs231n.stanford.edu/
Learn more
“Machine learning” sub-Reddit.
Learn more
Learn more
Grup d’estudi de machine learning Barcelona
Learn more
● Reading Group with public listing of videos, slides and papers.
More details on the website by Professor Jordi Torres.
Learn more
Learn more
[Website]
Learn more
[Website]
Conclusions
Thanks a lot !
Slides available at:
https://imatge.upc.edu/web/people/xavier-giro
@DocXavi
/ProfessorXavi

Deep Learning for Computer Vision (4/4): Beyond vision @ laSalle 2016

  • 1. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) @DocXavi Deep Learning for Computer Vision Beyond vision 5 May 2016 Xavier Giró-i-Nieto Master en Creació Multimedia
  • 2. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) 2 When robots open their eyes... ...is because they have learned to see. Learning only to see ?
  • 3. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Learning only to see ? Nexi, del MIT Media Lab (Foto: Spencer Lowel)
  • 4. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Big data Internet of things - IoT Learning only to see ?
  • 5. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Personal data Big data Learning only to see ?
  • 6. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Atlas, de Boston Dynamics Robust manipulation and motion Learning only to see ?
  • 7. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Playing with other computers Learning only to see ? Mnih, Volodymyr, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. "Playing atari with deep reinforcement learning." arXiv preprint arXiv:1312.5602 (2013). DeepMind
  • 8. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Playing with humans AlphaGo (Google DeepMind) Learning only to see ?
  • 9. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Autonomous Driving Google Self-driving car Learning only to see ?
  • 10. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Visual arts Google Research, “Going deeper into neural networks” - DeepDream (2015) Learning only to see ?
  • 11. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Google Research, “Going deeper into neural networks” - DeepDream (2015) Visual arts Only open their eyes ?
  • 12. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) http://turing.deepart.io/ Visual arts Learning only to see ?
  • 13. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Music composition Manuel Araoz, “Training a Recurrent Neural Network to Compose Music” (2016). Learning only to see ?
  • 14. Xavier Giró i Nieto, “Deep learning beyond vision”. Master in Multimedia, La Salle URL (May 2016) Poetry Ross Goodwin, Neuralsnap (2016). Learning only to see ?
  • 15. “Scripts” (!?) Darknet. JON He leaned close and onions, barefoot from his shoulder. "I am not a purple girl," he said as he stood over him. "The sight of you sell your father with you a little choice." "I say to swear up his sea or a boy of stone and heart, down," Lord Tywin said. "I love your word or her to me." Learning only to see?
  • 16. Public Health. Announcement of Google DeepMind Health (24/02/2016). Learning only to see?
  • 17. Nacho Hernandez, “Why artificial intelligence will democratize healthcare” (TEDx Talk, 2014). Public health. Learning only to see?
  • 18. Nancy Lublin, “The heartbreaking text that inspired a crisis helpline” (TED Talk 2015). Mental health. Learning only to see?
  • 19. Psychological support and counseling? Learning only to see?
  • 20. Affective computing. Rana el Kaliouby, “This app knows how you feel, from the look on your face”, TEDTalks 2015. Learning only to see?
  • 21. Nexi Project, from MIT Media Lab (Photos: Spencer Lowell) [video]. Affective computing. Learning only to see?
  • 22. “Google’s chairman (Eric Schmidt) thinks artificial intelligence will let scientists solve some of the world’s "hard problems," like population growth, climate change, human development, and education.” (Bloomberg Business, 11/01/2016) [+info @ MIT Technology Review]. Artificial intelligence.
  • 23. Google’s CEO Sundar Pichai: “Era Of Computers Will End Very Soon, AI Will Rule” (Fossbytes, 03/05/2016). Artificial intelligence.
  • 24. Jeremy Howard, “The wonderful and terrifying implications of computers that can learn”, TEDTalks 2014. Artificial intelligence.
  • 25. Artificial intelligence. Stephen Hawking, “Artificial intelligence could spell the end of the human race.” (2014)
  • 26. Artificial intelligence.
  • 27. Elon Musk (Tesla), one of OpenAI's promoters. Artificial intelligence.
  • 28. Neil Lawrence, “OpenAI won’t benefit humanity without open data sharing” (The Guardian, 14/12/2015). PhD Comics: “Who owns your data? (Hint: it is not you)”. Artificial intelligence.
  • 29. Xavier Sala-i-Martin (Columbia University), “Les conclusions del Fòrum de Davos” [The conclusions of the Davos Forum] (TV3, 03/02/2016) - in Catalan. Carles Boix (Princeton University), “La quarta revolució industrial” [The fourth industrial revolution] (Diari Ara, 08/02/2016) - in Catalan. Artificial intelligence.
  • 30. Artificial intelligence. Open question: in which jobs will robots replace humans first?
  • 31. Source: 25 Best jobs in America (Glassdoor). Data scientist. The best job in the world?
  • 32. The best job in the world? Summer internships for PhD students related to Data Analytics.
  • 33. The Economist, “Million-dollar babies” (02/04/2016). The best job in the world?
  • 34. Nature, “AI talent grab sparks excitement and concern” (26/04/2016). The best job in the world?
  • 35. Learn more with Nat & Lo 20% Google Project: Learn more
  • 36. Learn more ● Friendly slides for dissemination (family & friends). [Available on slideshare.net]
  • 37. Learn more ● Friendly presentation for family and friends. [Available on slideshare.net]
  • 38. Learn more. Keras: http://keras.io/; TensorFlow: https://www.tensorflow.org/; Caffe: http://caffe.berkeleyvision.org/; Torch (Overfeat): http://torch.ch/; Theano: http://deeplearning.net/software/theano/; MatConvNet (VLFeat): http://www.vlfeat.org/matconvnet/; CNTK (Microsoft): http://www.cntk.ai/; MXNet: https://github.com/dmlc/mxnet
  • 39. Learn more. Source: @fchollet
  • 40. ? Learn more
  • 41. Learn more: http://cs231n.stanford.edu/
  • 42. Learn more: “Machine learning” subreddit.
  • 43. Learn more
  • 44. Learn more: Grup d’estudi de machine learning Barcelona (Barcelona machine learning study group)
  • 45. Learn more ● Reading Group with public listing of videos, slides and papers.
  • 46. More details on the website by Professor Jordi Torres. Learn more
  • 47. Learn more [Website]
  • 48. Learn more [Website]
  • 49. Conclusions
  • 50. Thanks a lot! Slides available at: https://imatge.upc.edu/web/people/xavier-giro @DocXavi /ProfessorXavi