Learning with side information through modality hallucination (2016)


Learning with side information through modality hallucination, J. Hoffman et al., CVPR2016 http://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Hoffman_Learning_With_Side_CVPR_2016_paper.html

Terry Taewoong Um (terry.t.um@gmail.com)
University of Waterloo
Department of Electrical & Computer Engineering
LEARNING WITH SIDE INFORMATION THROUGH MODALITY HALLUCINATION (2016)

BEYOND SUPERVISED / UNSUPERVISED
• Various learning scenarios: supervised learning, semi-supervised learning, weakly-supervised learning
  “Is object localization for free? Weakly-supervised learning with convolutional neural networks (2015)”, M. Oquab et al.
  “Bayesian Semisupervised Learning with Deep Generative Models (2017)”, J. Gordon et al.
• Learning with side information (modality)
  [Figure: (training) and (test) panels]

MISSING INPUT DURING TEST
[Figure: (training) and (test) panels; labels: Couch, zero-padding…?, ???]

MISSING INPUT DURING TEST
[Figure: (training) and (test) panels; labels: Couch, ???, generate]

MISSING INPUT DURING TEST
[Figure: (training) and (test) panels; labels: ???, generate, Couch]

HALLUCINATION
[Figure: (training) and (test) panels]
The red and blue networks should produce similar features.
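The formula on this slide did not survive extraction. As a minimal sketch of a feature-matching penalty, assuming the squared Euclidean distance between sigmoid-squashed activations (the form used in the cited paper), and assuming the two feature sets are the depth network's and the hallucination network's activations, one could write the following. The function names, the array shape, and the toy data are hypothetical.

```python
import numpy as np

def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))

def hallucination_loss(depth_feat, halluc_feat):
    """Squared L2 distance between sigmoid-squashed activations.

    Assumption: both inputs are activations taken from the same layer
    of the depth network and the hallucination network.
    """
    diff = sigmoid(depth_feat) - sigmoid(halluc_feat)
    return float(np.sum(diff ** 2))

# Toy data standing in for real activations (the shape is hypothetical).
rng = np.random.default_rng(0)
depth_pool5 = rng.normal(size=(256, 6, 6))
halluc_pool5 = depth_pool5 + 0.1 * rng.normal(size=depth_pool5.shape)

loss_close = hallucination_loss(depth_pool5, halluc_pool5)  # similar features
loss_far = hallucination_loss(depth_pool5, -depth_pool5)    # dissimilar features
```

The loss is zero only when the two feature maps agree, so minimizing it pushes the hallucinated features toward the depth features.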

RELATED WORKS
• RGB-D detection: exploits depth images
• Transfer learning and domain adaptation: transfer knowledge from a depth image to an RGB image
• Learning using privileged information: training with a teacher
  x: X-ray
  x*: clinician’s interpretation
  y: cancer (yes/no)
• Distillation: the output of one network is used as the target for a new network.
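To make the distillation bullet concrete, here is a minimal sketch assuming the common formulation in which a student is trained with cross-entropy against the teacher's temperature-softened softmax outputs. The temperature value, function names, and logits are illustrative assumptions and do not come from the slides.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Cross-entropy of the student against the teacher's softened outputs."""
    target = softmax(teacher_logits, temperature)
    pred = softmax(student_logits, temperature)
    return float(-np.mean(np.sum(target * np.log(pred + 1e-12), axis=-1)))

# Hypothetical logits for one example with three classes.
teacher = np.array([[2.0, 0.5, -1.0]])
loss_matched = distillation_loss(teacher, teacher)              # student agrees
loss_mismatched = distillation_loss(teacher, teacher[:, ::-1])  # student disagrees
```

The loss is smallest when the student reproduces the teacher's output distribution, which is the sense in which one network's output serves as the target for another.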

LOSS FUNCTION
• Hallucination
• Classification
• Localization
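The equations on this slide were images and are not in the extracted text. As a hedged sketch of how the three labeled terms could combine, assuming a weighted sum over the network streams: the weights \(\gamma, \alpha, \beta\) and the index sets \(\mathcal{N}_{\text{loc}}, \mathcal{N}_{\text{cls}}\) are notation introduced here for illustration and should be checked against the cited paper.

```latex
\mathcal{L}
  = \gamma \, \mathcal{L}_{\text{hallucinate}}
  + \alpha \sum_{n \in \mathcal{N}_{\text{loc}}} \mathcal{L}_{\text{loc}}^{\,n}
  + \beta  \sum_{m \in \mathcal{N}_{\text{cls}}} \mathcal{L}_{\text{cls}}^{\,m}
```

Here \(\mathcal{L}_{\text{hallucinate}}\) is the feature-matching term, and each \(\mathcal{L}_{\text{loc}}^{\,n}\) and \(\mathcal{L}_{\text{cls}}^{\,m}\) is the localization or classification loss of one network stream.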

SEVERAL ISSUES
• Training & initialization: first train the RGB-Net and the D-Net, then copy the D-Net weights to the H-Net
• Which layer to hallucinate? Pool5
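The initialization step can be sketched as follows, assuming each network's parameters are held in a dictionary of arrays. The layer names, shapes, and the `init_net` helper are hypothetical stand-ins for real trained networks.

```python
import copy
import numpy as np

rng = np.random.default_rng(0)

def init_net():
    """Hypothetical two-layer parameter dict standing in for a conv net."""
    return {
        "conv1": rng.normal(size=(8, 3)),
        "pool5_proj": rng.normal(size=(4, 8)),
    }

rgb_net = init_net()    # assumed already trained on RGB images
depth_net = init_net()  # assumed already trained on depth images

# Initialize the hallucination net from the depth net's weights.
# A deep copy keeps the two networks independent during later training.
halluc_net = copy.deepcopy(depth_net)
```

After this step the H-Net starts from depth-trained weights and is then trained further with RGB input.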

RESULTS
• On a new dataset (PASCAL VOC 2007)
• On the dataset used for training (NYUD2)

RESULTS
[Figure: example results in two groups: RGB-D-H (O) with RGB (X), and RGB-D-H (X) with RGB (O)]

SUMMARY
• If you have a missing modality at test time
  (or an additional modality at training time),
  hallucinate!
• Good idea, but no in-depth understanding…
• How can an RGB image “imagine” its missing depth image?
  (Can we visualize
• Is the learned H-Net generalizable to new images?
• Is this method effective for other modalities as well?
• Can we propose a domain-specific hallucination architecture?
• We may exploit more information (modalities) at training time than at run time
• Beyond supervised / unsupervised settings…
