[Pr12] dann jaejun yoo

•Descargar como PPTX, PDF•

2 recomendaciones•2,291 vistas

Introduction to domain adversarial training of neural network. (Kor) video : https://www.youtube.com/watch?v=n2J7giHrS-Y&t=1s Papers: A survey on transfer learning, SJ Pan 2009 / A theory of learning from different domains, S Ben-David et al. 2010 / Domain-Adversarial Training of Neural Networks, Y Ganin 2016 Slides I refered: http://www.di.ens.fr/~germain/talks/nips2014_dann_slides.pdf http://john.blitzer.com/talks/icmltutorial_2010.pdf (DA theory part) https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf (DA theory part) https://www.slideshare.net/butest/ppt-3860159 (DA theory part)

Tecnología

Domain Adversarial Training of
Neural Network
PR12와 함께 이해하는
* Domain Adversarial Training of Neural Network, Y. Ganin et al. 2016를 바탕으로 작성한 리뷰
Jaejun Yoo
Ph.D. Candidate @KAIST
PR12
4TH MAY, 2017

Usually we try to…
Test
(target)
Training
(source)

For simplicity, let’s consider the
binary classification problem

일반적인 supervised learning setting: Training
과 test의 domain이 같다고 가정.

전자기기 고객평가 (X) /
긍정 혹은 부정 라벨 (Y)
비디오 게임 고객평가 (X)

전자기기 고객평가 (X) /
긍정 혹은 부정 라벨 (Y)
비디오 게임 고객평가 (X)
NN으로 표현되는 H 함수 공간으로부터….

전자기기 고객평가 (X) /
긍정 혹은 부정 라벨 (Y)
비디오 게임 고객평가 (X)
Classifier h를 학습하는데,
target의 label을 모르지만
source(X,Y)와 target(X)
두 도메인 모두에서 잘 label
을 찾는 h를 찾고 싶다.
NN으로 표현되는 H 함수 공간으로부터….

DANN
TRY TO CLASSIFY WELL WITH
THE EXTRACTED FEATURE!
Ordinary classification
POSITIVE
NEGATIVE
고객 평가 댓글

DANN
Ordinary classification
Domain Classification
전자기기
비디오 게임
TRY TO CLASSIFY WELL WITH
THE EXTRACTED FEATURE!
POSITIVE
NEGATIVE
고객 평가 댓글

DANN
Ordinary classification
Domain Classification
전자기기
비디오 게임
TRY TO CLASSIFY WELL WITH
THE EXTRACTED FEATURE!
POSITIVE
NEGATIVE
고객 평가 댓글
TRY TO EXTRACT
DOMAIN INDEPENDENT FEATURE!

• Combining DA and feature learning within one training process
• Principled way to learn a good representation based on the
generalization guarantee
: minimize the H divergence directly (no heuristic)
“When or when not the DA algorithm works.”
“Why it works.”
DANN

기존 전략: 최대한 적은 parameter로 training
error가 최소인 model을 찾자

이제는 training domain (source)과 testing
domain (target)이 서로 다르다
기존의 전략 외에 다른 전략이 추가로 필요하다.

PREREQUISITE
Different distances
Slide courtesy of Sungbin Lim, DeepBio, 2017

A Bound on the Adaptation Error
1. Difference across all measurable subsets cannot be estimated from
finite samples
2. We’re only interested in differences related to classification error

Idea: Measure subsets where hypotheses in disagree
Subsets A are error sets of one hypothesis wrt another
1. Always lower than L1
2. computable from finite unlabeled samples. (Kifer et al. 2004)
3. train classifier to discriminate between source and target data

A Computable Adaptation Bound
Divergence estimation
complexity
Dependent on number
of unlabeled samples

The optimal joint hypothesis
is the hypothesis with minimal combined error
is that error

REFERENCE
PAPERS
1. A survey on transfer learning, SJ Pan 2009
2. A theory of learning from different domains, S Ben-David et al. 2010
3. Domain-Adversarial Training of Neural Networks, Y Ganin 2016
BLOG
1. http://jaejunyoo.blogspot.com/2017/01/domain-adversarial-training-of-neural.html
2. https://github.com/jaejun-yoo/tf-dann-py35
3. https://github.com/jaejun-yoo/shallow-DANN-two-moon-dataset
SLIDES
1. http://www.di.ens.fr/~germain/talks/nips2014_dann_slides.pdf
2. http://john.blitzer.com/talks/icmltutorial_2010.pdf (DA theory part)
3. https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf (DA theory part)
4. https://www.slideshare.net/butest/ppt-3860159 (DA theory part)
VIDEO
1. https://www.youtube.com/watch?v=h8tXDbywcdQ (Terry Um 딥러닝 토크)
2. https://www.youtube.com/watch?v=F2OJ0fAK46Q (DA theory part)
3. https://www.youtube.com/watch?v=uc6K6tRHMAA&index=13&list=WL&t=2570s (DA theory part)

Más contenido relacionado

La actualidad más candente

Score based Generative Modeling through Stochastic Differential EquationsSungchul Kim

[MIRU2018] Global Average Poolingの特性を用いたAttention Branch NetworkHiroshi Fukui

SSII2019TS: Shall We GANs? ～GANの基礎から最近の研究まで～SSII

Semantic segmentationTakuya Minagawa

PR-302: NeRF: Representing Scenes as Neural Radiance Fields for View SynthesisHyeongmin Lee

第1回NIPS読み会・関西発表資料Takato Horii

Visualizing Data Using t-SNEDavid Khosid

Masked Autoencoders Are Scalable Vision LearnersGuoqingLiu9

画像生成・生成モデルメタサーベイcvpaper. challenge

カーネル法:正定値カーネルの理論Daiki Tanaka

Contrastive learning 20200607ぱんいちすみもと

[DL輪読会]"CyCADA: Cycle-Consistent Adversarial Domain Adaptation"&"Learning Se...Deep Learning JP

SSD: Single Shot MultiBox Detector (ECCV2016)Takanori Ogata

論文紹介 "DARTS: Differentiable Architecture Search"Yuta Koreeda

XGBoost & LightGBMGabriel Cypriano Saca

PR-305: Exploring Simple Siamese Representation LearningSungchul Kim

人工知能概論 3Tadahiro Taniguchi

[기초개념] Graph Convolutional Network (GCN)Donghyeon Kim

Neural word embedding as implicit matrix factorization の論文紹介Masanao Ochi

科学と機械学習のあいだ：変量の設計・変換・選択・交互作用・線形性Ichigaku Takigawa

La actualidad más candente (20)

Score based Generative Modeling through Stochastic Differential Equations

[MIRU2018] Global Average Poolingの特性を用いたAttention Branch Network

SSII2019TS: Shall We GANs? ～GANの基礎から最近の研究まで～

Semantic segmentation

PR-302: NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

第1回NIPS読み会・関西発表資料

Visualizing Data Using t-SNE

Masked Autoencoders Are Scalable Vision Learners

画像生成・生成モデルメタサーベイ

カーネル法:正定値カーネルの理論

Contrastive learning 20200607

[DL輪読会]"CyCADA: Cycle-Consistent Adversarial Domain Adaptation"&"Learning Se...

SSD: Single Shot MultiBox Detector (ECCV2016)

論文紹介 "DARTS: Differentiable Architecture Search"

XGBoost & LightGBM

PR-305: Exploring Simple Siamese Representation Learning

人工知能概論 3

[기초개념] Graph Convolutional Network (GCN)

Neural word embedding as implicit matrix factorization の論文紹介

科学と機械学習のあいだ：変量の設計・変換・選択・交互作用・線形性

Similar a [Pr12] dann jaejun yoo

Introduction to Machine Learning Aristotelis Tsirigos butest

ensemble learningbutest

MachineLearning.pptbutest

Learning when to give up: theory, practice and perspectivesGiuseppe (Pino) Di Fabbrizio

3_learning.pptbutest

Introductionbutest

pptbutest

artificial intelligence.pptxSabthamiS1

Analyse de sentiment et classification par approche neuronale en Python et WekaPatrice Bellot - Aix-Marseille Université / CNRS (LIS, INS2I)

Machine Learningbutest

.pptbutest

Mis End Term Exam Theory ConceptsVidya sagar Sharma

Methodological study of opinion mining and sentiment analysis techniquesijsc

Supervised learningJohnson Ubah

Lecture 7butest

Similar a [Pr12] dann jaejun yoo (20)

Introduction to Machine Learning Aristotelis Tsirigos

ensemble learning

MachineLearning.ppt

Learning when to give up: theory, practice and perspectives

3_learning.ppt

Introduction

ppt

artificial intelligence.pptx

Analyse de sentiment et classification par approche neuronale en Python et Weka

Machine Learning

.ppt

Mis End Term Exam Theory Concepts

Methodological study of opinion mining and sentiment analysis techniques

Supervised learning

Lecture 7

Más de JaeJun Yoo

[PR12] Generative Models as Distributions of FunctionsJaeJun Yoo

[CVPR2020] Simple but effective image enhancement techniquesJaeJun Yoo

Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...JaeJun Yoo

Super resolution in deep learning era - Jaejun YooJaeJun Yoo

A beginner's guide to Style Transfer and recent trendsJaeJun Yoo

[PR12] Spectral Normalization for Generative Adversarial NetworksJaeJun Yoo

Introduction to ambient GANJaeJun Yoo

[PR12] categorical reparameterization with gumbel softmaxJaeJun Yoo

[PR12] understanding deep learning requires rethinking generalizationJaeJun Yoo

[PR12] Capsule Networks - Jaejun YooJaeJun Yoo

[PR12] Inception and Xception - Jaejun YooJaeJun Yoo

[PR12] PixelRNN- Jaejun YooJaeJun Yoo

Variants of GANs - Jaejun YooJaeJun Yoo

[PR12] intro. to gans jaejun yooJaeJun Yoo

Más de JaeJun Yoo (14)

[PR12] Generative Models as Distributions of Functions

[CVPR2020] Simple but effective image enhancement techniques

Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...

Super resolution in deep learning era - Jaejun Yoo

A beginner's guide to Style Transfer and recent trends

[PR12] Spectral Normalization for Generative Adversarial Networks

Introduction to ambient GAN

[PR12] categorical reparameterization with gumbel softmax

[PR12] understanding deep learning requires rethinking generalization

[PR12] Capsule Networks - Jaejun Yoo

[PR12] Inception and Xception - Jaejun Yoo

[PR12] PixelRNN- Jaejun Yoo

Variants of GANs - Jaejun Yoo

[PR12] intro. to gans jaejun yoo

Último

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

Manulife - Insurer Transformation Award 2024The Digital Insurer

Ransomware_Q4_2023. The report. [EN].pdfOverkill Security

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

MINDCTI Revenue Release Quarter One 2024MIND CTI

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

ICT role in 21st century education and its challengesrafiqahmad00786416

A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

A Year of the Servo Reboot: Where Are We Now?Igalia

Corporate and higher education May webinar.pptxRustici Software

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous

[Pr12] dann jaejun yoo

1. Domain Adversarial Training of Neural Network PR12와 함께 이해하는 * Domain Adversarial Training of Neural Network, Y. Ganin et al. 2016를 바탕으로 작성한 리뷰 Jaejun Yoo Ph.D. Candidate @KAIST PR12 4TH MAY, 2017

2. Usually we try to… Test (target) Training (source)

3. For simplicity, let’s consider the binary classification problem

5. 일반적인 supervised learning setting: Training 과 test의 domain이 같다고 가정.

9. TAXONOMY OF TRANSFER LEARNING

10.

11.

12.

13. 전자기기 고객평가 (X) / 긍정 혹은 부정 라벨 (Y)

14. 전자기기 고객평가 (X) / 긍정 혹은 부정 라벨 (Y) 비디오 게임 고객평가 (X)

15. 전자기기 고객평가 (X) / 긍정 혹은 부정 라벨 (Y) 비디오 게임 고객평가 (X) NN으로 표현되는 H 함수 공간으로부터….

16. 전자기기 고객평가 (X) / 긍정 혹은 부정 라벨 (Y) 비디오 게임 고객평가 (X) Classifier h를 학습하는데, target의 label을 모르지만 source(X,Y)와 target(X) 두 도메인 모두에서 잘 label 을 찾는 h를 찾고 싶다. NN으로 표현되는 H 함수 공간으로부터….

17. DANN

18. DANN TRY TO CLASSIFY WELL WITH THE EXTRACTED FEATURE! Ordinary classification POSITIVE NEGATIVE 고객 평가 댓글

19. DANN Ordinary classification Domain Classification 전자기기 비디오 게임 TRY TO CLASSIFY WELL WITH THE EXTRACTED FEATURE! POSITIVE NEGATIVE 고객 평가 댓글

20. DANN Ordinary classification Domain Classification 전자기기 비디오 게임 TRY TO CLASSIFY WELL WITH THE EXTRACTED FEATURE! POSITIVE NEGATIVE 고객 평가 댓글 TRY TO EXTRACT DOMAIN INDEPENDENT FEATURE!

21. DANN Ordinary classification Domain Classification 전자기기 비디오 게임 TRY TO CLASSIFY WELL WITH THE EXTRACTED FEATURE! POSITIVE NEGATIVE 고객 평가 댓글 TRY TO EXTRACT DOMAIN INDEPENDENT FEATURE! e.g. f : compact, sharp, blurry → easy to discriminate the domain ⇓ f : good, excited, nice, never buy, …

22. • Combining DA and feature learning within one training process • Principled way to learn a good representation based on the generalization guarantee : minimize the H divergence directly (no heuristic) “When or when not the DA algorithm works.” “Why it works.” DANN

23. 기존 전략: 최대한 적은 parameter로 training error가 최소인 model을 찾자

24. 이제는 training domain (source)과 testing domain (target)이 서로 다르다 기존의 전략 외에 다른 전략이 추가로 필요하다.

25.

26. PREREQUISITE Different distances Slide courtesy of Sungbin Lim, DeepBio, 2017

27. = 0

28. A Bound on the Adaptation Error 1. Difference across all measurable subsets cannot be estimated from finite samples 2. We’re only interested in differences related to classification error

29. Idea: Measure subsets where hypotheses in disagree Subsets A are error sets of one hypothesis wrt another 1. Always lower than L1 2. computable from finite unlabeled samples. (Kifer et al. 2004) 3. train classifier to discriminate between source and target data

30. A Computable Adaptation Bound Divergence estimation complexity Dependent on number of unlabeled samples

31. The optimal joint hypothesis is the hypothesis with minimal combined error is that error

32. THANKS TO GENERALIZATION GUARANTEE

33. THEORETICAL RESULTS

34. THEORETICAL RESULTS 𝒉 ∈ 𝑯 ⟺ 𝟏 − 𝒉 ∈ 𝑯

35. THEORETICAL RESULTS

36. THEORETICAL RESULTS

37. DANN

38. DANN

39. DANN

40. DANN

41. DANN ↔

42. DANN ↔

43. DANN

44.

45. SHALLOW DANN

46. SHALLOW DANN

47. tSNE RESULTS

48. REFERENCE PAPERS 1. A survey on transfer learning, SJ Pan 2009 2. A theory of learning from different domains, S Ben-David et al. 2010 3. Domain-Adversarial Training of Neural Networks, Y Ganin 2016 BLOG 1. http://jaejunyoo.blogspot.com/2017/01/domain-adversarial-training-of-neural.html 2. https://github.com/jaejun-yoo/tf-dann-py35 3. https://github.com/jaejun-yoo/shallow-DANN-two-moon-dataset SLIDES 1. http://www.di.ens.fr/~germain/talks/nips2014_dann_slides.pdf 2. http://john.blitzer.com/talks/icmltutorial_2010.pdf (DA theory part) 3. https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf (DA theory part) 4. https://www.slideshare.net/butest/ppt-3860159 (DA theory part) VIDEO 1. https://www.youtube.com/watch?v=h8tXDbywcdQ (Terry Um 딥러닝 토크) 2. https://www.youtube.com/watch?v=F2OJ0fAK46Q (DA theory part) 3. https://www.youtube.com/watch?v=uc6K6tRHMAA&index=13&list=WL&t=2570s (DA theory part)

[Pr12] dann jaejun yoo

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a [Pr12] dann jaejun yoo

Similar a [Pr12] dann jaejun yoo (20)

Más de JaeJun Yoo

Más de JaeJun Yoo (14)

Último

Último (20)

[Pr12] dann jaejun yoo