Mostly paper review of Semantic Image Inpainting with Deep Generative Models, R Yeh et al. CVPR 2017.
Prepared for Lab Seminar at SNU Datamining Center on 20180213.
1. Semantic Image Inpainting
Semantic Image Inpainting with Deep Generative Models,
R Yeh et al. CVPR 2017
LAB SEMINAR
2018.02.13
SNU DATAMINING CENTER
MINKI CHUNG
2. TABLE OF CONTENTS
▸ Motivation
▸ What is image inpainting
▸ Problem statement
▸ Baseline
▸ Semantic image inpainting with Deep Generative Models
▸ My work
▸ Discussion
4. MOTIVATION
▸ What is image inpainting?
https://www.youtube.com/watch?v=1F-6iRrgh1s
5. MOTIVATION
▸ Objective: make an attentive inpainter
IF THE BACKGROUND BEHIND THE OBJECT TO REMOVE IS SIMPLE, EXISTING METHODS WORK FINE
HOWEVER, IF THE BACKGROUND IS COMPLEX, ANOTHER METHOD IS NEEDED
7. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS
▸ DCGAN-based
▸ Not end-to-end:
▸ 1. First train the generator G (on uncorrupted data)
▸ 2. Then find z_hat for inpainting
[Figure: objective combines CONTEXTUAL LOSS and PRIOR LOSS]
https://arxiv.org/abs/1607.07539
8. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS
▸ Hypothesis: a trained G is efficient: an image not drawn from p_data (e.g., a corrupted image) should not lie on the learned manifold of encodings z
▸ Objective: find the encoding z_hat "closest" to the corrupted image while being constrained to the manifold,
z_hat = argmin_z { L_c(z | y, M) + L_p(z) }
▸ y: corrupted image
M: binary mask (same size as the image)
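The two-stage recipe above can be sketched end to end. The following is a minimal PyTorch sketch, not the paper's implementation: the tiny linear G and D, the latent size, λ, the learning rate, and the uniform weights W are all toy assumptions standing in for a trained DCGAN; only the structure of the optimization (gradient descent on z with G and D frozen) follows the paper.

```python
import torch

torch.manual_seed(0)

# Toy stand-ins (assumptions) for a trained DCGAN over 64-pixel flattened images.
G = torch.nn.Sequential(torch.nn.Linear(16, 64), torch.nn.Tanh())
D = torch.nn.Sequential(torch.nn.Linear(64, 1), torch.nn.Sigmoid())
for p in list(G.parameters()) + list(D.parameters()):
    p.requires_grad_(False)          # stage 2 only optimizes z, never G or D

y = torch.rand(64)                   # corrupted image (flattened)
M = torch.ones(64)
M[20:40] = 0.0                       # binary mask: 0 marks the hole
W = M                                # uniform weights for brevity; the paper
                                     # uses a neighbourhood-based W instead
lam = 0.003                          # prior-loss weight λ (assumed value)

z = torch.zeros(16, requires_grad=True)
opt = torch.optim.Adam([z], lr=0.03)

def loss_fn(z):
    Gz = G(z)
    Lc = torch.sum(W * torch.abs(Gz - y))       # contextual loss
    Lp = lam * torch.log(1.0 - D(Gz) + 1e-8)    # prior loss
    return Lc + Lp.squeeze()

start = loss_fn(z).item()
for _ in range(200):                 # find z_hat by descending the total loss
    opt.zero_grad()
    loss_fn(z).backward()
    opt.step()

# blend: keep known pixels from y, fill the hole from G(z_hat)
x_inpainted = M * y + (1 - M) * G(z.detach())
```

After optimization, the known pixels come from y and the hole is filled from G(z_hat), which is the paper's blending step.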
9. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS
▸ Contextual loss: not simply the l1 norm between G(z) and the uncorrupted portion of the input image y; it also takes the corrupted area into account
▸ Weighting term W:
W_i = Σ_{j ∈ N(i)} (1 − M_j) / |N(i)| if M_i ≠ 0, and W_i = 0 if M_i = 0
▸ So,
L_c(z | y, M) = ‖ W ⊙ (G(z) − y) ‖₁
W_i: importance weight at pixel location i
N(i): set of neighbors of pixel i in a local window
(uncorrupted pixels near the hole get a BIGGER WEIGHT)
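The weighting term is easy to compute directly from the mask. A small NumPy sketch (the zero-padding at image borders, i.e., treating out-of-bounds neighbors as uncorrupted, is an assumption):

```python
import numpy as np

def importance_weights(M, window=7):
    """W_i = fraction of corrupted neighbors of pixel i in a local window,
    for uncorrupted pixels; 0 for corrupted pixels. M: 1 = known, 0 = hole."""
    h, w = M.shape
    r = window // 2
    # pad (1 - M) with zeros so every pixel has a full window
    padded = np.pad(1.0 - M, r, mode="constant")
    W = np.zeros((h, w), dtype=float)
    for i in range(h):
        for j in range(w):
            W[i, j] = padded[i:i + window, j:j + window].sum() / (window * window)
    W[M == 0] = 0.0          # corrupted pixels carry no contextual weight
    return W

def contextual_loss(Gz, y, W):
    """L_c = || W ⊙ (G(z) - y) ||_1"""
    return np.sum(W * np.abs(Gz - y))
```

Pixels on the hole boundary see many corrupted neighbors and thus get the largest weights; pixels far from any hole get weight near zero, which is exactly the limitation discussed later in the deck.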
10. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS
▸ Prior loss: measures how realistic the generated image is
▸ Identical to the GAN loss used for training the discriminator D:
L_p(z) = λ log(1 − D(G(z)))
▸ Without L_p, the mapping from y to z may converge to a perceptually implausible result
11. SEMANTIC IMAGE INPAINTING WITH DEEP GENERATIVE MODELS
▸ Tackling points:
▸ Object-level occlusion: narrowing down to object removal
▸ Contextual loss: a pixel that is very far away from any hole plays very little role in the inpainting process
▸ What if we extend this?
▸ Interpretation: we want to see which pixels play the key role in deciding z_hat
▸ → Attention
13. MY WORK
▸ Object-level occlusion: narrowing down to object removal
▸ MS-COCO dataset
▸ Train set: 118,287 images
▸ COCO API: get instance annotations
▸ Use images that contain a person instance smaller than 1/4 and bigger than 1/20 of the image
▸ 30,830 images remain (rescaled to 256×256)
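The size filter above can be sketched as a small predicate. The function name and the strict bounds are assumptions; in practice the instance areas would come from the COCO annotations' `area` fields (e.g., via pycocotools), but here they are just a list of pixel areas:

```python
def keep_person_image(person_areas, img_w, img_h, lo=1 / 20, hi=1 / 4):
    """True if any 'person' instance covers strictly between `lo` and `hi`
    of the image area. `person_areas`: segmentation areas in pixels, as
    reported by the COCO annotation 'area' field."""
    img_area = img_w * img_h
    return any(lo * img_area < a < hi * img_area for a in person_areas)
```

For a 256×256 image (65,536 px), this keeps instances between roughly 3,277 and 16,384 pixels.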
14. MY WORK
▸ Limitation of the contextual loss: parts of the image farther from the hole have less influence on inpainting
▸ Naive approach: for each grid cell of the image, find the pixel influence (attention_ratio) on finding the optimal z_hat
▸ Do it sequentially, grid by grid
[Figure: per-grid attention ratios (values 0.1–0.8) around the occluded region]
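One plausible reading of the grid-by-grid procedure is occlusion-style sensitivity: zero out one grid cell at a time and measure how much a scalar objective (e.g., the contextual loss at the current z_hat) changes. This is a sketch under that assumption; `influence_fn` and the normalization to [0, 1] are illustrative, not the slide's exact method:

```python
import numpy as np

def grid_attention_ratios(influence_fn, img, grid=4):
    """Estimate each grid cell's influence on a scalar objective by zeroing
    the cell and measuring the change (occlusion-sensitivity style)."""
    h, w = img.shape
    gh, gw = h // grid, w // grid
    base = influence_fn(img)
    ratios = np.zeros((grid, grid))
    for gi in range(grid):
        for gj in range(grid):
            probe = img.copy()
            probe[gi * gh:(gi + 1) * gh, gj * gw:(gj + 1) * gw] = 0.0
            ratios[gi, gj] = abs(influence_fn(probe) - base)
    if ratios.max() > 0:
        ratios /= ratios.max()   # normalize to [0, 1] like the slide's values
    return ratios
```

Each cell requires a full evaluation of the objective, which hints at the computational-cost problem reported two slides later.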
15. MY WORK
▸ After finding the optimal attention_ratio for each grid cell
▸ Find the noise z_hat based on the 'Original × Attn_Ratio' image to reconstruct the image
▸ Visualization of pixel influence on inpainting
[Figure panels: ORIGINAL | MASKED | ORIGINAL × ATTN_RATIO]
16. MY WORK
▸ However, because of computational inefficiency, the model was unable to learn
▸ (Current status) Rethinking the attention method
[Figure: results WITHOUT ATTENTION after 1000 EPOCHS vs. WITH ATTENTION after 20 EPOCHS]
18. REFERENCE
▸ Semantic Image Inpainting with Deep Generative Models, Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do, CVPR 2017, https://arxiv.org/abs/1607.07539
▸ MS COCO dataset, http://cocodataset.org/#home