SlideShare una empresa de Scribd logo
1 de 48
Seam Carving for Content-Aware Image Resizing Shai  Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
Resize Seam carving & insertion
Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
Introduction
Motivation HTML can support dynamic changes of page layout and text.  Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image  Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the  original image content and the artificially inserted pixels
Application Discrete image resizing Aspect Ration Change,  Image Retarget,  Image Enlarging,  Content Amplification,  Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
background
Image Retarget Seek to change the size of the image while maintaining the important features  Face detector An automatic thumbnail creation [Suh03]   ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency []  [Suh 03] [Selur 04,  decompose image to foreground obj and background origin
Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03],  an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03],  adapting most important region of images to mobile devices.
[Liu 03],  suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
[Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06].  The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain.  Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04]  Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage,  a program that automatically creates a collage image from a collection of images.  [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
Seam-carving operator
Strategies of Image Reduction Original e1 energy Optimal  global remove the lowest energy pixels Pixel remove the least energy in each row
Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
Vertical Seam
Horizontal Seam
Optimal Seam Search
Optimal Seam Search S Dynamic Programming G
e1 energy
Image Energy Preservation The average energy of all pixels during resizing
Energy Functions L1 and L2-norm of the gradient,  saliency measure [Itti 99]
Histogram of Gradient  (HoG) ,[object Object],Dividing the image window into cells For each cell accumulating a local 1-D histogram of gradient directions Normalize cells by the measure of local histogram energy over larger blocks The average gradient image Weighted R-HOG descriptor R-HOG descriptor
Energy Functions Histogram of Gradient  (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
Energy Functions Entropy  Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = +  e1 (x,y)
Energy Functions Segmentation and L1 Image segmentation [Christoudias 02]  Apply e1 on the results
No single e function performs well across all images Similar range for resizing e1 or eHoG works well
Discrete image resizing  Aspect Ratio Change, Retargeting with Optimal Seams-Order,  Image Enlarging, Content Amplification,
Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming =  + min
Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
Content Amplification Amplified Original
Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in  Gradient Domain
Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
Object Removal Origin
Multi-size images Store the pre-computed  representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
Limitations Seam-Carving  does not work well on all images 	Ex:  face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
END

Más contenido relacionado

La actualidad más candente

Project Face Detection
Project Face Detection Project Face Detection
Project Face Detection
Abu Saleh Musa
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural network
Smriti Tikoo
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
Santosh Kumar
 

La actualidad más candente (20)

Project Face Detection
Project Face Detection Project Face Detection
Project Face Detection
 
Facial recognition
Facial recognitionFacial recognition
Facial recognition
 
face-recognition-technology-ppt[1].pptx
face-recognition-technology-ppt[1].pptxface-recognition-technology-ppt[1].pptx
face-recognition-technology-ppt[1].pptx
 
Face identification
Face  identificationFace  identification
Face identification
 
Top 10 Applications of Machine Learning | Machine Learning Application Exampl...
Top 10 Applications of Machine Learning | Machine Learning Application Exampl...Top 10 Applications of Machine Learning | Machine Learning Application Exampl...
Top 10 Applications of Machine Learning | Machine Learning Application Exampl...
 
face detection
face detectionface detection
face detection
 
FACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGYFACE RECOGNITION TECHNOLOGY
FACE RECOGNITION TECHNOLOGY
 
Computer vision ppt
Computer vision pptComputer vision ppt
Computer vision ppt
 
Detection and recognition of face using neural network
Detection and recognition of face using neural networkDetection and recognition of face using neural network
Detection and recognition of face using neural network
 
Face recognition ppt
Face recognition pptFace recognition ppt
Face recognition ppt
 
Face Detection
Face DetectionFace Detection
Face Detection
 
Towards Light-weight and Real-time Line Segment Detection
Towards Light-weight and Real-time Line Segment DetectionTowards Light-weight and Real-time Line Segment Detection
Towards Light-weight and Real-time Line Segment Detection
 
Viola-Jones Object Detection
Viola-Jones Object DetectionViola-Jones Object Detection
Viola-Jones Object Detection
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
COVID-19-Preventions-Control-System and Unconstrained Face-mask and Face-hand...
COVID-19-Preventions-Control-System and Unconstrained Face-mask and Face-hand...COVID-19-Preventions-Control-System and Unconstrained Face-mask and Face-hand...
COVID-19-Preventions-Control-System and Unconstrained Face-mask and Face-hand...
 
Comparative studies on detecting abusive language on twitter
Comparative studies on detecting abusive language on twitterComparative studies on detecting abusive language on twitter
Comparative studies on detecting abusive language on twitter
 
Handwritten bangla-digit-recognition-using-deep-learning
Handwritten bangla-digit-recognition-using-deep-learningHandwritten bangla-digit-recognition-using-deep-learning
Handwritten bangla-digit-recognition-using-deep-learning
 
Textual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie ReviewsTextual & Sentiment Analysis of Movie Reviews
Textual & Sentiment Analysis of Movie Reviews
 
Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.
 
Handwritten Character Recognition
Handwritten Character RecognitionHandwritten Character Recognition
Handwritten Character Recognition
 

Similar a study Seam Carving For Content Aware Image Resizing

F0255046056
F0255046056F0255046056
F0255046056
theijes
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
Kapil Tiwari
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
ijceronline
 

Similar a study Seam Carving For Content Aware Image Resizing (20)

Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
Medial Axis Transformation based Skeletonzation of Image Patterns using Image...
 
Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...Medial axis transformation based skeletonzation of image patterns using image...
Medial axis transformation based skeletonzation of image patterns using image...
 
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
A Detailed Analysis on Feature Extraction Techniques of Panoramic Image Stitc...
 
Oc2423022305
Oc2423022305Oc2423022305
Oc2423022305
 
F0255046056
F0255046056F0255046056
F0255046056
 
Research Paper v2.0
Research Paper v2.0Research Paper v2.0
Research Paper v2.0
 
Design and Implementation of VLSI Architecture for Image Scaling Processor
Design and Implementation of VLSI Architecture for Image  Scaling ProcessorDesign and Implementation of VLSI Architecture for Image  Scaling Processor
Design and Implementation of VLSI Architecture for Image Scaling Processor
 
paper writing
paper writingpaper writing
paper writing
 
Performance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGAPerformance analysis on color image mosaicing techniques on FPGA
Performance analysis on color image mosaicing techniques on FPGA
 
A10 avidan
A10 avidanA10 avidan
A10 avidan
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Image inpainting
Image inpaintingImage inpainting
Image inpainting
 
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
Image Stitching Algorithm: An Optimization between Correlation-Based and Feat...
 
IRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-SegmentationIRJET- Saliency based Image Co-Segmentation
IRJET- Saliency based Image Co-Segmentation
 
Face Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar ClassifiersFace Detection System on Ada boost Algorithm Using Haar Classifiers
Face Detection System on Ada boost Algorithm Using Haar Classifiers
 
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
IRJET- Real Time Implementation of Bi-Histogram Equalization Method on Androi...
 
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...IRJET- Design and Implementation of ATM Security System using Vibration Senso...
IRJET- Design and Implementation of ATM Security System using Vibration Senso...
 
Log polar coordinates
Log polar coordinatesLog polar coordinates
Log polar coordinates
 
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
IRJET- An Approach to FPGA based Implementation of Image Mosaicing using Neur...
 
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
EXTENDED WAVELET TRANSFORM BASED IMAGE INPAINTING ALGORITHM FOR NATURAL SCENE...
 

Más de Chiamin Hsu

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processing
Chiamin Hsu
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwahara
Chiamin Hsu
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarsening
Chiamin Hsu
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
Chiamin Hsu
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface Editing
Chiamin Hsu
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
Chiamin Hsu
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshes
Chiamin Hsu
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videos
Chiamin Hsu
 

Más de Chiamin Hsu (12)

study Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processingstudy Domain Transform for Edge-Aware Image and Video Processing
study Domain Transform for Edge-Aware Image and Video Processing
 
study Image and video abstraction by multi scale anisotropic kuwahara
study  Image and video abstraction by multi scale anisotropic kuwaharastudy  Image and video abstraction by multi scale anisotropic kuwahara
study Image and video abstraction by multi scale anisotropic kuwahara
 
study Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filtersstudy Accelerating Spatially Varying Gaussian Filters
study Accelerating Spatially Varying Gaussian Filters
 
stduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarseningstduy Edge-Based Image Coarsening
stduy Edge-Based Image Coarsening
 
study Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Imagesstudy Streaming Multigrid For Gradient Domain Operations On Large Images
study Streaming Multigrid For Gradient Domain Operations On Large Images
 
study Shading Based Surface Editing
study Shading Based Surface Editingstudy Shading Based Surface Editing
study Shading Based Surface Editing
 
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Imagesstudy Diffusion Curves: A Vector Representation for Smooth-Shaded Images
study Diffusion Curves: A Vector Representation for Smooth-Shaded Images
 
study Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshesstudy Image Vectorization using Optimized Gradeint Meshes
study Image Vectorization using Optimized Gradeint Meshes
 
study Latent Doodle Space
study Latent Doodle Spacestudy Latent Doodle Space
study Latent Doodle Space
 
study Coded Aperture
study Coded Aperturestudy Coded Aperture
study Coded Aperture
 
study Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videosstudy Active Refocusing Of Images And Videos
study Active Refocusing Of Images And Videos
 
study Dappled Photography
study Dappled Photographystudy Dappled Photography
study Dappled Photography
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

study Seam Carving For Content Aware Image Resizing

  • 1. Seam Carving for Content-Aware Image Resizing Shai Aidan (Mitsubishi Electric Research Labs) Ariel Shamir (The Interdisciplinary Center & MERL) ACM SIGGRAPH 2007
  • 2. Resize Seam carving & insertion
  • 3. Abstract Seams are optimal 8-connected paths of pixels cross the image Carving out or inserting seams to achieve content-aware resizing
  • 4. Outline Introduction Background Seam-carving operator Discrete image resizing Multi-size images Limitations Conclusions and future work
  • 6. Motivation HTML can support dynamic changes of page layout and text. Why can not an image deform to fit different layout automatically ? iGoogle How about aspect ratio of an image , such as fitting photo into PDA or phone cells ? Solution ? Resize – content independent Crop – remove pixels from the image periphery only
  • 7. Basic Idea of Seam-Carving Use energy function to define the importanceof pixels Define seam-carving image operator Image reduction Carving out seams - the connected low energy pixels crossing the image Preserving the image structure Image enlarging Insert seams on low energy area The order of seam insertion ensures a balance between the original image content and the artificially inserted pixels
  • 8. Application Discrete image resizing Aspect Ration Change, Image Retarget, Image Enlarging, Content Amplification, Seam Carving in gradient domain, Object Removal Multi-size images An image can continuously change their size in a content-aware manner Storing the order of seam removal and insertion
  • 10. Image Retarget Seek to change the size of the image while maintaining the important features Face detector An automatic thumbnail creation [Suh03] ROI Fisheye-View warp [Liu and Gleicher 05, 06] Visual saliency [] [Suh 03] [Selur 04, decompose image to foreground obj and background origin
  • 11. Saliency map [Itti IEEE99] Simulate neuroscience of human visual system Pyramid tech. to compute 3 feature maps, color, intensity and orientation [Suh 03], an automatic thumbnail creation, based on either a saliency map or the output of a face detector [Chen 03], adapting most important region of images to mobile devices.
  • 12. [Liu 03], suggesting to trade time for space. Given a collection of regions of interest, they construct an optimal path through these regions and display them serially.
  • 13. [Santella et al. 06] use eye tracking, in addition to composition rules to crop images intelligently.
  • 14. ROI (Region-Of-Interest) Such a method was proposed by [Liu and Gleicher 05, 06] for image and video retargeting. For image retargeting they find ROI and construct a novel Fisheye-View warp that essentially applies a piecewise linear scaling function in each dimension to the image. This way the ROI is maintained while the rest of the image is warped. The retargeting can be done in interactive rates, once the ROI is found, so the user can control the desired size of the image by moving a slider. In their video retargeting work they use a combination of image and saliency maps to find the ROI. Then they use a combination of cropping, virtual pan and shot cuts to retarget the video frames.
  • 15. Feature-aware warping The first solution to the general problem of warping an image into an arbitrary shape while preserving user-specified features was recently proposed by [Gal et al. 06]. The feature-aware warping is achieved by a particular formulation of the Laplacian editing technique, suited to accommodate similarity constraints on parts of the domain. Since local constraints are propagated by the global optimization process, not all the constraints can always be satisfied at once
  • 16. Seam Perfect seams to combine parts of a set of photo into a single composite picture [Agarwala et al. 04] Drag-and-Drop Pasting that extends the Poisson Image Editing to computer an optimal boundary (seam) between the source picture and target images [Jia et al. 06] AutoCollage, a program that automatically creates a collage image from a collection of images. [Rother et al. 06] Simultaneously solve matting and compositing. They allow the user to scale the size of the foreground object and paste it back on the original background. [Wang , Cohen 06] evaluated several cost functions for seamless image stitching and concluded that minimizing an L1error norm between the gradients of the stitched image and the gradients of the input images performed well in general [Zomet et al. 05]
  • 17. Sear Optimal Seam Dijkstra’s shortest path algorithm [98] Dynamic programming [Efros 01] Graph cuts [Kwatra 03]
  • 19. Strategies of Image Reduction Original e1 energy Optimal global remove the lowest energy pixels Pixel remove the least energy in each row
  • 20. Strategies of Image Reduction Original e1 energy Column removing columns with minimal energy Crop find a sub-win with the highest energy
  • 24. Optimal Seam Search S Dynamic Programming G
  • 26. Image Energy Preservation The average energy of all pixels during resizing
  • 27. Energy Functions L1 and L2-norm of the gradient, saliency measure [Itti 99]
  • 28.
  • 29. Energy Functions Histogram of Gradient (HoG) [Dalal and Triggs 95] max(HoG(I(x,y)) makes sure the seams run parallel to the edge of objects and not cross them
  • 30. Energy Functions Entropy Compute the entropy over a 9 x 9 window and add it to e1 eEntropy(x,y) = + e1 (x,y)
  • 31. Energy Functions Segmentation and L1 Image segmentation [Christoudias 02] Apply e1 on the results
  • 32. No single e function performs well across all images Similar range for resizing e1 or eHoG works well
  • 33. Discrete image resizing Aspect Ratio Change, Retargeting with Optimal Seams-Order, Image Enlarging, Content Amplification,
  • 34. Aspect Ratio Change Carving-out /insert seams Original Original Original 1D aspect ratio changing
  • 35. 2D aspect ratio changing Optimal Seams-Order Search + DynamicProgramming = + min
  • 36. Retargeting with Optimal Seams-Order h-first alternate v-first Transport map Original optimal
  • 37. Image Enlarging Find first k seams for removal Duplicate them in order to arrive at I(-k) origin I(t): smaller image after t seam-carving I(-k): enlarged image after k seam insertion t I(-1) I(-k) I(t) I(-k) enlarged image insert seams in order of removal
  • 38. Image Enlarging (>50%) origin Break into several steps Each step does not enlarge the size of image more than a fraction
  • 40. Seam Carving in the Gradient Domain Seam + Poisson Reconstruction [Perez 03] Compute e function Work on the gradient domain Remove seams from the x and y derivatives of the original image Use Poisson Reconstruction original retarget retarget in Gradient Domain
  • 41. Object Removal Mark the removing target Remove seams until all the marked pixels are gone * Employ seam insertion to maintain the original size
  • 43. Multi-size images Store the pre-computed representation that encodes, for each pixel in V/H map The index of the seam that removed it The negative index of the seam that inserted it Blue (first seam)  Red (last seam) origin V(i,j)=t : pixel (i,j) removed by t-th vertical seam H(i,j)=t : pixel (i,j) removed by t-th horizontal seam
  • 44. Limitations Seam-Carving does not work well on all images Ex: face Origin Crop Scale Constraint the face Face the flower Bottom up feature detection
  • 45. Limitations The amount of content Too density, no “less” important area The layout of the image content origin origin
  • 46. Conclusions Present a content-aware resizing using the seam-carving image operator Seams are the optimal paths on a single image Carve-out seams Insert seams Application of seam-carving operator Aspect ratio change, image retargeting, content amplification, object removal Multi-size images that support continuous resizingin real-time
  • 47. Future Work Video resizing Combination of scaling and seam-carving Define more robust multi-size image Better solution to combine horizontal and vertical seams in multi-size image
  • 48. END

Notas del editor

  1. Shai Aidan 以色列人MERL(Mitsubishi Electric Research Labs)沒想到以色列人投效到日本研究中心