SlideShare una empresa de Scribd logo
1 de 40
Descargar para leer sin conexión
Saliency Detection via
 Divergence Analysis

Jia-Bin Huang        Narendra Ahuja

 Electrical and Computer Engineering
University of Illinois, Urbana-Champaign
Jia-Bin Huang         Narendra Ahuja
jbhuang1@illinois.edu
   n-ahuja@illinois.edu
What this talk is about?
1.  The saliency detection problem

2.  A unifying framework for bottom-up
    saliency detection algorithms

3.  Ways to improve the performance
Look for the unique item
 Clap when you find it
What is Saliency?
•  Visual salience (or visual saliency) is the
    distinct subjective perceptual quality
    which makes some items in the world
    stand out from their neighbors and
    immediately grab our attention

                Where to look?
Example of Stimulus



 Color       Orientation




Motion     Natural scene
Why We Care?
•    Adaptive image compression
•    Object-of-interest image segmentation
•    Automatic image thumbnail
•    (Class-independent) Object detection and recognition
•    Visual tracking
•    Automatic Image collage
•    Content-aware image resizing
•    Non-photorealistic rendering
•    Understanding mechanism of human visual attention
Problem Setting
•  Input: Image -> Output: Saliency map
Design Principles
•  Rarity
   –  [Bruce NIPS 05] [Zhang JOV 08] [Rahtu ECCV 10] [Klein ICCV 11] [Borji
       CVPR 12]
•  Local complexity
   –  [Kadir IJCV 01]
•  Contrast
   –  [Itti PAMI 98] [Harel NIPS 05] [Ma MM 03] [Achanta CVPR 09] [Achanta
       ICIP 10] [Cheng CVPR 11] [Goferman CVPR 10] [Perazzi CVPR 12]
•  Spectral
   –  [Hou CVPR 07] [Guo CVPR 08] [Hou PAMI 12] [Li PAMI 12]
•  Learning
   –  [Liu CVPR 07] [Judd ICCV 09] [Oliva ICIP 03] [Torralba Psycho.Rev 06]
Complexity!                         Contrast!
Surprise!                                Spectral!                Learning!
            Information!       Rarity!                     WTH?
Which one is better?
Main Result
•  Most of the bottom-up saliency detection
   algorithms can be rewritten in the form of
   divergence between probabilistic
   distributions learned from center and
   surround
Center-Surround Divergence
Kullback-Leibler Divergence
From Center to Surround
From Center to Surround
From Surround to Center
From Surround to Center
From Surround to Center
Symmetrised Divergence
Symmetrised Divergence
How to Choose Support?
Experimental Results
•  Datasets
  –  MSRA salient object detection dataset
  –  1000 groundtruth binary mask are available from
      [Achanta CVPR 09]
•  Evaluation metric
  –  Precision and recall curves
State-of-the-art
Saliency Detection Methods
–  Center to surround
   •  AIM [Bruce NIPS 05] SUN [Zhang JOV 08], SW [Rahtu ECCV
       10]
–  Surround to center
   •  CA [Goferman CVPR 10], AC [Achanta ICIP 10], FT [Achanta
       CVPR 09], LC [Zhai-ACMMM 06]
–  Symmetrised divergence
   •  HC, RC, [Cheng CVPR 11], IT [Itti PAMI 98], GB [Harel NIPS
      07]
–  Spectrum-based
   •  SR [Hou CVPR 07]
•  Rarity-based
   –  AIM: image specific
   –  SUN: image indep.
•  Rarity-based
   –  SW: Patch/window
       based surround +
       scale space analysis
•  Contrast-based
   –  GB: spatial weighting
      for adaptive surround
•  Contrast-based
   –  AC: adaptive surround
      support
•  Contrast-based
   –  RC: region-based
      center support
•  Contrast-based
   –  FT: color feature
   –  LC: intensity feature
Lessons Learned
•  Most of the bottom-up saliency detection
   algorithms are in fact closely related
  –  Not exhaustive, e.g., spectral-based methods
•  How to improve the performance?
  –  Richer features
  –  Less approximation
  –  Adaptive center/surround support
Future Work
Input image     Saliency Map      Error Map




              Method parameters
Questions?
•  Comments or questions:

Jia-Bin Huang
jbhuang1@illinois.edu
https://netfiles.uiuc.edu/jbhuang1/www/
References
•  Saliency Detection via Divergence Analysis: An Unified Perspective
   J.B. Huang and N. Ahuja. ICPR, 2012
•  Frequency-tuned salient region detection
   R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk., CVPR 2009
•  Saliency detection using maximum symmetric surround.
   R. Achanta and S. Susstrunk, ICIP, 2010.
•  State-of-the-art in visual attention modeling
   A. Borji and L. Itti. PAMI 2012
•  Saliency based on information maximization.
   N. Bruce and J. Tsotsos. NIPS, 2005
•  Global contrast based salient region detection.
   M. M. Cheng, G. X. Zhang, N. J. Mitra, X. Huang, and S. M. Hu. CVPR, 2011
•  The discriminant center-surround hypothesis for bottom-up saliency.
•  D. Gao, V. Mahadevan, and N. Vasconcelos. NIPS 2007
References
•  Context-aware saliency detection.
   S. Goferman, L. Zelnik-Manor, and A. Tal. CVPR, 2010.
•  Graph-based visual saliency.
   J. Harel, C. Koch, and P. Perona. NIPS, 2006
•  Saliency detection: A spectral residual approach.
   X. Hou and L. Zhang. CVPR, 2007
•  A model of saliency based visual attention for rapid scene analysis.
   L. Itti, C. Koch, and E. Niebur. PAMI 1998
•  Learning to predict where humans look.
   T. Judd, K. Ehinger, F. Durand, and A. Torralba. ICCV, 2009
•  Center-surround divergence of feature statistics for salient object detection.
   D. A. Klein and S. Frintrop. ICCV, 2011
•  Saliency detection based on frequency and spatial domain analyses
   J. Li, M. Levine, X. An, and H. He. BMVC 2011
References
•  Learning to detect a salient object
   T. Liu, J. Sun, N. N. Zheng, X. Tang, and H. Y. Shum, CVPR 2007
•  Contrast-based image attention analysis by using fuzzy growing
   Y. Ma and H. Zhang, ACM MM 2003
•  Top-down control of visual attention in object detection
   A. Oliva, A. Torralba, M. Castelhano, and J. Henderson. ICIP 2003
•  Segmenting salient objects from images and videos
   E. Rahtu, J. Kannala, M. Salo, and J. Heikkil., ECCV 2010
•  Computational versus psychophysical image saliency: A comparative
    evaluation study, A. Toet. PAMI 2011
•  Visual attention detection in video sequences using spatiotemporal cues
   Y. Zhai and M. Shah. ACM MM 2006
•  Sun: A bayesian framework for saliency using natural statistics
   L. Zhang, M. Tong, T. Marks, H. Shan, and G. Cottrell. JOV 2008

Más contenido relacionado

La actualidad más candente

Image enhancement in the spatial domain1
Image enhancement in the spatial domain1Image enhancement in the spatial domain1
Image enhancement in the spatial domain1shabanam tamboli
 
Fundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image ComponentsFundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image ComponentsKalyan Acharjya
 
GTSRB Traffic Sign recognition using machine learning
GTSRB Traffic Sign recognition using machine learningGTSRB Traffic Sign recognition using machine learning
GTSRB Traffic Sign recognition using machine learningRupali Aher
 
Face Recognition: From Scratch To Hatch
Face Recognition: From Scratch To HatchFace Recognition: From Scratch To Hatch
Face Recognition: From Scratch To HatchEduard Tyantov
 
Processing of satellite_image_using_digi
Processing of satellite_image_using_digiProcessing of satellite_image_using_digi
Processing of satellite_image_using_digiShanmuga Sundaram
 
Automated Number Plate Recognition
Automated Number Plate RecognitionAutomated Number Plate Recognition
Automated Number Plate RecognitionShwetank Sharma
 
Image segmentation 2
Image segmentation 2 Image segmentation 2
Image segmentation 2 Rumah Belajar
 
DEM generation, Image Matching in Aerial Photogrammetry.pptx
DEM generation, Image Matching in  Aerial Photogrammetry.pptxDEM generation, Image Matching in  Aerial Photogrammetry.pptx
DEM generation, Image Matching in Aerial Photogrammetry.pptxIndraSubedi7
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object TrackingVanya Valindria
 
Computer Vision Structure from motion
Computer Vision Structure from motionComputer Vision Structure from motion
Computer Vision Structure from motionWael Badawy
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionDADAJONJURAKUZIEV
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basicsBrodmann17
 
Image Processing: Spatial filters
Image Processing: Spatial filtersImage Processing: Spatial filters
Image Processing: Spatial filtersA B Shinde
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentationHaitham Ahmed
 
Transfer Learning and Fine-tuning Deep Neural Networks
 Transfer Learning and Fine-tuning Deep Neural Networks Transfer Learning and Fine-tuning Deep Neural Networks
Transfer Learning and Fine-tuning Deep Neural NetworksPyData
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnnSumeraHangi
 
Face detection and recognition using surveillance camera2 edited
Face detection and recognition using surveillance camera2 editedFace detection and recognition using surveillance camera2 edited
Face detection and recognition using surveillance camera2 editedSantu Chall
 

La actualidad más candente (20)

Image enhancement in the spatial domain1
Image enhancement in the spatial domain1Image enhancement in the spatial domain1
Image enhancement in the spatial domain1
 
Object detection
Object detectionObject detection
Object detection
 
Fundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image ComponentsFundamental Steps of Digital Image Processing & Image Components
Fundamental Steps of Digital Image Processing & Image Components
 
SPATIAL FILTER
SPATIAL FILTERSPATIAL FILTER
SPATIAL FILTER
 
GTSRB Traffic Sign recognition using machine learning
GTSRB Traffic Sign recognition using machine learningGTSRB Traffic Sign recognition using machine learning
GTSRB Traffic Sign recognition using machine learning
 
Face Recognition: From Scratch To Hatch
Face Recognition: From Scratch To HatchFace Recognition: From Scratch To Hatch
Face Recognition: From Scratch To Hatch
 
Processing of satellite_image_using_digi
Processing of satellite_image_using_digiProcessing of satellite_image_using_digi
Processing of satellite_image_using_digi
 
Automated Number Plate Recognition
Automated Number Plate RecognitionAutomated Number Plate Recognition
Automated Number Plate Recognition
 
Image segmentation 2
Image segmentation 2 Image segmentation 2
Image segmentation 2
 
DEM generation, Image Matching in Aerial Photogrammetry.pptx
DEM generation, Image Matching in  Aerial Photogrammetry.pptxDEM generation, Image Matching in  Aerial Photogrammetry.pptx
DEM generation, Image Matching in Aerial Photogrammetry.pptx
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
 
Computer Vision Structure from motion
Computer Vision Structure from motionComputer Vision Structure from motion
Computer Vision Structure from motion
 
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object DetectionYou Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
 
Image Processing: Spatial filters
Image Processing: Spatial filtersImage Processing: Spatial filters
Image Processing: Spatial filters
 
License Plate recognition
License Plate recognitionLicense Plate recognition
License Plate recognition
 
Comparison of image segmentation
Comparison of image segmentationComparison of image segmentation
Comparison of image segmentation
 
Transfer Learning and Fine-tuning Deep Neural Networks
 Transfer Learning and Fine-tuning Deep Neural Networks Transfer Learning and Fine-tuning Deep Neural Networks
Transfer Learning and Fine-tuning Deep Neural Networks
 
Image classification using cnn
Image classification using cnnImage classification using cnn
Image classification using cnn
 
Face detection and recognition using surveillance camera2 edited
Face detection and recognition using surveillance camera2 editedFace detection and recognition using surveillance camera2 edited
Face detection and recognition using surveillance camera2 edited
 

Destacado

Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Jia-Bin Huang
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic PapersJia-Bin Huang
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Jia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015Jia-Bin Huang
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialJia-Bin Huang
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB CodeJia-Bin Huang
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucJia-Bin Huang
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Jia-Bin Huang
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation Jia-Bin Huang
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)Jia-Bin Huang
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Jia-Bin Huang
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?Jia-Bin Huang
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideasJia-Bin Huang
 

Destacado (20)

Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013Transformation Guided Image Completion ICCP 2013
Transformation Guided Image Completion ICCP 2013
 
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
Toward Accurate and Robust Cross-Ratio based Gaze Trackers Through Learning F...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic Papers
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorial
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB Code
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiuc
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideas
 

Similar a Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012

Lecture 10 ming yang - face recognition systems
Lecture 10   ming yang - face recognition systemsLecture 10   ming yang - face recognition systems
Lecture 10 ming yang - face recognition systemsmustafa sarac
 
Sample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxSample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxaamnaemir
 
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"..."How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...Edge AI and Vision Alliance
 
Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learningYu Huang
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningWei Yang
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...Tulipp. Eu
 
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...AnuragVijayAgrawal
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelinesBoris Greenberg
 
When Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWhen Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWahyuRahmaniar2
 
Image registraiton in Radiation Therapy
Image registraiton in Radiation TherapyImage registraiton in Radiation Therapy
Image registraiton in Radiation TherapyTalluri Anil Kumar
 
“Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error ““Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error “diannepatricia
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsFatma Sayed Ibrahim
 

Similar a Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012 (20)

Lecture 10 ming yang - face recognition systems
Lecture 10   ming yang - face recognition systemsLecture 10   ming yang - face recognition systems
Lecture 10 ming yang - face recognition systems
 
Sample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptxSample selection and measuring instruments basic consideration of.pptx
Sample selection and measuring instruments basic consideration of.pptx
 
CSU_comp
CSU_compCSU_comp
CSU_comp
 
Aberrometry
AberrometryAberrometry
Aberrometry
 
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"..."How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
 
Anits dip
Anits dipAnits dip
Anits dip
 
Passive stereo vision with deep learning
Passive stereo vision with deep learningPassive stereo vision with deep learning
Passive stereo vision with deep learning
 
Human Pose Estimation by Deep Learning
Human Pose Estimation by Deep LearningHuman Pose Estimation by Deep Learning
Human Pose Estimation by Deep Learning
 
Dip
DipDip
Dip
 
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
 
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
A Comprehensive Analysis on Co-Saliency Detection on Learning Approaches in 3...
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelines
 
phase2 FINAL
phase2 FINALphase2 FINAL
phase2 FINAL
 
When Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial IntelligenceWhen Remote Sensing Meets Artificial Intelligence
When Remote Sensing Meets Artificial Intelligence
 
Image registraiton in Radiation Therapy
Image registraiton in Radiation TherapyImage registraiton in Radiation Therapy
Image registraiton in Radiation Therapy
 
“Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error ““Designing Assistive Technology: A Personal Experience of Trial and Error “
“Designing Assistive Technology: A Personal Experience of Trial and Error “
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain Tumors
 
Part2
Part2Part2
Part2
 
Corneal topography
Corneal topographyCorneal topography
Corneal topography
 
Digital image processing 2
Digital image processing   2Digital image processing   2
Digital image processing 2
 

Más de Jia-Bin Huang

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paperJia-Bin Huang
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXJia-Bin Huang
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and RecognitionJia-Bin Huang
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual trackingJia-Bin Huang
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression EnhancementJia-Bin Huang
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionJia-Bin Huang
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionJia-Bin Huang
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionJia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 

Más de Jia-Bin Huang (13)

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paper
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and Recognition
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture Recognition
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 

Saliency Detection via Divergence Analysis: A Unified Perspective ICPR 2012

  • 1. Saliency Detection via Divergence Analysis Jia-Bin Huang Narendra Ahuja Electrical and Computer Engineering University of Illinois, Urbana-Champaign
  • 2. Jia-Bin Huang Narendra Ahuja jbhuang1@illinois.edu n-ahuja@illinois.edu
  • 3. What this talk is about? 1.  The saliency detection problem 2.  A unifying framework for bottom-up saliency detection algorithms 3.  Ways to improve the performance
  • 4. Look for the unique item Clap when you find it
  • 5.
  • 6.
  • 7.
  • 8. What is Saliency? •  Visual salience (or visual saliency) is the distinct subjective perceptual quality which makes some items in the world stand out from their neighbors and immediately grab our attention Where to look?
  • 9. Example of Stimulus Color Orientation Motion Natural scene
  • 10. Why We Care? •  Adaptive image compression •  Object-of-interest image segmentation •  Automatic image thumbnail •  (Class-independent) Object detection and recognition •  Visual tracking •  Automatic Image collage •  Content-aware image resizing •  Non-photorealistic rendering •  Understanding mechanism of human visual attention
  • 11. Problem Setting •  Input: Image -> Output: Saliency map
  • 12. Design Principles •  Rarity –  [Bruce NIPS 05] [Zhang JOV 08] [Rahtu ECCV 10] [Klein ICCV 11] [Borji CVPR 12] •  Local complexity –  [Kadir IJCV 01] •  Contrast –  [Itti PAMI 98] [Harel NIPS 05] [Ma MM 03] [Achanta CVPR 09] [Achanta ICIP 10] [Cheng CVPR 11] [Goferman CVPR 10] [Perazzi CVPR 12] •  Spectral –  [Hou CVPR 07] [Guo CVPR 08] [Hou PAMI 12] [Li PAMI 12] •  Learning –  [Liu CVPR 07] [Judd ICCV 09] [Oliva ICIP 03] [Torralba Psycho.Rev 06]
  • 13. Complexity! Contrast! Surprise! Spectral! Learning! Information! Rarity! WTH?
  • 14. Which one is better?
  • 15. Main Result •  Most of the bottom-up saliency detection algorithms can be rewritten in the form of divergence between probabilistic distributions learned from center and surround
  • 18. From Center to Surround
  • 19. From Center to Surround
  • 25. How to Choose Support?
  • 26. Experimental Results •  Datasets –  MSRA salient object detection dataset –  1000 groundtruth binary mask are available from [Achanta CVPR 09] •  Evaluation metric –  Precision and recall curves
  • 27. State-of-the-art Saliency Detection Methods –  Center to surround •  AIM [Bruce NIPS 05] SUN [Zhang JOV 08], SW [Rahtu ECCV 10] –  Surround to center •  CA [Goferman CVPR 10], AC [Achanta ICIP 10], FT [Achanta CVPR 09], LC [Zhai-ACMMM 06] –  Symmetrised divergence •  HC, RC, [Cheng CVPR 11], IT [Itti PAMI 98], GB [Harel NIPS 07] –  Spectrum-based •  SR [Hou CVPR 07]
  • 28. •  Rarity-based –  AIM: image specific –  SUN: image indep.
  • 29. •  Rarity-based –  SW: Patch/window based surround + scale space analysis
  • 30. •  Contrast-based –  GB: spatial weighting for adaptive surround
  • 31. •  Contrast-based –  AC: adaptive surround support
  • 32. •  Contrast-based –  RC: region-based center support
  • 33. •  Contrast-based –  FT: color feature –  LC: intensity feature
  • 34. Lessons Learned •  Most of the bottom-up saliency detection algorithms are in fact closely related –  Not exhaustive, e.g., spectral-based methods •  How to improve the performance? –  Richer features –  Less approximation –  Adaptive center/surround support
  • 35. Future Work Input image Saliency Map Error Map Method parameters
  • 36.
  • 37. Questions? •  Comments or questions: Jia-Bin Huang jbhuang1@illinois.edu https://netfiles.uiuc.edu/jbhuang1/www/
  • 38. References •  Saliency Detection via Divergence Analysis: An Unified Perspective J.B. Huang and N. Ahuja. ICPR, 2012 •  Frequency-tuned salient region detection R. Achanta, S. Hemami, F. Estrada, and S. Susstrunk., CVPR 2009 •  Saliency detection using maximum symmetric surround. R. Achanta and S. Susstrunk, ICIP, 2010. •  State-of-the-art in visual attention modeling A. Borji and L. Itti. PAMI 2012 •  Saliency based on information maximization. N. Bruce and J. Tsotsos. NIPS, 2005 •  Global contrast based salient region detection. M. M. Cheng, G. X. Zhang, N. J. Mitra, X. Huang, and S. M. Hu. CVPR, 2011 •  The discriminant center-surround hypothesis for bottom-up saliency. •  D. Gao, V. Mahadevan, and N. Vasconcelos. NIPS 2007
  • 39. References •  Context-aware saliency detection. S. Goferman, L. Zelnik-Manor, and A. Tal. CVPR, 2010. •  Graph-based visual saliency. J. Harel, C. Koch, and P. Perona. NIPS, 2006 •  Saliency detection: A spectral residual approach. X. Hou and L. Zhang. CVPR, 2007 •  A model of saliency based visual attention for rapid scene analysis. L. Itti, C. Koch, and E. Niebur. PAMI 1998 •  Learning to predict where humans look. T. Judd, K. Ehinger, F. Durand, and A. Torralba. ICCV, 2009 •  Center-surround divergence of feature statistics for salient object detection. D. A. Klein and S. Frintrop. ICCV, 2011 •  Saliency detection based on frequency and spatial domain analyses J. Li, M. Levine, X. An, and H. He. BMVC 2011
  • 40. References •  Learning to detect a salient object T. Liu, J. Sun, N. N. Zheng, X. Tang, and H. Y. Shum, CVPR 2007 •  Contrast-based image attention analysis by using fuzzy growing Y. Ma and H. Zhang, ACM MM 2003 •  Top-down control of visual attention in object detection A. Oliva, A. Torralba, M. Castelhano, and J. Henderson. ICIP 2003 •  Segmenting salient objects from images and videos E. Rahtu, J. Kannala, M. Salo, and J. Heikkil., ECCV 2010 •  Computational versus psychophysical image saliency: A comparative evaluation study, A. Toet. PAMI 2011 •  Visual attention detection in video sequences using spatiotemporal cues Y. Zhai and M. Shah. ACM MM 2006 •  Sun: A bayesian framework for saliency using natural statistics L. Zhang, M. Tong, T. Marks, H. Shan, and G. Cottrell. JOV 2008

Notas del editor

  1. Thanks for the introduction. I am Jia-Bin Huang. Today I am going to talk about some interesting facts about saliency detection.
  2. This is a joint work with Prof. Narendra Ahuja at the University of Illinois, Urbana-Champaign
  3. I will start with an overview of the problem. Then, introducing a unifying framework for bottom-up saliency detection algorithms. Later, we suggest ways to improve the performance.
  4. Let’s play a game first. Please clap when you see something different.
  5. Saliency is the distinct subjective quality which makes some items pop out from their neighbors and grab our attention. It guides us where to look.
  6. Here are some examples of stimulus.
  7. The reasons that this problem is important because it can facilitate numerous applications in computer vision and computer graphics.
  8. Here is the problem setting. Given an input image, our goal is to produce a saliency map locating the salient regions.
  9. There have been many design principles that guide the development of saliency detection algorithms. For example, rarity, complexity, contrast, spectral methods, and learning-based approaches.
  10. However, since some of the principles are inspired from psychological experiments, some are from intuitions, and others are purely computational, it’s often difficult to see their relations and more practically, which one is better?
  11. In this paper, we provide the bridges between these methods. Our claim is that most of the bottom-up saliency detection algorithms can be rewritten in the form of divergence between the probabilistic distributions learned from center and surround.
  12. Let’s start with KL-divergence. KL divergence is a contrast function between two probabilistic distributions. It has the following forms. It has many important operational meanings in detection, estimation, and information theory. In the following few slides, I will explicitly show the connections between various models.
  13. If we assume that the center support is a single pixel x_i, the KL-divergence from center to surround will become the so-called Shannon’s self-information. This class of saliency is called rarity-based saliency.
  14. If we take the difference between the self-information of center and surround, it yields the saliency in Rahtu’s paper at ECCV. If we make the assumption of feature channel independence, we can derive the saliency measure in Klein at ICCV.
  15. So how about switch the order? From the likelihood theory we know that the divergence can be rewritten as likelihood function. This forms the basics of contrast-based saliency methods.
  16. Assume that the distribution of the center follows Laplacian distributions. The saliency measure has this form.
  17. Similarly, assuming that the distribution is Gaussian. Using various approximation approaches gives rises to various measures.
  18. Now let’s look at some symmetrised divergence measure. The first one is the symmetric KL divergence. This measure has been widely used from feature selection. In the context of saliency detection, it corresponds to combine local and global rarity. Lambda divergence is another symmetric measure. It corresponds to the mutual information-based saliency.