SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
Fast Sparse Representation with Prototypes
                                                               Jia-Bin Huang and Ming-Hsuan Yang
                                          Electrical Engineering and Computer Science, University of California, Merced
                                                              jbhuang@ieee.org, mhyang@ieee.org

 Overview                                                          Algorithm                                                                                                             Experimental Results
• Find the approximated sparse solution of the linear system      • Original    problem                                                                                                 • Single   Image Super-Resolution
  y = Ax
                                                                                                  min x         1    subject to           y − Fx    2≤ .                          (1)
• Original sparse coding problem: find a sparse solution from a                                        x
  dense matrix                                                    • Dictionary       learning from the K-SVD algorithm
                                                                                                                                K    ni
                                                                                              min         F − DW       2   =                fi ,j − Dwi ,j   2
                                                                                               D,W                     F                                     2
                                                                                                                               i =1 j =1
                                                                                                                        subject to          wi ,j   0≤   S0 ,                     (2)
                                                                  • Approximation           process
                                                                                                                    y ≈ Dwy ≈ DWx.                                                (3)
                                                                            Dwy + ey = DWx + eFx =⇒ D(wy − Wx) = eFx − ey.                                                        (4)
                                                                                         D(wy − Wx) 2 ≤ (s + 1) .                                                                 (5)
                                                                                                        (s + 1)
                                                                                     z 2 = wy − Wx 2 ≤            = .                                                             (6)
• Transformedsparse coding problem: find a sparse solution                                                 (1 − ρ)
 from a sparse matrix                                             • Reduced 1 − norm minimization problem

                                                                                                min x       1       subject to       wy − Wx        2≤ .                          (7)
                                                                                                  x


                                                                                                                                                                                                   Ground-truth         Bicubic          [Yang et al. 08]    Proposed
                                                                   Experimental Results                                                                                                 Table: Execution time and RMSE for sparse coding on four test images (scale factor = 3)
                                                                                                                                                                                                              Image         Original        Proposed
                                                                                                                                                                                                              Method    RMSE Time (s)    RMSE Time (s) Speedup
                                                                  • Face   Recognition
                                                                                                                                                                                                                Girl    5.6684 17.2333   6.2837 1.5564  11.07
                                                                                                                                                                                                              Flower    3.3649 14.9173   3.8710 1.3230  11.27
                                                                                                                                                                                                             Pathenon   12.247 35.1163   13.469 3.1485  11.15
                                                                                                                                                                                                             Raccoon    9.3584 27.9819   10.148 2.3284  12.02
 Applications                                                                                                                                                                           • Human     Pose Estimation
                                                                                     Figure: Sample images from the extended Yale database B
• Sparsity-based  classification/clustering
• Image restoration (e.g., denoising, inpainting, demosaicking,         Table: Recognition accuracy and speed using the Extended Yale database B
  super-resolution)                                                                   Feature Downsampled image            PCA
• Compressive sensing
                                                                                      Method Acc (%) Time (s) Acc (%) Time (s)
                                                                                      Original 93.78       20.08      95.01 13.17
                                                                                    Proposed 91.62          0.51      92.28     0.32                                                                        Figure: Sample images from the INRIA data set
                                                                                    Speed-up           39.4                41.2
 Contributions                                                    • Multi-view       Object Recognition
                                                                                                                                                                                         Table: Comparison on pose estimation accuracy and speed under different number of
                                                                                                                                                                                         prototypes using the INRIA data set
                                                                           Table: Comparison on recognition speed and accuracy on the COIL-100                                                   Number of coefficients        3      6      9     12     15 Original
• Exploitthe fact that signals can be well represented by a         Number of view            8                  16                       8                           16                         Mean error (in degrees) 9.1348 7.9970 7.4406 7.2965 7.1872 6.6513
  sparse linear combination of atom signals                          Feature used
                                                                                              Recognition accuracy                               Execution time
                                                                                     Orig. (%) Ours (%) Orig. (%) Ours (%) Orig. (s) Ours (ms) Speed-up Orig. (s) Ours (ms) Speed-up
                                                                                                                                                                                               Execution time (in seconds) 0.0082 0.0734 0.3663 1.1020 2.3336 24.69
• Reduce the original dense and large problem to a sparse and        Downsample       82.43     80.93      90.01    87.43   6.85        3.2     2140.6    52.73      3.9     13520.5                    Speed-up           3011.0 336.4 67.4     22.4   10.6
                                                                     Downsample       75.08     74.28      84.75    84.00   4.13        3.9     1059.0    48.02      5.2     9234.6
  small problem                                                       PCA: 256        84.56     82.08      91.03    89.22   3.71        3.3     1124.2    29.58      3.8     7784.2
                                                                      PCA: 100        81.23     79.23      90.58    87.72   2.54        3.6     705.6     21.00      5.6     3750.0




Jia-Bin Huang and Ming-Hsuan Yang (EECS, University of California, Merced)                                                                      IEEE Conference on Computer Vision and Pattern Recognition 2010 (CVPR 2010)

Más contenido relacionado

Destacado

Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
Jia-Bin Huang
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
Jia-Bin Huang
 

Destacado (10)

How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic Papers
 
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Lecture 21 - Image Categorization - Computer Vision Spring2015
Lecture 21 - Image Categorization -  Computer Vision Spring2015Lecture 21 - Image Categorization -  Computer Vision Spring2015
Lecture 21 - Image Categorization - Computer Vision Spring2015
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorial
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB Code
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 

Más de Jia-Bin Huang

Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
Jia-Bin Huang
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
Jia-Bin Huang
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
Jia-Bin Huang
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
Jia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
Jia-Bin Huang
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Jia-Bin Huang
 

Más de Jia-Bin Huang (12)

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paper
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and Recognition
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture Recognition
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 

Fast Sparse Representation with Prototypes (CVPR 2010)

  • 1. Fast Sparse Representation with Prototypes Jia-Bin Huang and Ming-Hsuan Yang Electrical Engineering and Computer Science, University of California, Merced jbhuang@ieee.org, mhyang@ieee.org Overview Algorithm Experimental Results • Find the approximated sparse solution of the linear system • Original problem • Single Image Super-Resolution y = Ax min x 1 subject to y − Fx 2≤ . (1) • Original sparse coding problem: find a sparse solution from a x dense matrix • Dictionary learning from the K-SVD algorithm K ni min F − DW 2 = fi ,j − Dwi ,j 2 D,W F 2 i =1 j =1 subject to wi ,j 0≤ S0 , (2) • Approximation process y ≈ Dwy ≈ DWx. (3) Dwy + ey = DWx + eFx =⇒ D(wy − Wx) = eFx − ey. (4) D(wy − Wx) 2 ≤ (s + 1) . (5) (s + 1) z 2 = wy − Wx 2 ≤ = . (6) • Transformedsparse coding problem: find a sparse solution (1 − ρ) from a sparse matrix • Reduced 1 − norm minimization problem min x 1 subject to wy − Wx 2≤ . (7) x Ground-truth Bicubic [Yang et al. 08] Proposed Experimental Results Table: Execution time and RMSE for sparse coding on four test images (scale factor = 3) Image Original Proposed Method RMSE Time (s) RMSE Time (s) Speedup • Face Recognition Girl 5.6684 17.2333 6.2837 1.5564 11.07 Flower 3.3649 14.9173 3.8710 1.3230 11.27 Pathenon 12.247 35.1163 13.469 3.1485 11.15 Raccoon 9.3584 27.9819 10.148 2.3284 12.02 Applications • Human Pose Estimation Figure: Sample images from the extended Yale database B • Sparsity-based classification/clustering • Image restoration (e.g., denoising, inpainting, demosaicking, Table: Recognition accuracy and speed using the Extended Yale database B super-resolution) Feature Downsampled image PCA • Compressive sensing Method Acc (%) Time (s) Acc (%) Time (s) Original 93.78 20.08 95.01 13.17 Proposed 91.62 0.51 92.28 0.32 Figure: Sample images from the INRIA data set Speed-up 39.4 41.2 Contributions • Multi-view Object Recognition Table: Comparison on pose estimation accuracy and speed under different number of prototypes using the INRIA data set Table: Comparison on recognition speed and accuracy on the COIL-100 Number of coefficients 3 6 9 12 15 Original • Exploitthe fact that signals can be well represented by a Number of view 8 16 8 16 Mean error (in degrees) 9.1348 7.9970 7.4406 7.2965 7.1872 6.6513 sparse linear combination of atom signals Feature used Recognition accuracy Execution time Orig. (%) Ours (%) Orig. (%) Ours (%) Orig. (s) Ours (ms) Speed-up Orig. (s) Ours (ms) Speed-up Execution time (in seconds) 0.0082 0.0734 0.3663 1.1020 2.3336 24.69 • Reduce the original dense and large problem to a sparse and Downsample 82.43 80.93 90.01 87.43 6.85 3.2 2140.6 52.73 3.9 13520.5 Speed-up 3011.0 336.4 67.4 22.4 10.6 Downsample 75.08 74.28 84.75 84.00 4.13 3.9 1059.0 48.02 5.2 9234.6 small problem PCA: 256 84.56 82.08 91.03 89.22 3.71 3.3 1124.2 29.58 3.8 7784.2 PCA: 100 81.23 79.23 90.58 87.72 2.54 3.6 705.6 21.00 5.6 3750.0 Jia-Bin Huang and Ming-Hsuan Yang (EECS, University of California, Merced) IEEE Conference on Computer Vision and Pattern Recognition 2010 (CVPR 2010)