CVPR 2009: Object Detection Using a Max-Margin Hough Transform
1. Object Detection Using a Max-Margin Hough Transform. Subhransu Maji and Jitendra Malik, University of California at Berkeley, Berkeley, CA 94720. CVPR 2009, Miami, Florida.
5. Detection Pipeline. B. Leibe, A. Leonardis, and B. Schiele, "Combined object categorization and segmentation with an implicit shape model," 2004. Interest points (e.g. SIFT, GB) → local patches → matched codebook entries (KD tree) → probabilistic voting.
11. Experiment Datasets. ETHZ Shape Dataset (Ferrari et al., ECCV 2006): 255 images over 5 classes (Apple logo, Bottle, Giraffe, Mug, Swan). UIUC Single-Scale Cars Dataset (Agarwal & Roth, ECCV 2002): 1050 training, 170 test images. INRIA Horse Dataset (Jurie & Ferrari): 170 positive + 170 negative images (50 + 50 for training).
13. Learned Weights (ETHZ shape). Naïve Bayes vs. Max-Margin; blue (low) to dark red (high). Naïve Bayes weights are influenced by clutter (rare structures); Max-Margin highlights important parts.
14. Learned Weights (UIUC cars). Naïve Bayes vs. Max-Margin; blue (low) to dark red (high). Max-Margin highlights important parts.
15. Learned Weights (INRIA horses). Naïve Bayes vs. Max-Margin; blue (low) to dark red (high). Max-Margin highlights important parts.
19. Hough Voting + Verification Classifier. ETHZ Shape Dataset: recall @ 0.3 false positives per image. IKSVM was run on the top 30 windows + local search. Baselines: KAS (Ferrari et al., PAMI '08), TPS-RPM (Ferrari et al., CVPR '07). Better-fitting bounding boxes; implicit sampling over aspect ratio.
20. Hough Voting + Verification Classifier. IKSVM was run on the top 30 windows + local search. Our work.
21. Hough Voting + Verification Classifier. UIUC Single-Scale Car Dataset: IKSVM was run on the top 10 windows + local search; 1.7% improvement.
24. Backup Slide: Toy Example. Rare but poor localization vs. rare and good localization.
Editor's Notes
Thank you. Good morning. I am going to present a learning framework for Hough transform based object detection.
We are interested in the task of object detection, where we want to localize an instance of an object in an image. We use an approach based on the Hough transform. Before I go into the details, I will present an overview of the Hough transform, followed by our learning framework. I will then present experimental results and conclude.
Yet another way of doing this is the Hough transform based approach. This is of course an old idea, proposed by Hough for detecting lines more than 50 years ago. Since then it has been generalized to detect parametric shapes like ellipses and circles. Local parts cast votes for the object pose, and the complexity scales linearly with the number of parts times the number of votes.
Recently Leibe and Schiele have extended this framework to object detection. A slide from their Implicit Shape Model framework illustrates the technique. Local parts are patches represented using a dictionary learned from training examples. The position of each codeword is recorded on the training examples to form a distribution of each codeword's location with respect to the object center. For example, the patch corresponding to the head of a person is typically at a fixed vertical offset with respect to the torso, as seen in the bottom-left distribution. At test time, interest points are detected and matched to codebook entries, which vote for the object center. The peaks of the voting space correspond to object locations. Quite simple, but a powerful framework.
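The voting step described above can be sketched in a few lines. This is a minimal illustration with made-up feature locations and vote offsets, not the actual ISM implementation: each matched codebook entry casts weighted votes for the object center at offsets recorded during training, and the detection is the peak of the accumulator.

```python
import numpy as np

# Minimal Hough-voting sketch (hypothetical data): each matched codebook
# entry casts votes for the object center at offsets learned in training.
H, W = 50, 50
acc = np.zeros((H, W))                       # voting space over centers

# (feature location, list of (offset, weight)) pairs — illustrative values
matches = [((20, 20), [((5, 5), 1.0), ((4, 6), 0.5)]),
           ((30, 30), [((-5, -5), 1.0)])]

for (fy, fx), votes in matches:
    for (dy, dx), w in votes:
        cy, cx = fy + dy, fx + dx            # hypothesized object center
        if 0 <= cy < H and 0 <= cx < W:
            acc[cy, cx] += w

# Detection = peak of the voting space.
peak = np.unravel_index(np.argmax(acc), acc.shape)
```

In this toy case two features agree on the center (25, 25), so the accumulator peak lands there; real systems smooth the voting space and use mean-shift or non-max suppression to extract modes.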
Let me introduce the notation for the next few slides. Let C be the learned codebook, let f denote the features and l their locations. The overall detection score is the sum of contributions from each feature f_j observed at a location l_j. Each feature is matched to the codebook as given by p(C_i|f_j); this could simply be 1 for the nearest neighbour and 0 for the other codewords. p(x|O,C_i,l_j) is the distribution of the centroid given the codeword C_i observed at location l_j. The last term, p(O|C_i,l_j), is the confidence (or weight) of the codeword C_i.
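Putting the three terms named above together, the voting score takes the following form (a reconstruction from the terms listed in these notes, following the standard ISM formulation):

```latex
S(O, x) \;=\; \sum_{j} \sum_{i}
\underbrace{p(x \mid O, C_i, l_j)}_{\text{centroid distribution}}\;
\underbrace{p(O \mid C_i, l_j)}_{\text{codeword weight}}\;
\underbrace{p(C_i \mid f_j)}_{\text{codebook match}}
```

The last factor decides which codewords a feature activates, the first spreads that activation over candidate centers x, and the middle factor is exactly the per-codeword weight that the max-margin framework learns.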
Learning codeword weights in the context of the Hough transform has not been well addressed in the literature. In an earlier talk today we saw a way of learning discriminative dictionaries for the Hough transform. However, in situations where the codebook is fixed, we would like to learn the importance of each codeword. That is, given a codebook and the posterior distribution of the object center for each codeword, we would like to learn weights so that the Hough transform detector has the best detection rate. What we show is that these weights can be learned optimally using convex optimization, leading to better detection rates than uniform weights and even a simple learning scheme.
Assign each codeword a weight proportional to the relative frequency of the object. We call these the naïve Bayes weights. (Read from slides.)
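A one-line sketch of the naïve Bayes baseline described above, using made-up activation counts: each codeword is weighted by how often it fires on the object relative to how often it fires at all, an empirical estimate of p(O|C_i).

```python
import numpy as np

# Hypothetical counts of how often each of three codewords fires
# on object instances vs. on background during training.
pos_counts = np.array([30.0, 50.0, 5.0])   # activations on the object
neg_counts = np.array([2.0, 45.0, 40.0])   # activations on background

# Naive Bayes weight = relative frequency of the object given the codeword.
w_nb = pos_counts / (pos_counts + neg_counts)
```

Note that this scheme looks at each codeword in isolation, which is exactly why it cannot distinguish a rare-but-poorly-localized codeword from a rare-and-well-localized one; the max-margin formulation fixes that by optimizing the joint detection score.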
If you look at the equation of the Hough transform, you realize that the overall score is linear in the codeword weights, assuming location invariance of the object (i.e. the object can appear anywhere in the image). Thus the score is a dot product of the weight vector and an activation vector. The activations are independent of the weights given the features and their locations. This suggests a learning scheme which finds weights that increase the score at positive locations over negative ones. We formalize this in the next slide.
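Since the score is linear in the weights, the learning problem reduces to fitting a linear max-margin classifier over activation vectors with a non-negativity constraint on the weights. The sketch below uses toy activations and a plain projected sub-gradient descent on the hinge loss; the paper solves the equivalent convex program with an off-the-shelf solver, so treat the data and optimizer here as illustrative assumptions.

```python
import numpy as np

# Toy activation vectors: rows = candidate locations, cols = codewords.
# Codeword 0 fires mostly on the object; codeword 1 fires everywhere.
A_pos = np.array([[1.0, 0.8], [0.9, 1.0]])   # activations at true object centers
A_neg = np.array([[0.1, 0.9], [0.2, 0.8]])   # activations at background peaks

X = np.vstack([A_pos, A_neg])
y = np.array([1.0, 1.0, -1.0, -1.0])

# Max-margin weights via projected sub-gradient descent on the hinge loss,
# keeping the codeword weights non-negative (w >= 0).
w, b, lam, lr = np.zeros(X.shape[1]), 0.0, 0.01, 0.1
for _ in range(500):
    margins = y * (X @ w + b)
    mask = margins < 1                               # margin violators
    grad_w = lam * w - (y[mask][:, None] * X[mask]).sum(axis=0)
    grad_b = -y[mask].sum()
    w = np.maximum(w - lr * grad_w, 0.0)             # project onto w >= 0
    b -= lr * grad_b

score = X @ w + b
```

The learned weights concentrate on the discriminative codeword and suppress the one that fires equally on background, which is the behaviour the naïve Bayes baseline misses.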
We perform experiments on 3 datasets (ETHZ, UIUC cars and INRIA horses)
Our HT detector is based on GB descriptors (read from slide), and correct detections are counted using the PASCAL criterion, i.e. an overlap greater than 0.5.
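The PASCAL overlap criterion mentioned above is intersection-over-union between the predicted and ground-truth boxes. A minimal sketch (function names are ours, boxes are (x1, y1, x2, y2)):

```python
def iou(a, b):
    """Intersection-over-union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def is_correct_detection(pred, gt, thresh=0.5):
    # PASCAL criterion: count a detection as correct if IoU > threshold.
    return iou(pred, gt) > thresh
```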
To illustrate the idea, consider a toy example. We are trying to detect squares, where the negative examples are parallel lines as shown. We have four kinds of codewords: tips, vertical edges, horizontal edges, and corners. Both corners and horizontal edges occur on the positive example only; however, let's assume that corners are easy to localize while a horizontal edge can appear anywhere. The NB scheme assigns equal weights to both of these, whereas our framework distinguishes them correctly, as seen in the table of weights. The final scores on the positive and negative examples for all the schemes are shown, and one can see that M2HT achieves the maximum separation.