SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
A Benchmark for Image
Retrieval using
Distributed Systems over
the Internet: BIRDS-I
Neil J. Gunther
Performance Dynamics Consulting

Giordano Beretta
Hewlett-Packard Laboratories
                                                                               www
       13th IS&T/SPIE Symposium on Electronic Imaging—Science and Technology
Systems & scalability                                                                          2
            The Internet is a visual medium

•     A picture is worth ten thousand words…
      •     A screenful of text requires 24×80 = 1,920 bytes
      •     A VGA size image requires 640×480×3 = 921,600 bytes

•     …but requires almost 500 times as much bandwidth…
      •     data compression is essential for images on the Internet
      •     which compression is best for my image?

•     Need to maintain user response time within acceptable
      range when aggregate Web traffic increases

•     The Web is a system of mostly unspecified elements

•     Need to measure the scalability of application services


N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001       4311–30 — Benchathlon
User experience expectation                                                                     3
            Internet bandwidth, processing power, data size

•     Today’s user experiences set very high expectations for
      imaging on the Internet
      •     many users access the Internet in the office on fast workstations
            connected over fast links to the Internet
      •     at home users often have fast graphics controllers for playing realistic
            computer games
      •     increasingly, private homes are equipped with fast connections over DSL,
            cable modem, satellite, …
      •     the latest video game machines are very powerful graphic workstations
      •     the new generation grew up on video games & WWW

•     People have about 2T bytes of own data and want to
      access all of the Web
      •     at work, they expect concise answers immediately on multiple media


N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001        4311–30 — Benchathlon
The real world                                                                              4
            The nomadic workforce

•     The new working world is mobile and wireless
      •     a comprehensive fast fiber optics network provides a global backbone
      •     the “last mile” is wireless
      •     computers are becoming wearable (PDAs)

•     Displays and power are the main two open problems

•     To conserve power, wireless devices will have low effective
      transmission bandwidth and small display areas
      •     humans generate 80 W while sleeping and 400 W walking
      •     thermo-generator in a shirt can harvest 5 W
      •     piezo-generator in a shoe heel can produce 8 W

•     Concomitantly the new users are impatient

N.J. Gunther & G.B. Beretta    EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Image retrieval                                                                            5
            Text or contents based

•     Text-based image retrieval: images are annotated and a
      database management system is used to perform image
      retrieval on the annotation
      •     drawback 1: labor required to manually annotate the images
      •     drawback 2: imprecision in the annotation process

•     Content-based image retrieval systems overcome these
      problems by indexing the images according to their visual
      content, such as color, texture, etc.

•     A goal in CBIR research is to design representations that
      correlate well with the human visual system



N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Retrieval metrics                                                                          6
            Exact queries are not possible for images (nor text)

•     Recall (Sensitivity) = Number of relevant items retrieved /
      Number of relevant items in database

•     Precision (Specificity) = Number of relevant items
      retrieved / Number of items retrieved

•     CBIR algorithms must make a compromise between these
      two metrics: broad general vs. narrow specific query
      formulation

•     CBIR algorithms tend to be very imprecise…

•     …the result of a query requires ongoing iterative
      refinement (Leung, Viper)

N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Ground truth                                                                                 7
•     Precision and recall are determined with the aid of a data
      structure known as the ground truth
      •     embodies the relevance judgments for queries
      •     done only for the ground truth not for each image as in text based IR

•     Two problems
      •     appropriate image categories must be defined
      •     images must be categorized according to those definitions

•     Challenge: scalability
      •     gestalt approach
      •     decoupled ontologies
      •     categorization supervised by experts
      •     ground truth data structure created by program




N.J. Gunther & G.B. Beretta    EI 2001— San José, 25 January 2001    4311–30 — Benchathlon
Run rules                                                                                       8

•     The benchmark is run by executing scripts
      •     no human involved at query time
      •     aspects of CBIR applications such as the readability of the result or the
            user interface of the application are not the subject of this benchmark

•     All files, including the image collection, the versioned
      ground truth files, and the scripts for each benchmark
      must be freely accessible on the World Wide Web

•     An attribute of the personalized Web experience is near-
      instantaneous response, including the retrieval of images
      •     any CBIR benchmark must provide metrics for response time performance




N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001        4311–30 — Benchathlon
Benchmark metrics                                                                          9

•     BIRDS-I uses a normalized rank similar to that proposed by
      Müller et al., but with the additional feature of penalizing
      missed images


•     A single leading performance indicator should be
      provided with any system benchmark otherwise, a less
      reliable ad hoc metric simply will get invented for making
      first-order comparisons between CBIR benchmark results




N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Window optimization problem                                                                   10
•     A scoring window W(q) is associated with the query q
      such that the returned images contained in W(q) are
      ranked according to an index r = 1, 2, …, W
                                              W(q)




                                  3
                         1    2           4          5   6    7     8



• Number of images returned is unknown
      •     must select a search window of size W(q) proportioned to each query's
            ground truth
      •     determining the size for W(q) is an optimization problem
      •     should look for a (piecewise) continuous convex function from which to
            select the values for W(q)
      •     family of inverted parabolic functions



N.J. Gunther & G.B. Beretta           EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Optimal retrieval window                                                                                                       11

                                        200
                                              W
                                                              Wmpeg

                                                                                                   W = 2·G
                                        150
                  Scoring Window Size




                                                                          W(2,1)
                                                                                                   W(1,2)

                                        100


                                                                                                   W=G
                                                                          W(1,1)
                                         50



                                                                                                             G
                                          0
                                              0   20             40                60         80             100
                                                              Number of Ground Truth Images




the MPEG–7 function is labeled Wmpeg

N.J. Gunther & G.B. Beretta                       EI 2001— San José, 25 January 2001                        4311–30 — Benchathlon
Generalized class of convex scoring fns                                                                                                12

                                           400
                                                 W                                                         W(2,2)

                                           350


                                           300


                                           250
                     Scoring Window Size




                                                                                                           W(1,2)
                                           200


                                           150
                                                                                                           W(2,1)

                                           100


                                            50
                                                                                                  W(1,1)
                                                                                                                    G
                                             0
                                                                             100
                                                            50                                   150            200
                                                                 Number of Ground Truth Images




N.J. Gunther & G.B. Beretta                          EI 2001— San José, 25 January 2001                             4311–30 — Benchathlon
Other CBIR benchmark activities                                                       13

•     MPEG-7 — B.S. Manjunath

•     IEEE Multimedia — Clement Leung, Horace Ip

•     Viper Team — Thierry Pun et al.

•     Benchathlon




N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001   4311–30 — Benchathlon
Summary                                                                                    14
•     BIRDS-I benchmark has been designed with the trend
      toward the use of small, personalized, wireless-networked
      systems clearly in mind
      •     personalization with respect to the Web implies heterogeneous
            collections of images and this, in turn, dictates certain constraints on how
            these images can be organized and the type of retrieval accuracy
            measures that can be applied to CBIR performance

•     BIRDS-I only requires controlled human intervention for
      compilation of the image collection, none for the ground
      truth generation & the assessment of retrieval accuracy
      •     benchmark image collections need to be evolved incrementally toward
            millions of images and that scaleup can only be achieved through the use
            of computer-aided compilation
      •     Two concepts we introduced into the scoring metric are a tightly
            optimized image-ranking window, and a penalty for missed images, each
            of which is important for the automated benchmarking of large-scale
            CBIR

N.J. Gunther & G.B. Beretta   EI 2001— San José, 25 January 2001        4311–30 — Benchathlon

Más contenido relacionado

Más de Giordano Beretta

Towards Print Production Simulation
Towards Print Production SimulationTowards Print Production Simulation
Towards Print Production SimulationGiordano Beretta
 
The numerical stability of LUT-based color transformations
The numerical stability of LUT-based color  transformationsThe numerical stability of LUT-based color  transformations
The numerical stability of LUT-based color transformationsGiordano Beretta
 
Color naming: color scientists do it between Munsell Sheets of Color
Color naming: color scientists do it between  Munsell Sheets of ColorColor naming: color scientists do it between  Munsell Sheets of Color
Color naming: color scientists do it between Munsell Sheets of ColorGiordano Beretta
 
Towards robust categorical colour perception
Towards robust categorical colour perceptionTowards robust categorical colour perception
Towards robust categorical colour perceptionGiordano Beretta
 
Trends in color imaging on the Internet
Trends in color imaging on the InternetTrends in color imaging on the Internet
Trends in color imaging on the InternetGiordano Beretta
 
Image Processing For Color Facsimile
Image Processing For Color FacsimileImage Processing For Color Facsimile
Image Processing For Color FacsimileGiordano Beretta
 
How The Internet Will Determine the Future of Publishing
How The Internet Will Determine the Future of PublishingHow The Internet Will Determine the Future of Publishing
How The Internet Will Determine the Future of PublishingGiordano Beretta
 
Color Imaging on the Internet
Color Imaging on the InternetColor Imaging on the Internet
Color Imaging on the InternetGiordano Beretta
 
Personalized Public Health Campaigns
Personalized Public Health CampaignsPersonalized Public Health Campaigns
Personalized Public Health CampaignsGiordano Beretta
 
Cognitive Aspects of Color
Cognitive Aspects of ColorCognitive Aspects of Color
Cognitive Aspects of ColorGiordano Beretta
 

Más de Giordano Beretta (13)

Towards Print Production Simulation
Towards Print Production SimulationTowards Print Production Simulation
Towards Print Production Simulation
 
The numerical stability of LUT-based color transformations
The numerical stability of LUT-based color  transformationsThe numerical stability of LUT-based color  transformations
The numerical stability of LUT-based color transformations
 
Color naming: color scientists do it between Munsell Sheets of Color
Color naming: color scientists do it between  Munsell Sheets of ColorColor naming: color scientists do it between  Munsell Sheets of Color
Color naming: color scientists do it between Munsell Sheets of Color
 
Towards robust categorical colour perception
Towards robust categorical colour perceptionTowards robust categorical colour perception
Towards robust categorical colour perception
 
Trends in color imaging on the Internet
Trends in color imaging on the InternetTrends in color imaging on the Internet
Trends in color imaging on the Internet
 
Image Processing For Color Facsimile
Image Processing For Color FacsimileImage Processing For Color Facsimile
Image Processing For Color Facsimile
 
How The Internet Will Determine the Future of Publishing
How The Internet Will Determine the Future of PublishingHow The Internet Will Determine the Future of Publishing
How The Internet Will Determine the Future of Publishing
 
The Journal Impact Factor
The Journal Impact FactorThe Journal Impact Factor
The Journal Impact Factor
 
Color Imaging on the Internet
Color Imaging on the InternetColor Imaging on the Internet
Color Imaging on the Internet
 
Personalized Public Health Campaigns
Personalized Public Health CampaignsPersonalized Public Health Campaigns
Personalized Public Health Campaigns
 
Introduction to MPEG21
Introduction to MPEG21Introduction to MPEG21
Introduction to MPEG21
 
Understanding Color
Understanding ColorUnderstanding Color
Understanding Color
 
Cognitive Aspects of Color
Cognitive Aspects of ColorCognitive Aspects of Color
Cognitive Aspects of Color
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 

Último (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

A Benchmark for Image Retrieval using Distributed Systems over the Internet: BIRDS-I

  • 1. A Benchmark for Image Retrieval using Distributed Systems over the Internet: BIRDS-I Neil J. Gunther Performance Dynamics Consulting Giordano Beretta Hewlett-Packard Laboratories www 13th IS&T/SPIE Symposium on Electronic Imaging—Science and Technology
  • 2. Systems & scalability 2 The Internet is a visual medium • A picture is worth ten thousand words… • A screenful of text requires 24×80 = 1,920 bytes • A VGA size image requires 640×480×3 = 921,600 bytes • …but requires almost 500 times as much bandwidth… • data compression is essential for images on the Internet • which compression is best for my image? • Need to maintain user response time within acceptable range when aggregate Web traffic increases • The Web is a system of mostly unspecified elements • Need to measure the scalability of application services N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 3. User experience expectation 3 Internet bandwidth, processing power, data size • Today’s user experiences set very high expectations for imaging on the Internet • many users access the Internet in the office on fast workstations connected over fast links to the Internet • at home users often have fast graphics controllers for playing realistic computer games • increasingly, private homes are equipped with fast connections over DSL, cable modem, satellite, … • the latest video game machines are very powerful graphic workstations • the new generation grew up on video games & WWW • People have about 2T bytes of own data and want to access all of the Web • at work, they expect concise answers immediately on multiple media N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 4. The real world 4 The nomadic workforce • The new working world is mobile and wireless • a comprehensive fast fiber optics network provides a global backbone • the “last mile” is wireless • computers are becoming wearable (PDAs) • Displays and power are the main two open problems • To conserve power, wireless devices will have low effective transmission bandwidth and small display areas • humans generate 80 W while sleeping and 400 W walking • thermo-generator in a shirt can harvest 5 W • piezo-generator in a shoe heel can produce 8 W • Concomitantly the new users are impatient N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 5. Image retrieval 5 Text or contents based • Text-based image retrieval: images are annotated and a database management system is used to perform image retrieval on the annotation • drawback 1: labor required to manually annotate the images • drawback 2: imprecision in the annotation process • Content-based image retrieval systems overcome these problems by indexing the images according to their visual content, such as color, texture, etc. • A goal in CBIR research is to design representations that correlate well with the human visual system N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 6. Retrieval metrics 6 Exact queries are not possible for images (nor text) • Recall (Sensitivity) = Number of relevant items retrieved / Number of relevant items in database • Precision (Specificity) = Number of relevant items retrieved / Number of items retrieved • CBIR algorithms must make a compromise between these two metrics: broad general vs. narrow specific query formulation • CBIR algorithms tend to be very imprecise… • …the result of a query requires ongoing iterative refinement (Leung, Viper) N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 7. Ground truth 7 • Precision and recall are determined with the aid of a data structure known as the ground truth • embodies the relevance judgments for queries • done only for the ground truth not for each image as in text based IR • Two problems • appropriate image categories must be defined • images must be categorized according to those definitions • Challenge: scalability • gestalt approach • decoupled ontologies • categorization supervised by experts • ground truth data structure created by program N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 8. Run rules 8 • The benchmark is run by executing scripts • no human involved at query time • aspects of CBIR applications such as the readability of the result or the user interface of the application are not the subject of this benchmark • All files, including the image collection, the versioned ground truth files, and the scripts for each benchmark must be freely accessible on the World Wide Web • An attribute of the personalized Web experience is near- instantaneous response, including the retrieval of images • any CBIR benchmark must provide metrics for response time performance N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 9. Benchmark metrics 9 • BIRDS-I uses a normalized rank similar to that proposed by Müller et al., but with the additional feature of penalizing missed images • A single leading performance indicator should be provided with any system benchmark otherwise, a less reliable ad hoc metric simply will get invented for making first-order comparisons between CBIR benchmark results N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 10. Window optimization problem 10 • A scoring window W(q) is associated with the query q such that the returned images contained in W(q) are ranked according to an index r = 1, 2, …, W W(q) 3 1 2 4 5 6 7 8 • Number of images returned is unknown • must select a search window of size W(q) proportioned to each query's ground truth • determining the size for W(q) is an optimization problem • should look for a (piecewise) continuous convex function from which to select the values for W(q) • family of inverted parabolic functions N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 11. Optimal retrieval window 11 200 W Wmpeg W = 2·G 150 Scoring Window Size W(2,1) W(1,2) 100 W=G W(1,1) 50 G 0 0 20 40 60 80 100 Number of Ground Truth Images the MPEG–7 function is labeled Wmpeg N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 12. Generalized class of convex scoring fns 12 400 W W(2,2) 350 300 250 Scoring Window Size W(1,2) 200 150 W(2,1) 100 50 W(1,1) G 0 100 50 150 200 Number of Ground Truth Images N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 13. Other CBIR benchmark activities 13 • MPEG-7 — B.S. Manjunath • IEEE Multimedia — Clement Leung, Horace Ip • Viper Team — Thierry Pun et al. • Benchathlon N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon
  • 14. Summary 14 • BIRDS-I benchmark has been designed with the trend toward the use of small, personalized, wireless-networked systems clearly in mind • personalization with respect to the Web implies heterogeneous collections of images and this, in turn, dictates certain constraints on how these images can be organized and the type of retrieval accuracy measures that can be applied to CBIR performance • BIRDS-I only requires controlled human intervention for compilation of the image collection, none for the ground truth generation & the assessment of retrieval accuracy • benchmark image collections need to be evolved incrementally toward millions of images and that scaleup can only be achieved through the use of computer-aided compilation • Two concepts we introduced into the scoring metric are a tightly optimized image-ranking window, and a penalty for missed images, each of which is important for the automated benchmarking of large-scale CBIR N.J. Gunther & G.B. Beretta EI 2001— San José, 25 January 2001 4311–30 — Benchathlon