SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Callisto – A Content-based Tag
Recommendation Tool
    M. Lux, A. Pitman, and O. Marques
What does Callisto do?
• Given an image and one or more start tags
• Callisto finds ranked tag recommendations

1. Based on our model (NCP)
2. Based on statistical analysis (Stat)
What are the benefits of NCP & Callisto?

• Different tags are suggested.
• Tags are re-ranked based on visual content.
  – Consequently:
     • With the NCP model, it is common to see tags that are
       highly related to visual features being suggested if such
       features are there, and not suggested if those features
       are missing.
        – E.g.: sunset is not suggested if typical colors of sunsets are
          missing in the image.
The Application

  Image to be tagged




                       Start tag(s)

Low-level
features used



Suggestions
Ranked suggested tags

Suggestions by our content-based model (NCP)




Suggestions based on tag co-occurrence (baseline)
Use Case: Beach

 Start tag: beach




• Suggestions by both models
are almost the same
• Both feature good quality
suggestions
Use Case: Beach

   Same start tag, different photo




• Suggestions differ
• NCP has different tags to offer
Use Case: Ocean

 Start tag: ocean




• NCP suggests clouds
Use Case: Juggling

 Start tag: juggling




• NCP ranks fire first
• NCP doesn‘t include balls in
the list, which is good, since
there are no balls involved
Use Case: Juggling

 Start tag: juggling




• NCP suggests portrait
and people
• NCP doesn‘t suggest fire
Use Case: Juggling girl

 Start tags: juggling girl




• NCP suggests woman
• NCP ranks people higher
Performance issues
• Callisto has to download images and tags for
  suggestions, which is slow.
• Callisto caches downloads, so next time (with
  the same start tag) it is much faster.
• The number of downloaded photos is critical.
  – 28 works fine and is not too slow
  – 100 is much better, but downloading takes forever
Live demo
• Keep your fingers crossed…

Más contenido relacionado

Más de dermotte

LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
dermotte
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimedia
dermotte
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRe
dermotte
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
dermotte
 

Más de dermotte (10)

Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015Invited Talk OAGM Workshop Salzburg, May 2015
Invited Talk OAGM Workshop Salzburg, May 2015
 
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
 
CBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in MultimediaCBMI 2013 Presentation: User Intentions in Multimedia
CBMI 2013 Presentation: User Intentions in Multimedia
 
Content based image retrieval with LIRe
Content based image retrieval with LIReContent based image retrieval with LIRe
Content based image retrieval with LIRe
 
Ohne LIRe keine Bildsuche
Ohne LIRe keine BildsucheOhne LIRe keine Bildsuche
Ohne LIRe keine Bildsuche
 
User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."User Intentions or "The other end of the camera ..."
User Intentions or "The other end of the camera ..."
 
Visual Information Retrieval
Visual Information RetrievalVisual Information Retrieval
Visual Information Retrieval
 
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
 
Power Laws Popularity And Interestingness
Power Laws Popularity And InterestingnessPower Laws Popularity And Interestingness
Power Laws Popularity And Interestingness
 
Aspects of broad folksonomies
Aspects of broad folksonomiesAspects of broad folksonomies
Aspects of broad folksonomies
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Callisto: Content Based Tag Recommendation for Images

  • 1. Callisto – A Content-based Tag Recommendation Tool M. Lux, A. Pitman, and O. Marques
  • 2. What does Callisto do? • Given an image and one or more start tags • Callisto finds ranked tag recommendations 1. Based on our model (NCP) 2. Based on statistical analysis (Stat)
  • 3. What are the benefits of NCP & Callisto? • Different tags are suggested. • Tags are re-ranked based on visual content. – Consequently: • With the NCP model, it is common to see tags that are highly related to visual features being suggested if such features are there, and not suggested if those features are missing. – E.g.: sunset is not suggested if typical colors of sunsets are missing in the image.
  • 4. The Application Image to be tagged Start tag(s) Low-level features used Suggestions
  • 5. Ranked suggested tags Suggestions by our content-based model (NCP) Suggestions based on tag co-occurrence (baseline)
  • 6. Use Case: Beach Start tag: beach • Suggestions by both models are almost the same • Both feature good quality suggestions
  • 7. Use Case: Beach Same start tag, different photo • Suggestions differ • NCP has different tags to offer
  • 8. Use Case: Ocean Start tag: ocean • NCP suggests clouds
  • 9. Use Case: Juggling Start tag: juggling • NCP ranks fire first • NCP doesn‘t include balls in the list, which is good, since there are no balls involved
  • 10. Use Case: Juggling Start tag: juggling • NCP suggests portrait and people • NCP doesn‘t suggest fire
  • 11. Use Case: Juggling girl Start tags: juggling girl • NCP suggests woman • NCP ranks people higher
  • 12. Performance issues • Callisto has to download images and tags for suggestions, which is slow. • Callisto caches downloads, so next time (with the same start tag) it is much faster. • The number of downloaded photos is critical. – 28 works fine and is not too slow – 100 is much better, but downloading takes forever
  • 13. Live demo • Keep your fingers crossed…