SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
CERTH @ MediaEval 2011 Social
Event Detection Task
Symeon Papadopoulos, Christos Zigkolis, Yiannis
Kompatsiaris, Athena Vakali



Pisa, 1-2 September 2011
The problem
•   Identify social events in tagged photos collections:
    –   Challenge 1: Soccer matches @ Barcelona, Rome
    –   Challenge 2: Events @ Paradiso (Amsterdam) and
                       Parc del Forum (Barcelona)

•   Alternative formulation:
    –   For each photo of the collection answer the questions:
        Q1. Is this photo related to a social event of the given types?
        Q2. If yes, to which event is it related?
    –   Points to classification and clustering as methods to
        address the problem.


                                                                          2
Approach

  Q1

  Q2


 Q1 / Q2

           3
Photo Filtering (1)
• City classification
   – If geo-tagging available (~20%), use it  simple
     nearest-neighbour classifier
   – If not, match against city-specific tag models:
       • Created from processing independent geo-tagged
         photo collections         TAG MODEL SAMPLES
 Amsterdam (74)   Barcelona (57)   London (89)      Paris (51)   Rome (42)
 amsterdam        barcelona        london           paris        rome
 netherlands      catalunya        uk               france       italy
 holland          catalonia        united kingdom   francia      vaticano
 nederland        españa           great britain    versailles   italia
 ….               ….               ….               ….           ….

                                                                             4
Photo Filtering (2)
• Soccer/Venue classification
    – In the case of venue classification, use geo-tagging
      information if available.
    – Match against soccer/venue tag model:
         • Parameter (cf. evaluation)
                                                TAG MODEL SAMPLES (baseline)
Soccer (53, m1,b)                     Paradiso (6, m2,b)   Parc del Forum (8, m2,b)
soccer                                paradiso             parc del forum
football       names of Spanish FCs   concert              primavera sound
           +
goal           names of Italian FCs   festival             concert
goalkeeper                            gig                  festival
…                                     live music           …
                                                        +
                          domain           names of scheduled bands (last.fm)
                         knowledge                                              5
Event Partitioning

• Very simple implementation:
  – Find all unique dates of photos that “passed” the
    first filtering step.
  – For each date, find all associated photos and split
    them into groups based on the city they are
    classified (same classifier as in Step 1).
  – Consider the resulting groups of photos, as the set
    of events.


                                                      6
Event Expansion
• Expand in three ways:
  – Photos having the same owner as one of the
    owners in the event & captured at the same date.
  – Photos captured at the same location (<200m)
    with the event center & at the same date (only for
    geo-tagged photos)
  – Photos belonging to the same cluster (by use of
    method [1]) & having the same owner as one of
    the owners in the event (parameter: cluster type)
             [1] S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali. “Cluster-based
             Landmark and Event Detection on Tagged Photo Collections”. In IEEE Multimedia
             Magazine 18(1), pp. 52-63, 2011
                                                                                       7
Evaluation (1)
                                  Challenge 1




Notation
Parameter 1 (p1): m1,b (baseline tag model), m1,+ (extended soccer tag model)
Parameter 2 (p2): tt (use photo title + tags), ttd (use photo description + tt)
Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering)




                                                                                       8
Evaluation (2)
                                  Challenge 2




Notation
Parameter 1 (p1): m2,b (baseline tag model), m2,+ (extended venue tag model)
Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering),
                    H (hybrid clustering)

m2,+ was created by adding to baseline the names of the bands that played in these
          venues in the same month (collected from last.fm API)

                                                                                        9
Failure examples (1)
C1 - Run1 / False positives
3559542192                            3618132279                         3580841609




Title: AVUÍ SOM 77.331                Title: Sant Pere                  Title: roma 09.
Tags: …, Campions, Trophy,            Tags: Barcelone, Barcelona,       Tags: rome, italy coliseum,
campnou, soccer, football,            Night Ambiance, Light             palatino, chuch, soccer,
caosasuna, barça, fiesta, …                                             statues, art

Many of the photo tags                Captured at the same              Just one of the tags
are related to soccer and             date and in the vicinity of       (soccer) is related to
even to a soccer event                the event.                        soccer.
(fiesta, champions).

                              Most of the false positives were due to the expansion step
                              (i.e. same day + close by, or same day + same user)
                                                                                                   10
Failure examples (2)
C1 - Run2 / False negatives
3559542192                             3571654936                         3583033760




Title: near Tor di Quinto,             Title: Barcelona v.
Latium, Italy                                                            Title: DSC_0029
                                       Manchester United
Tags: N/A                                                                Tags: FC Barcelona Fiesta Tri
                                       Tags: Sigma 10-20mm, F4-5.6
                                                                         Campions
Description: s.s. lazio wins           EX DC HSM, barcelona, spain,
the coppa italia                       moo2
Here the event                         The information could be          Event information is
information is only                    inferred from title if our tag    encoded in a single tag,
present in the photo                   model contained FC names          but we don’t tokenize
description.                           from different countries.         tags, so we miss it.

                               Most of the false negatives were due to failure in matching
                               the textual metadata of photos to the soccer tag model.            11
Discussion (1)
• Most important factor:
   – a good tag model to be used for classification


• Marginal contribution of clustering:
   – expansion by spatio-temporal metadata already captures
     most related photos
   – tag-based clusters tend to include many of the photos of
     the same user at the same date
   – visual clusters did not yield further improvements as one
     would hope (at least with employed visual similarity
     measure: 500 feature vector from clustering SIFT features)


                                                             12
Discussion (2)
• Future action: study in detail failure cases and
  make necessary modifications to approach
• Ways to improve:
  – better topic/entity classification methods
     • better/richer tag models + text matching methods
     • more sophisticated methods: e.g. SVMs, relational
       learning + more discriminative features (text, visual,
       social)
  – more elaborate city classification methods or even
    precise geo-tagging methods

                                                                13
Questions




            14

Más contenido relacionado

Más de Symeon Papadopoulos

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...Symeon Papadopoulos
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionSymeon Papadopoulos
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationSymeon Papadopoulos
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Symeon Papadopoulos
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingSymeon Papadopoulos
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSymeon Papadopoulos
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualitySymeon Papadopoulos
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentSymeon Papadopoulos
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetSymeon Papadopoulos
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionSymeon Papadopoulos
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterSymeon Papadopoulos
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Symeon Papadopoulos
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Symeon Papadopoulos
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceSymeon Papadopoulos
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Symeon Papadopoulos
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsSymeon Papadopoulos
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsSymeon Papadopoulos
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Symeon Papadopoulos
 

Más de Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their Detection
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering Localization
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact Tracing
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air Quality
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media Content
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the Internet
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on Twitter
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016
 
Multimedia Privacy
Multimedia PrivacyMultimedia Privacy
Multimedia Privacy
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging Performance
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News Professionals
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online Discussions
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015
 

Último

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 

Último (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 

CERTH @ MediaEval 2011 Social Event Detection Task

  • 1. CERTH @ MediaEval 2011 Social Event Detection Task Symeon Papadopoulos, Christos Zigkolis, Yiannis Kompatsiaris, Athena Vakali Pisa, 1-2 September 2011
  • 2. The problem • Identify social events in tagged photos collections: – Challenge 1: Soccer matches @ Barcelona, Rome – Challenge 2: Events @ Paradiso (Amsterdam) and Parc del Forum (Barcelona) • Alternative formulation: – For each photo of the collection answer the questions: Q1. Is this photo related to a social event of the given types? Q2. If yes, to which event is it related? – Points to classification and clustering as methods to address the problem. 2
  • 3. Approach Q1 Q2 Q1 / Q2 3
  • 4. Photo Filtering (1) • City classification – If geo-tagging available (~20%), use it  simple nearest-neighbour classifier – If not, match against city-specific tag models: • Created from processing independent geo-tagged photo collections TAG MODEL SAMPLES Amsterdam (74) Barcelona (57) London (89) Paris (51) Rome (42) amsterdam barcelona london paris rome netherlands catalunya uk france italy holland catalonia united kingdom francia vaticano nederland españa great britain versailles italia …. …. …. …. …. 4
  • 5. Photo Filtering (2) • Soccer/Venue classification – In the case of venue classification, use geo-tagging information if available. – Match against soccer/venue tag model: • Parameter (cf. evaluation) TAG MODEL SAMPLES (baseline) Soccer (53, m1,b) Paradiso (6, m2,b) Parc del Forum (8, m2,b) soccer paradiso parc del forum football names of Spanish FCs concert primavera sound + goal names of Italian FCs festival concert goalkeeper gig festival … live music … + domain names of scheduled bands (last.fm) knowledge 5
  • 6. Event Partitioning • Very simple implementation: – Find all unique dates of photos that “passed” the first filtering step. – For each date, find all associated photos and split them into groups based on the city they are classified (same classifier as in Step 1). – Consider the resulting groups of photos, as the set of events. 6
  • 7. Event Expansion • Expand in three ways: – Photos having the same owner as one of the owners in the event & captured at the same date. – Photos captured at the same location (<200m) with the event center & at the same date (only for geo-tagged photos) – Photos belonging to the same cluster (by use of method [1]) & having the same owner as one of the owners in the event (parameter: cluster type) [1] S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali. “Cluster-based Landmark and Event Detection on Tagged Photo Collections”. In IEEE Multimedia Magazine 18(1), pp. 52-63, 2011 7
  • 8. Evaluation (1) Challenge 1 Notation Parameter 1 (p1): m1,b (baseline tag model), m1,+ (extended soccer tag model) Parameter 2 (p2): tt (use photo title + tags), ttd (use photo description + tt) Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering) 8
  • 9. Evaluation (2) Challenge 2 Notation Parameter 1 (p1): m2,b (baseline tag model), m2,+ (extended venue tag model) Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering), H (hybrid clustering) m2,+ was created by adding to baseline the names of the bands that played in these venues in the same month (collected from last.fm API) 9
  • 10. Failure examples (1) C1 - Run1 / False positives 3559542192 3618132279 3580841609 Title: AVUÍ SOM 77.331 Title: Sant Pere Title: roma 09. Tags: …, Campions, Trophy, Tags: Barcelone, Barcelona, Tags: rome, italy coliseum, campnou, soccer, football, Night Ambiance, Light palatino, chuch, soccer, caosasuna, barça, fiesta, … statues, art Many of the photo tags Captured at the same Just one of the tags are related to soccer and date and in the vicinity of (soccer) is related to even to a soccer event the event. soccer. (fiesta, champions). Most of the false positives were due to the expansion step (i.e. same day + close by, or same day + same user) 10
  • 11. Failure examples (2) C1 - Run2 / False negatives 3559542192 3571654936 3583033760 Title: near Tor di Quinto, Title: Barcelona v. Latium, Italy Title: DSC_0029 Manchester United Tags: N/A Tags: FC Barcelona Fiesta Tri Tags: Sigma 10-20mm, F4-5.6 Campions Description: s.s. lazio wins EX DC HSM, barcelona, spain, the coppa italia moo2 Here the event The information could be Event information is information is only inferred from title if our tag encoded in a single tag, present in the photo model contained FC names but we don’t tokenize description. from different countries. tags, so we miss it. Most of the false negatives were due to failure in matching the textual metadata of photos to the soccer tag model. 11
  • 12. Discussion (1) • Most important factor: – a good tag model to be used for classification • Marginal contribution of clustering: – expansion by spatio-temporal metadata already captures most related photos – tag-based clusters tend to include many of the photos of the same user at the same date – visual clusters did not yield further improvements as one would hope (at least with employed visual similarity measure: 500 feature vector from clustering SIFT features) 12
  • 13. Discussion (2) • Future action: study in detail failure cases and make necessary modifications to approach • Ways to improve: – better topic/entity classification methods • better/richer tag models + text matching methods • more sophisticated methods: e.g. SVMs, relational learning + more discriminative features (text, visual, social) – more elaborate city classification methods or even precise geo-tagging methods 13
  • 14. Questions 14