SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
CERTH @ MediaEval 2011 Social
Event Detection Task
Symeon Papadopoulos, Christos Zigkolis, Yiannis
Kompatsiaris, Athena Vakali



Pisa, 1-2 September 2011
The problem
•   Identify social events in tagged photos collections:
    –   Challenge 1: Soccer matches @ Barcelona, Rome
    –   Challenge 2: Events @ Paradiso (Amsterdam) and
                       Parc del Forum (Barcelona)

•   Alternative formulation:
    –   For each photo of the collection answer the questions:
        Q1. Is this photo related to a social event of the given types?
        Q2. If yes, to which event is it related?
    –   Points to classification and clustering as methods to
        address the problem.


                                                                          2
Approach

  Q1

  Q2


 Q1 / Q2

           3
Photo Filtering (1)
• City classification
   – If geo-tagging available (~20%), use it  simple
     nearest-neighbour classifier
   – If not, match against city-specific tag models:
       • Created from processing independent geo-tagged
         photo collections         TAG MODEL SAMPLES
 Amsterdam (74)   Barcelona (57)   London (89)      Paris (51)   Rome (42)
 amsterdam        barcelona        london           paris        rome
 netherlands      catalunya        uk               france       italy
 holland          catalonia        united kingdom   francia      vaticano
 nederland        españa           great britain    versailles   italia
 ….               ….               ….               ….           ….

                                                                             4
Photo Filtering (2)
• Soccer/Venue classification
    – In the case of venue classification, use geo-tagging
      information if available.
    – Match against soccer/venue tag model:
         • Parameter (cf. evaluation)
                                                TAG MODEL SAMPLES (baseline)
Soccer (53, m1,b)                     Paradiso (6, m2,b)   Parc del Forum (8, m2,b)
soccer                                paradiso             parc del forum
football       names of Spanish FCs   concert              primavera sound
           +
goal           names of Italian FCs   festival             concert
goalkeeper                            gig                  festival
…                                     live music           …
                                                        +
                          domain           names of scheduled bands (last.fm)
                         knowledge                                              5
Event Partitioning

• Very simple implementation:
  – Find all unique dates of photos that “passed” the
    first filtering step.
  – For each date, find all associated photos and split
    them into groups based on the city they are
    classified (same classifier as in Step 1).
  – Consider the resulting groups of photos, as the set
    of events.


                                                      6
Event Expansion
• Expand in three ways:
  – Photos having the same owner as one of the
    owners in the event & captured at the same date.
  – Photos captured at the same location (<200m)
    with the event center & at the same date (only for
    geo-tagged photos)
  – Photos belonging to the same cluster (by use of
    method [1]) & having the same owner as one of
    the owners in the event (parameter: cluster type)
             [1] S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali. “Cluster-based
             Landmark and Event Detection on Tagged Photo Collections”. In IEEE Multimedia
             Magazine 18(1), pp. 52-63, 2011
                                                                                       7
Evaluation (1)
                                  Challenge 1




Notation
Parameter 1 (p1): m1,b (baseline tag model), m1,+ (extended soccer tag model)
Parameter 2 (p2): tt (use photo title + tags), ttd (use photo description + tt)
Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering)




                                                                                       8
Evaluation (2)
                                  Challenge 2




Notation
Parameter 1 (p1): m2,b (baseline tag model), m2,+ (extended venue tag model)
Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering),
                    H (hybrid clustering)

m2,+ was created by adding to baseline the names of the bands that played in these
          venues in the same month (collected from last.fm API)

                                                                                        9
Failure examples (1)
C1 - Run1 / False positives
3559542192                            3618132279                         3580841609




Title: AVUÍ SOM 77.331                Title: Sant Pere                  Title: roma 09.
Tags: …, Campions, Trophy,            Tags: Barcelone, Barcelona,       Tags: rome, italy coliseum,
campnou, soccer, football,            Night Ambiance, Light             palatino, chuch, soccer,
caosasuna, barça, fiesta, …                                             statues, art

Many of the photo tags                Captured at the same              Just one of the tags
are related to soccer and             date and in the vicinity of       (soccer) is related to
even to a soccer event                the event.                        soccer.
(fiesta, champions).

                              Most of the false positives were due to the expansion step
                              (i.e. same day + close by, or same day + same user)
                                                                                                   10
Failure examples (2)
C1 - Run2 / False negatives
3559542192                             3571654936                         3583033760




Title: near Tor di Quinto,             Title: Barcelona v.
Latium, Italy                                                            Title: DSC_0029
                                       Manchester United
Tags: N/A                                                                Tags: FC Barcelona Fiesta Tri
                                       Tags: Sigma 10-20mm, F4-5.6
                                                                         Campions
Description: s.s. lazio wins           EX DC HSM, barcelona, spain,
the coppa italia                       moo2
Here the event                         The information could be          Event information is
information is only                    inferred from title if our tag    encoded in a single tag,
present in the photo                   model contained FC names          but we don’t tokenize
description.                           from different countries.         tags, so we miss it.

                               Most of the false negatives were due to failure in matching
                               the textual metadata of photos to the soccer tag model.            11
Discussion (1)
• Most important factor:
   – a good tag model to be used for classification


• Marginal contribution of clustering:
   – expansion by spatio-temporal metadata already captures
     most related photos
   – tag-based clusters tend to include many of the photos of
     the same user at the same date
   – visual clusters did not yield further improvements as one
     would hope (at least with employed visual similarity
     measure: 500 feature vector from clustering SIFT features)


                                                             12
Discussion (2)
• Future action: study in detail failure cases and
  make necessary modifications to approach
• Ways to improve:
  – better topic/entity classification methods
     • better/richer tag models + text matching methods
     • more sophisticated methods: e.g. SVMs, relational
       learning + more discriminative features (text, visual,
       social)
  – more elaborate city classification methods or even
    precise geo-tagging methods

                                                                13
Questions




            14

Más contenido relacionado

Más de Symeon Papadopoulos

Más de Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their Detection
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering Localization
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact Tracing
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air Quality
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media Content
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the Internet
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on Twitter
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016
 
Multimedia Privacy
Multimedia PrivacyMultimedia Privacy
Multimedia Privacy
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging Performance
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News Professionals
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online Discussions
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Último (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 

CERTH @ MediaEval 2011 Social Event Detection Task

  • 1. CERTH @ MediaEval 2011 Social Event Detection Task Symeon Papadopoulos, Christos Zigkolis, Yiannis Kompatsiaris, Athena Vakali Pisa, 1-2 September 2011
  • 2. The problem • Identify social events in tagged photos collections: – Challenge 1: Soccer matches @ Barcelona, Rome – Challenge 2: Events @ Paradiso (Amsterdam) and Parc del Forum (Barcelona) • Alternative formulation: – For each photo of the collection answer the questions: Q1. Is this photo related to a social event of the given types? Q2. If yes, to which event is it related? – Points to classification and clustering as methods to address the problem. 2
  • 3. Approach Q1 Q2 Q1 / Q2 3
  • 4. Photo Filtering (1) • City classification – If geo-tagging available (~20%), use it  simple nearest-neighbour classifier – If not, match against city-specific tag models: • Created from processing independent geo-tagged photo collections TAG MODEL SAMPLES Amsterdam (74) Barcelona (57) London (89) Paris (51) Rome (42) amsterdam barcelona london paris rome netherlands catalunya uk france italy holland catalonia united kingdom francia vaticano nederland españa great britain versailles italia …. …. …. …. …. 4
  • 5. Photo Filtering (2) • Soccer/Venue classification – In the case of venue classification, use geo-tagging information if available. – Match against soccer/venue tag model: • Parameter (cf. evaluation) TAG MODEL SAMPLES (baseline) Soccer (53, m1,b) Paradiso (6, m2,b) Parc del Forum (8, m2,b) soccer paradiso parc del forum football names of Spanish FCs concert primavera sound + goal names of Italian FCs festival concert goalkeeper gig festival … live music … + domain names of scheduled bands (last.fm) knowledge 5
  • 6. Event Partitioning • Very simple implementation: – Find all unique dates of photos that “passed” the first filtering step. – For each date, find all associated photos and split them into groups based on the city they are classified (same classifier as in Step 1). – Consider the resulting groups of photos, as the set of events. 6
  • 7. Event Expansion • Expand in three ways: – Photos having the same owner as one of the owners in the event & captured at the same date. – Photos captured at the same location (<200m) with the event center & at the same date (only for geo-tagged photos) – Photos belonging to the same cluster (by use of method [1]) & having the same owner as one of the owners in the event (parameter: cluster type) [1] S. Papadopoulos, C. Zigkolis, Y. Kompatsiaris, A. Vakali. “Cluster-based Landmark and Event Detection on Tagged Photo Collections”. In IEEE Multimedia Magazine 18(1), pp. 52-63, 2011 7
  • 8. Evaluation (1) Challenge 1 Notation Parameter 1 (p1): m1,b (baseline tag model), m1,+ (extended soccer tag model) Parameter 2 (p2): tt (use photo title + tags), ttd (use photo description + tt) Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering) 8
  • 9. Evaluation (2) Challenge 2 Notation Parameter 1 (p1): m2,b (baseline tag model), m2,+ (extended venue tag model) Parameter 3 (p3): ∅ (no clustering), T (tag-based clustering), V (visual clustering), H (hybrid clustering) m2,+ was created by adding to baseline the names of the bands that played in these venues in the same month (collected from last.fm API) 9
  • 10. Failure examples (1) C1 - Run1 / False positives 3559542192 3618132279 3580841609 Title: AVUÍ SOM 77.331 Title: Sant Pere Title: roma 09. Tags: …, Campions, Trophy, Tags: Barcelone, Barcelona, Tags: rome, italy coliseum, campnou, soccer, football, Night Ambiance, Light palatino, chuch, soccer, caosasuna, barça, fiesta, … statues, art Many of the photo tags Captured at the same Just one of the tags are related to soccer and date and in the vicinity of (soccer) is related to even to a soccer event the event. soccer. (fiesta, champions). Most of the false positives were due to the expansion step (i.e. same day + close by, or same day + same user) 10
  • 11. Failure examples (2) C1 - Run2 / False negatives 3559542192 3571654936 3583033760 Title: near Tor di Quinto, Title: Barcelona v. Latium, Italy Title: DSC_0029 Manchester United Tags: N/A Tags: FC Barcelona Fiesta Tri Tags: Sigma 10-20mm, F4-5.6 Campions Description: s.s. lazio wins EX DC HSM, barcelona, spain, the coppa italia moo2 Here the event The information could be Event information is information is only inferred from title if our tag encoded in a single tag, present in the photo model contained FC names but we don’t tokenize description. from different countries. tags, so we miss it. Most of the false negatives were due to failure in matching the textual metadata of photos to the soccer tag model. 11
  • 12. Discussion (1) • Most important factor: – a good tag model to be used for classification • Marginal contribution of clustering: – expansion by spatio-temporal metadata already captures most related photos – tag-based clusters tend to include many of the photos of the same user at the same date – visual clusters did not yield further improvements as one would hope (at least with employed visual similarity measure: 500 feature vector from clustering SIFT features) 12
  • 13. Discussion (2) • Future action: study in detail failure cases and make necessary modifications to approach • Ways to improve: – better topic/entity classification methods • better/richer tag models + text matching methods • more sophisticated methods: e.g. SVMs, relational learning + more discriminative features (text, visual, social) – more elaborate city classification methods or even precise geo-tagging methods 13
  • 14. Questions 14