SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Social Event Detection (SED):
Challenges, Dataset and Evaluation
 Raphaël Troncy <raphael.troncy@eurecom.fr>
 Vasileios Mezaris <bmezaris@iti.gr>
 Symeon Papadopoulos <papadop@iti.gr>
 Emmanouil Schinas <manosetro@iti.gr>
 Ioannis Kompatsiaris <ikom@iti.gr>
What are Events?

 Events are observable occurrences grouping




                      People                       Places Time

                 Experiences documented by Media




  04/10/2012 -        Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -2
SED: bigger, longer, harder

 In 2011                                                   In 2012
   2 challenges                                                      3 challenges
   73k photos (2,43 Gb)                                                       1 from SED 2011
   No training dataset                                               167k photos (5,5 Gb)
                                                                               cc licence check
   18 teams interested
   7 teams submitted runs                                            Training dataset =
                                                                       SED 2011
 Considered easy                                                     21 teams interested
   F-measure = 85%                                                            … from 15 countries
    (challenge 1)                                                     5 teams submitted runs
   F-measure = 69%
    (challenge 2)                                           Much harder !

   04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy      -3
Three challenges (type and venue)
1. Find all technical events that took place in Germany in
   the test collection.
2. Find all soccer events taking place in Hamburg
   (Germany) and Madrid (Spain) in the collection.
3. Find all demonstration and protest events of the
   Indignados movement occurring in public places in
   Madrid in the collection

    For each event, we provided relevant and non relevant
     example photos
 Task = detect events and provide all illustrating photos


    04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -4
Dataset Construction

 Collect 167332 Flickr Photos (Jan 2009-Dec 2011)
   4,422 unique Flickr users, all in CC licence
   All geo-tagged in 5 cities: Barcelona (72255), Cologne
    (15850), Hannover (2823), Hamburg (16958), Madrid
    (59043) + 0,22 % (403) from EventMedia

 Altered metadata:
   geo-tags removed for 80% of the photos (random)
   33466 photos still geo-tagged

 Provide only metadata … but real media were
  available to participants if they asked (5,5 Gb)

    04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -5
Ground Truth and Evaluation Measures

 CrEve annotation tool: http://www.clusttour.gr/creve/
    For each of the 6 collections, review all photos and
     associate them to events (that have to be created)
    Search by text, geo-coordinates, date and user
    Review annotations made by others
    Use EventMedia and machine tags (upcoming:event=xxx)

 Evaluation Measures:
    Harmonic mean (F-score of Precision and Recall)
    Normalized Mutual Information (NMI): jointly consider the
     goodness of the photos retrieved and their correct
     assignment to different events

    04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -6
What ideally should be found

 Challenge 1:
   19 events, 2234 photos (avg = 117)
   Baseline precision (random): 0,01%

 Challenge 2:
   79 events, 1684 photos (avg = 21)
   Baseline precision (random): 0,01%

 Challenge 3:
   52 events, 3992 Photos (avg = 77)
   Baseline precision (random): 0,02%



   04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -7
Who Has Participated ?

 21 Teams registered (18 in 2011)
 5 Teams cross the lines (7 in 2011, 2 overlaps)




 One participant missing at the workshop!
    04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -8
Quick Summary of Approaches
 2011: all but 1 participants use background knowledge
    Last.fm (all), Fbleague (EURECOM), PlayerHistory (QMUL)
    DBpedia, Freebase, Geonames, WordNet

 2012: all but 2 participants use a generic approach
    IR approach: query matching clusters (metadata, temporal, spatial):
     MISIMIS
    Classification approach:
          Topic detection with LDA, city classification with TF-IDF, event detection using
           peaks in timeline using the query topics: AUTH-ISSEL
          Learning model using the training data and SVM: CERTH-ITI
    Background knowledge: QMUL, DISI

 2012: all approaches are NOT fully automatic
    Manual selection of some parameters (e.g. topics)

    04/10/2012 -          Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   -9
Results – Challenge 1 (Technical Events)

                    Precision                             Recall                               F-score                       NMI
AUTHISSEL_4                 76,29                             94,9                               84,58                       0,7238
CERTH_1                     43,11                            11,91                               18,66                       0,1877
DISI_1                      86,23                           59,13                                70,15                       0,6011
MISIMS_2                    2,52                              1,88                                2,15                       0,0236
QMUL_4                      3,86                            12,85                                 5,93                       0,0475
         90         84,58
         80
                                                                    70,15
         70

         60

         50

         40

         30
                                          18,66
         20

         10                                                                                                           5,93
                                                                                               2,15
         0
                                                                    Runs

                                AUTHISSEL_4         CERTHITI_1        DISI_1      MISIMS_2            QMUL_4

     04/10/2012 -            Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy                   - 10
Results – Challenge 2 (Soccer Events)

                    Precision                            Recall                               F-score           NMI
AUTHISSEL_4             88,18                              93,49                               90,76            0,8499
CERTH_1                 85,57                              66,19                               74,64            0,6745
DISI_1
MISIMS_2                34,49                              17,25                               22,99            0,1993
QMUL_4                  79,04                              67,12                               72,59            0,6493
         100
                    90,76
           90
           80                            74,64                                                          72,59
           70
           60
           50
           40
           30                                                                          22,99
           20
           10
            0
                                                                 Runs

                            AUTHISSEL_4          CERTHITI_3       DISI_1      MISIMS_2         QMUL_1


     04/10/2012 -           Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy             - 11
Results – Challenge 3 (Indignados Events)

                    Precision                            Recall                               F-score           NMI
AUTHISSEL_4            88,91                               90,78                               89,83            0,738
CERTH_1                86,24                               54,61                               66,87            0,4654
DISI_1                 86,15                               47,17                               60,96            0,4465
MISIMS_2                48,3                               46,87                               47,58            0,3088
QMUL_4                 22,88                               33,48                               27,19            0,1988
           100
                    89,83
            90
            80
            70                           66,87
                                                                60,96
            60
                                                                                       47,58
            50
            40
            30                                                                                          27,19

            20
            10
              0
                                                                 Runs

                            AUTHISSEL_4          CERTHITI_3       DISI_1      MISIMS_2         QMUL_4

     04/10/2012 -           Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy             - 12
Conclusion

 Lessons learned
   Clear winner for all tasks: generic approach but manual
    selection of the topics
   Use of background knowledge still useful if well-used

 Looking at next year SED
   Shlomo Geva (Queensland University of Technology) +
    Philipp Cimiano (University of Bielefeld)
   Dataset: bigger, more diverse
   Media: photos and videos ? (at least 10% videos?)
   Metadata: include some social network relationships,
    participation at events
   Evaluation measures: event granularity? Time/CPU?
   04/10/2012 -   Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy   - 13

Más contenido relacionado

Destacado

SAMT09 - Web of Data Tutorial - Part 1
SAMT09 - Web of Data Tutorial - Part 1SAMT09 - Web of Data Tutorial - Part 1
SAMT09 - Web of Data Tutorial - Part 1
Bernhard Haslhofer
 

Destacado (19)

Linking Media and Data using Apache Marmotta (LIME workshop keynote)
Linking Media and Data using Apache Marmotta  (LIME workshop keynote)Linking Media and Data using Apache Marmotta  (LIME workshop keynote)
Linking Media and Data using Apache Marmotta (LIME workshop keynote)
 
SAMT09 - Web of Data Tutorial - Part 1
SAMT09 - Web of Data Tutorial - Part 1SAMT09 - Web of Data Tutorial - Part 1
SAMT09 - Web of Data Tutorial - Part 1
 
A Semantic Multimedia Web (Part 1)
A Semantic Multimedia Web (Part 1)A Semantic Multimedia Web (Part 1)
A Semantic Multimedia Web (Part 1)
 
The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...The Story behind Everything Is Connected: Multimedia narration of automatical...
The Story behind Everything Is Connected: Multimedia narration of automatical...
 
Linked Media Management with Apache Marmotta
Linked Media Management with Apache MarmottaLinked Media Management with Apache Marmotta
Linked Media Management with Apache Marmotta
 
Multimedia Semantics: Metadata, Analysis and Interaction
Multimedia Semantics:Metadata, Analysis and InteractionMultimedia Semantics:Metadata, Analysis and Interaction
Multimedia Semantics: Metadata, Analysis and Interaction
 
Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013Semantics at the multimedia fragment level SSSW 2013
Semantics at the multimedia fragment level SSSW 2013
 
A Semantic Multimedia Web (Part 2)
A Semantic Multimedia Web (Part 2)A Semantic Multimedia Web (Part 2)
A Semantic Multimedia Web (Part 2)
 
A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)A Semantic Multimedia Web (Part 3)
A Semantic Multimedia Web (Part 3)
 
MICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisMICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media Analysis
 
Multimedia big data 140619
Multimedia big data 140619Multimedia big data 140619
Multimedia big data 140619
 
Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...Semantics at the multimedia fragment level or how enabling the remixing of on...
Semantics at the multimedia fragment level or how enabling the remixing of on...
 
Describing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and DescriptionDescribing Media Assets: Media Fragment Specification and Description
Describing Media Assets: Media Fragment Specification and Description
 
Multimedia Annotation Localisation on the Web of Linked Data
Multimedia Annotation Localisation on the Web of Linked DataMultimedia Annotation Localisation on the Web of Linked Data
Multimedia Annotation Localisation on the Web of Linked Data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Structured Data & the Future of Educational Material
Structured Data & the Future of Educational MaterialStructured Data & the Future of Educational Material
Structured Data & the Future of Educational Material
 
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia F...
 
A Semantic Multimedia Web: Create, Annotate, Present and Share your Media
A Semantic Multimedia Web: Create, Annotate, Present and Share your MediaA Semantic Multimedia Web: Create, Annotate, Present and Share your Media
A Semantic Multimedia Web: Create, Annotate, Present and Share your Media
 
A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...A replication study of the top performing systems in SemEval twitter sentimen...
A replication study of the top performing systems in SemEval twitter sentimen...
 

Más de Raphael Troncy

Más de Raphael Troncy (20)

K CAP 2019 Opening Ceremony
K CAP 2019 Opening CeremonyK CAP 2019 Opening Ceremony
K CAP 2019 Opening Ceremony
 
Semantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things EnvironmentSemantic Technologies for Connected Vehicles in a Web of Things Environment
Semantic Technologies for Connected Vehicles in a Web of Things Environment
 
HyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learningHyperTED: exploring video lectures at the fragment levels for enhancing learning
HyperTED: exploring video lectures at the fragment levels for enhancing learning
 
Location Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip RecommendationLocation Embeddings for Next Trip Recommendation
Location Embeddings for Next Trip Recommendation
 
Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014Contextualizing Events in TV News Shows - SNOW 2014
Contextualizing Events in TV News Shows - SNOW 2014
 
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
Modeling Geometry and Reference Systems on the Web of Data - LGD 2014
 
NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...NERD: an open source platform for extracting and disambiguating named entitie...
NERD: an open source platform for extracting and disambiguating named entitie...
 
Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013Deep-linking into Media Assets at the Fragment Level SMAM 2013
Deep-linking into Media Assets at the Fragment Level SMAM 2013
 
Live topic generation from event streams
Live topic generation from event streamsLive topic generation from event streams
Live topic generation from event streams
 
MediaEval 2011 SED Opening
MediaEval 2011 SED OpeningMediaEval 2011 SED Opening
MediaEval 2011 SED Opening
 
ShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting EventsShareIt: Mining SocialMedia Activities for Detecting Events
ShareIt: Mining SocialMedia Activities for Detecting Events
 
Finding media illustrating events
Finding media illustrating eventsFinding media illustrating events
Finding media illustrating events
 
Experiencing Events through User-Generated Media
Experiencing Events through User-Generated MediaExperiencing Events through User-Generated Media
Experiencing Events through User-Generated Media
 
Linking Events with Media
Linking Events with MediaLinking Events with Media
Linking Events with Media
 
Multimedia Semantics - SSMS 2010
Multimedia Semantics - SSMS 2010Multimedia Semantics - SSMS 2010
Multimedia Semantics - SSMS 2010
 
LODE: Une Ontologie pour representer des evenements dans le Web de Donnees
LODE: Une Ontologie pour representer des evenements dans le Web de DonneesLODE: Une Ontologie pour representer des evenements dans le Web de Donnees
LODE: Une Ontologie pour representer des evenements dans le Web de Donnees
 
Provenance for Multimedia
Provenance for MultimediaProvenance for Multimedia
Provenance for Multimedia
 
Designing User Support for Event-based Annotation and Exploration of Media
Designing User Support for Event-based Annotation and Exploration of MediaDesigning User Support for Event-based Annotation and Exploration of Media
Designing User Support for Event-based Annotation and Exploration of Media
 
Towards Collaborative Annotation for Video Accessibility
Towards Collaborative Annotation for Video AccessibilityTowards Collaborative Annotation for Video Accessibility
Towards Collaborative Annotation for Video Accessibility
 
Implementing the Media Fragments URI Specification
Implementing the Media Fragments URI SpecificationImplementing the Media Fragments URI Specification
Implementing the Media Fragments URI Specification
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

MediaEval 2012 SED Opening

  • 1. Social Event Detection (SED): Challenges, Dataset and Evaluation Raphaël Troncy <raphael.troncy@eurecom.fr> Vasileios Mezaris <bmezaris@iti.gr> Symeon Papadopoulos <papadop@iti.gr> Emmanouil Schinas <manosetro@iti.gr> Ioannis Kompatsiaris <ikom@iti.gr>
  • 2. What are Events? Events are observable occurrences grouping People Places Time Experiences documented by Media 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -2
  • 3. SED: bigger, longer, harder  In 2011  In 2012  2 challenges  3 challenges  73k photos (2,43 Gb) 1 from SED 2011  No training dataset  167k photos (5,5 Gb) cc licence check  18 teams interested  7 teams submitted runs  Training dataset = SED 2011  Considered easy  21 teams interested  F-measure = 85% … from 15 countries (challenge 1)  5 teams submitted runs  F-measure = 69% (challenge 2)  Much harder ! 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -3
  • 4. Three challenges (type and venue) 1. Find all technical events that took place in Germany in the test collection. 2. Find all soccer events taking place in Hamburg (Germany) and Madrid (Spain) in the collection. 3. Find all demonstration and protest events of the Indignados movement occurring in public places in Madrid in the collection  For each event, we provided relevant and non relevant example photos  Task = detect events and provide all illustrating photos 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -4
  • 5. Dataset Construction  Collect 167332 Flickr Photos (Jan 2009-Dec 2011)  4,422 unique Flickr users, all in CC licence  All geo-tagged in 5 cities: Barcelona (72255), Cologne (15850), Hannover (2823), Hamburg (16958), Madrid (59043) + 0,22 % (403) from EventMedia  Altered metadata:  geo-tags removed for 80% of the photos (random)  33466 photos still geo-tagged  Provide only metadata … but real media were available to participants if they asked (5,5 Gb) 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -5
  • 6. Ground Truth and Evaluation Measures  CrEve annotation tool: http://www.clusttour.gr/creve/  For each of the 6 collections, review all photos and associate them to events (that have to be created)  Search by text, geo-coordinates, date and user  Review annotations made by others  Use EventMedia and machine tags (upcoming:event=xxx)  Evaluation Measures:  Harmonic mean (F-score of Precision and Recall)  Normalized Mutual Information (NMI): jointly consider the goodness of the photos retrieved and their correct assignment to different events 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -6
  • 7. What ideally should be found  Challenge 1:  19 events, 2234 photos (avg = 117)  Baseline precision (random): 0,01%  Challenge 2:  79 events, 1684 photos (avg = 21)  Baseline precision (random): 0,01%  Challenge 3:  52 events, 3992 Photos (avg = 77)  Baseline precision (random): 0,02% 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -7
  • 8. Who Has Participated ?  21 Teams registered (18 in 2011)  5 Teams cross the lines (7 in 2011, 2 overlaps)  One participant missing at the workshop! 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -8
  • 9. Quick Summary of Approaches  2011: all but 1 participants use background knowledge  Last.fm (all), Fbleague (EURECOM), PlayerHistory (QMUL)  DBpedia, Freebase, Geonames, WordNet  2012: all but 2 participants use a generic approach  IR approach: query matching clusters (metadata, temporal, spatial): MISIMIS  Classification approach:  Topic detection with LDA, city classification with TF-IDF, event detection using peaks in timeline using the query topics: AUTH-ISSEL  Learning model using the training data and SVM: CERTH-ITI  Background knowledge: QMUL, DISI  2012: all approaches are NOT fully automatic  Manual selection of some parameters (e.g. topics) 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy -9
  • 10. Results – Challenge 1 (Technical Events) Precision Recall F-score NMI AUTHISSEL_4 76,29 94,9 84,58 0,7238 CERTH_1 43,11 11,91 18,66 0,1877 DISI_1 86,23 59,13 70,15 0,6011 MISIMS_2 2,52 1,88 2,15 0,0236 QMUL_4 3,86 12,85 5,93 0,0475 90 84,58 80 70,15 70 60 50 40 30 18,66 20 10 5,93 2,15 0 Runs AUTHISSEL_4 CERTHITI_1 DISI_1 MISIMS_2 QMUL_4 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy - 10
  • 11. Results – Challenge 2 (Soccer Events) Precision Recall F-score NMI AUTHISSEL_4 88,18 93,49 90,76 0,8499 CERTH_1 85,57 66,19 74,64 0,6745 DISI_1 MISIMS_2 34,49 17,25 22,99 0,1993 QMUL_4 79,04 67,12 72,59 0,6493 100 90,76 90 80 74,64 72,59 70 60 50 40 30 22,99 20 10 0 Runs AUTHISSEL_4 CERTHITI_3 DISI_1 MISIMS_2 QMUL_1 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy - 11
  • 12. Results – Challenge 3 (Indignados Events) Precision Recall F-score NMI AUTHISSEL_4 88,91 90,78 89,83 0,738 CERTH_1 86,24 54,61 66,87 0,4654 DISI_1 86,15 47,17 60,96 0,4465 MISIMS_2 48,3 46,87 47,58 0,3088 QMUL_4 22,88 33,48 27,19 0,1988 100 89,83 90 80 70 66,87 60,96 60 47,58 50 40 30 27,19 20 10 0 Runs AUTHISSEL_4 CERTHITI_3 DISI_1 MISIMS_2 QMUL_4 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy - 12
  • 13. Conclusion  Lessons learned  Clear winner for all tasks: generic approach but manual selection of the topics  Use of background knowledge still useful if well-used  Looking at next year SED  Shlomo Geva (Queensland University of Technology) + Philipp Cimiano (University of Bielefeld)  Dataset: bigger, more diverse  Media: photos and videos ? (at least 10% videos?)  Metadata: include some social network relationships, participation at events  Evaluation measures: event granularity? Time/CPU? 04/10/2012 - Social Event Detection (SED) Task - MediaEval 2012, Pisa, Italy - 13