SlideShare una empresa de Scribd logo
1 de 22
Evaluating Entity Summarization
      Using a Game-Based Ground Truth
           Andreas Thalhammer¹, Magnus Knuth²,
                     and Harald Sack²




                             ¹ University of Innsbruck, Austria
13 Nov. 2012
ISWC 2012 Boston             ² Hasso Plattner Institute Potsdam, Germany
Google: “Get the best summary” [1]
  • Inglourious Basterds (Movie)
  • Freebase: 1279 triples
  • DBpedia: 217 triples


   • Google Knowledge Graph
     summary: 14 triples

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   2
Entity Summarization
• First attempt towards a definition:
“... not just represent the main themes of the
original data, but rather, can best identify the
underlying entity” [2]

                       Is this a good definition?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   3
Entity Summarization (cont.)
   “A summary can be loosely defined as a text that is
   produced from one or more texts, that conveys
   important information in the original text(s), and
   that is no longer than half of the original text(s) and
   usually significantly less than that.” [3]

               A summary is
                 • short
                 • and conveys important information.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   4
Entity Summarization (cont.)
• Our (loose) definition:
    “Entity summarization is the task of producing a
    summary that conveys important facts about the
    entity while reducing the number of shown facts
    significantly.”




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   5
The Problem: Evaluation
  • How do we make different summarization
    systems comparable?

           Sub-question:
         • How do we grasp the idea of “important
            facts”?



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   6
Related Work
• RELIN: Relatedness and Informativeness-based
  Centrality for Entity Summarization [2]

      – Intrinsic: 24 users compiled summaries of 149
        entities (forming a gold standard)
        (Intersection-based similarity)

      – Extrinsic: 47 pairs of FB and DBpedia entities were
        selected (24 correct ones, 23 incorrect ones).
                      Users judge whether pairs are correct or not.

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   7
Related Work (cont.)
• Towards exploratory video search using linked
  data [4]

      – Quantitative evaluation of heuristics
         Ground truth, containing 115 entities
        summarized by 72 users.
      – Precision/Recall similarity measure



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   8
Related Work (cont.)
• It is hard to find participants.
• Generating summaries is a cumbersome
  process.
• Only a subset of property-value pairs are
  ranked by the users.
• Up to this point, none of the two evaluation
  datasets is publically available.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   9
Our Idea
• Important facts are commonly known
• Unimportant facts are rarely known

• How to find out?




                      Linked Data quiz game!
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   10
Hypothesis
“A game-based ground truth is suitable for
evaluating the performance of summarization
approaches in the movie domain”

Assumption: implemented approaches correlate
with the game-based ground truth while random
summaries do not.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   11
Dataset
• 60 arbitrary selected movies from IMDb Top250
• RDF descriptions from Freebase
• Usage of a property white list
• Triple store: Ontotext’s OWLIM with OWL2-RL
  reasoning enabled.
• Property chains:
               <http://some-name.space/hasActor>
               <http://www.w3.org/2002/07/owl#propertyChainAxiom> (
               <http://rdf.freebase.com/ns/film.film.starring>
               <http://rdf.freebase.com/ns/film.performance.actor> ).
                 All data is available at: http://yovisto.com/labs/iswc2012
13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   12
WhoKnows?Movies!
            S                        P                        O
:The_Princess_Bride         prop:actor            :Billy_Crystal, ...
:Braveheart                 prop:actor            :Mel_Gibson, ...
:Pulp_Fiction               prop:actor            :John_Travolta .


  •      Question types:
                 - One-to-One
                 - One-to-N
  •      Questions are composed
         upside down:
                  ‘Object is the property of subject1,
                  subject2, subject3’
                     Play the game at: http://bit.ly/WhoKnowsMovies
  13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   13
Frequency == Importance ???
                                                                      word       upper          lower

• Information retrieval:                                         frequency       cut-off        cut-off



      – Luhn (1958):
        “resolving power of words” [5]
                                                                                                          ranking by
                                                                                                          word frequency

• Game supports half-knowledge in general
      – e.g. which movie was released 1994?
        Monsters, Inc. – Pulp Fiction – Casablanca
      – ... but the human brain performs better with
            pictures (actors), sounds (film music), ...
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston             14
Evaluated Systems
• UBES (Usage-based Entity Summarization) [5]
      – Combine Freebase with HetRec2011 MovieLens2k [6]
      – Use item-based collaborative filtering to form
        neighborhoods for each movie
      – Find out which property-value pairs a movie shares
        with its neighbors
      – Use a TF-IDF related weighting scheme
                                    Bob          Alice         Marc          Elena         John     Mary
                Pulp Fiction            1            0             1              0            1      1
                Heat                    0            0             1              1            0      0
                Kill Bill               1            0             1              0            1      0
13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston          15
Evaluated Systems (cont.)
  • GKG (Google Knowledge Graph) [1]
        – Enables semi-automatic transformation to Freebase
/search?hl=en&q=quentin+tarantino&
stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_
sQyxwwAAAA&
sa=X&ei=FnjTT7rXN8jftAaAhPWIDw&
ved=0CKwBEJsTKAA

        – base64 + gzip
         /m/0693l
                        http://www.freebase.com/view/m/0693l
                        redirects to:
                        http://www.freebase.com/view/en/quentin_tarantino
  13 Nov. 2012    Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   16
Results
• 690 sessions, 8308 questions
• 217 players (135 players played only once)
• 2314 of 2829 triples were played more than 3
  times




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   17
Result: Kendall’s τ
• Property ranking:



• Feature (property-value) ranking:




13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   18
Conclusion
• The results indicate that a game-based ground
  truth is suitable for evaluating entity
  summarization.

• The current dataset is too sparse to make valid
  assumptions about the importance of single
  facts.


13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   19
Future Work
•    Increase the number of players
•    Score the exclusion principle
•    Increase the number of movies
•    Application to additional domains
•    Publish new versions of the evaluation dataset
     on a regular basis



13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   20
Questions?

                              Help collecting data:
                  http://bit.ly/WhoKnowsMovies


                                              Andreas Thalhammer (andreas.thalhammer@sti2.at)
                                              Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de)
                                                   Harald Sack (harald.sack@hpi.uni-potsdam.de)

13 Nov. 2012   Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   21
References
     [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012),
          http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html
     [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity
          Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N.,
          Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg
          (2011)
     [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special
          issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408.
          DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927
     [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools
          and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1
     [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data
          movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of
          Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012)
     [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in
          recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems,
          RecSys 2011. ACM, New York (2011)



13 Nov. 2012      Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston   22

Más contenido relacionado

Último

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Último (20)

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Destacado

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Destacado (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Evaluating Entity Summarization Using a Game-Based Ground Truth

  • 1. Evaluating Entity Summarization Using a Game-Based Ground Truth Andreas Thalhammer¹, Magnus Knuth², and Harald Sack² ¹ University of Innsbruck, Austria 13 Nov. 2012 ISWC 2012 Boston ² Hasso Plattner Institute Potsdam, Germany
  • 2. Google: “Get the best summary” [1] • Inglourious Basterds (Movie) • Freebase: 1279 triples • DBpedia: 217 triples • Google Knowledge Graph summary: 14 triples 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 2
  • 3. Entity Summarization • First attempt towards a definition: “... not just represent the main themes of the original data, but rather, can best identify the underlying entity” [2] Is this a good definition? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 3
  • 4. Entity Summarization (cont.) “A summary can be loosely defined as a text that is produced from one or more texts, that conveys important information in the original text(s), and that is no longer than half of the original text(s) and usually significantly less than that.” [3] A summary is • short • and conveys important information. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 4
  • 5. Entity Summarization (cont.) • Our (loose) definition: “Entity summarization is the task of producing a summary that conveys important facts about the entity while reducing the number of shown facts significantly.” 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 5
  • 6. The Problem: Evaluation • How do we make different summarization systems comparable? Sub-question: • How do we grasp the idea of “important facts”? 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 6
  • 7. Related Work • RELIN: Relatedness and Informativeness-based Centrality for Entity Summarization [2] – Intrinsic: 24 users compiled summaries of 149 entities (forming a gold standard) (Intersection-based similarity) – Extrinsic: 47 pairs of FB and DBpedia entities were selected (24 correct ones, 23 incorrect ones).  Users judge whether pairs are correct or not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 7
  • 8. Related Work (cont.) • Towards exploratory video search using linked data [4] – Quantitative evaluation of heuristics  Ground truth, containing 115 entities summarized by 72 users. – Precision/Recall similarity measure 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 8
  • 9. Related Work (cont.) • It is hard to find participants. • Generating summaries is a cumbersome process. • Only a subset of property-value pairs are ranked by the users. • Up to this point, none of the two evaluation datasets is publically available. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 9
  • 10. Our Idea • Important facts are commonly known • Unimportant facts are rarely known • How to find out?  Linked Data quiz game! 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 10
  • 11. Hypothesis “A game-based ground truth is suitable for evaluating the performance of summarization approaches in the movie domain” Assumption: implemented approaches correlate with the game-based ground truth while random summaries do not. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 11
  • 12. Dataset • 60 arbitrary selected movies from IMDb Top250 • RDF descriptions from Freebase • Usage of a property white list • Triple store: Ontotext’s OWLIM with OWL2-RL reasoning enabled. • Property chains: <http://some-name.space/hasActor> <http://www.w3.org/2002/07/owl#propertyChainAxiom> ( <http://rdf.freebase.com/ns/film.film.starring> <http://rdf.freebase.com/ns/film.performance.actor> ). All data is available at: http://yovisto.com/labs/iswc2012 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 12
  • 13. WhoKnows?Movies! S P O :The_Princess_Bride prop:actor :Billy_Crystal, ... :Braveheart prop:actor :Mel_Gibson, ... :Pulp_Fiction prop:actor :John_Travolta . • Question types: - One-to-One - One-to-N • Questions are composed upside down: ‘Object is the property of subject1, subject2, subject3’ Play the game at: http://bit.ly/WhoKnowsMovies 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 13
  • 14. Frequency == Importance ??? word upper lower • Information retrieval: frequency cut-off cut-off – Luhn (1958): “resolving power of words” [5] ranking by word frequency • Game supports half-knowledge in general – e.g. which movie was released 1994? Monsters, Inc. – Pulp Fiction – Casablanca – ... but the human brain performs better with pictures (actors), sounds (film music), ... 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 14
  • 15. Evaluated Systems • UBES (Usage-based Entity Summarization) [5] – Combine Freebase with HetRec2011 MovieLens2k [6] – Use item-based collaborative filtering to form neighborhoods for each movie – Find out which property-value pairs a movie shares with its neighbors – Use a TF-IDF related weighting scheme Bob Alice Marc Elena John Mary Pulp Fiction 1 0 1 0 1 1 Heat 0 0 1 1 0 0 Kill Bill 1 0 1 0 1 0 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 15
  • 16. Evaluated Systems (cont.) • GKG (Google Knowledge Graph) [1] – Enables semi-automatic transformation to Freebase /search?hl=en&q=quentin+tarantino& stick=H4sIAAAAAAAAAONgVuLQz9U3MLM0zgEA_ sQyxwwAAAA& sa=X&ei=FnjTT7rXN8jftAaAhPWIDw& ved=0CKwBEJsTKAA – base64 + gzip  /m/0693l http://www.freebase.com/view/m/0693l redirects to: http://www.freebase.com/view/en/quentin_tarantino 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 16
  • 17. Results • 690 sessions, 8308 questions • 217 players (135 players played only once) • 2314 of 2829 triples were played more than 3 times 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 17
  • 18. Result: Kendall’s τ • Property ranking: • Feature (property-value) ranking: 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 18
  • 19. Conclusion • The results indicate that a game-based ground truth is suitable for evaluating entity summarization. • The current dataset is too sparse to make valid assumptions about the importance of single facts. 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 19
  • 20. Future Work • Increase the number of players • Score the exclusion principle • Increase the number of movies • Application to additional domains • Publish new versions of the evaluation dataset on a regular basis 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 20
  • 21. Questions? Help collecting data: http://bit.ly/WhoKnowsMovies Andreas Thalhammer (andreas.thalhammer@sti2.at) Magnus Knuth (magnus.knuth@hpi.uni-potsdam.de) Harald Sack (harald.sack@hpi.uni-potsdam.de) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 21
  • 22. References [1] Singhal, A.: Introducing the knowledge graph: things, not strings (2012), http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html [2] Cheng, G., Tran, T., Qu, Y.: RELIN: Relatedness and Informativeness-Based Centrality for Entity Summarization. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 114–129. Springer, Heidelberg (2011) [3] Dragomir R. Radev, Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special issue on summarization. Comput. Linguist. 28, 4 (December 2002), 399-408. DOI=10.1162/089120102762671927 http://dx.doi.org/10.1162/089120102762671927 [4] Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools and Applications 59, 645–672 (2012), 10.1007/s11042-011-0733-1 [5] Thalhammer, A., Toma, I., Roa-Valverde, A.J., Fensel, D.: Leveraging usage data for linked data movie entity summarization. In: Proc. of the 2nd Int. Ws. on Usage Analysis and the Web of Data (USEWOD 2012) co-located with WWW 2012, Lyon, France, vol. abs/1204.2718 (2012) [6] Cantador, I., Brusilovsky, P., Kuflik, T.: 2nd ws. on information heterogeneity and fusion in recommender systems (hetrec 2011). In: Proc. of 5th ACM Conf. on Recommender systems, RecSys 2011. ACM, New York (2011) 13 Nov. 2012 Evaluating Entity Summarization Using a Game-Based Ground Truth. ISWC 2012, Boston 22