SlideShare a Scribd company logo
1 of 11
Nervo Verdezoto
             University of Trento
      nervo.verdezoto@studenti.unitn.it


  Prof. Laure Vieu and Prof. Alessandro Oltramari
                      Tutors
Application of formal ontology and semantic
 techniques to improve the coherence and
       usability of lexical resources


                                           Master HLTI 2009-2010
Outline

   Objectives and Tasks
      – Data
      – Ontological Principles
      – Experiments
      – Results
   Manual Analysis and discussion
   Summary


                                     Master HLTI 2009-2010
Objectives

•   Get familiar with Ontology-driven
    Conceptual Modeling
•   Develop semi-automatic methods to
    spot semantic/ontological problems in
    WordNet at lower levels
•   Get familiar with scientific reporting



                                    Master HLTI 2009-2010
Tasks


    Study WordNet semantic relations to spot ontological
    problems

    Applications:
    
         RTE
    
         Automatic detection of part-whole relations e.g.
        (atmospheric phenomenon, communication), (shape, artifact),
        (shape, physical phenomenon)




                                                            Master HLTI 2009-2010
The Data

      WordNet: 82115 synsets were examined to collect the initial data, 22187 were
      involved in meronyms and holonyms relations (50% meronyms – 50% holonyms)
    
      Semeval 2007: 89 pairs relations were extracted.

      Additionally, we eliminated the redundant pairs from initial data.


                                       MERONYMS
                      14000


                      12000


                      10000


                       8000                                # PAIRS –
                                                           MERONYMS
                       6000


                       4000


                       2000


                          0
                              MEMBER    PART   SUBSTANCE




                                                                       Master HLTI 2009-2010
Ontological Principles

•   Constraints: part and whole should be of a
    similar nature.
•   DOLCE-ontological distinctions between:
     –   endurants (ED) or physical entities (like a
         dog, a table, a cave, etc.)
     –   perdurants (PD) or eventualities (like a
         lecture, a sleep, a raining, etc.)
     –   abstract (AB, entities like a number, the
         content of a text, etc.).


                                             Master HLTI 2009-2010
Experiments – Tests
     [defining queries]

•   Semantic Constraints
     –    Test 0: Individual – Class pairs:
             •    (great_divide%1:15:00,continental_divide%1:15:00)
     –    Test 4: Meronymy – Member and Member–Collection
          pairs:
             •    (coronal%1:06:00, rose%1:20:00)

•   Ontological Constraints
     –    Test 1: ED–AB (test 1.1) or AB–ED (test 1.2)
             •    Test 1.1: physical entity 1:03:00 (but not process 1:03:00) / abstraction 1:03:00 (but not event
                  1:03:00 + state 1:03:00. (head%1:06:04::,coin%1:21:02::)
     –    Test 2: ED–PD (test 2.1) or PD–ED (test 2.2)
             •    Test 2.1 , physical entity 1:03:00 (but not process 1:03:00) / process 1:03:00 + event 1:03:00 +
                  state 1:03:00. ⟨air%1:27:00, wind%1:19:00⟩

     –    Test 3: PD–AB (test 3.1) or AB–PD (test 3.2)
             •    Test 3.1 , abstraction 1:03:00 – but not event 1:03:00 + state 1:03:00(first all and then without
                  group) / event 1:03:00 + state 1:03:00 + process 1:03:00. ⟨regulation time%1:28:00, athletic
                  game%1:04:00⟩
                                                                                                 Master HLTI 2009-2010
Results
                           Ontological Problems

  180
        163
  160


  140


  120                                             108

  100                                                            WORDNET

                                                                 SEMEVAL
   80


   60
                             45

   40


   20
                       2                   2
   0

              Test 1              Test 2                Test 3




                           Ontological Problems

  180
        163
  160

  140

  120                                             108

  100                                                            W ORDNET

                                                                 SEMEVAL
   80

   60                        45

   40

   20
                       2                   2
   0

              Test 1              Test 2                Test 3



                                                                            Master HLTI 2009-2010
Manual Analysis and discussion

General Errors
•       a synset is considered as a class but should be an individual
    –      Confusion between class and an instance of this class for which the term is used with a specific
           sense e.g., ⟨great_divide%1:15:00,continental_divide%1:15:00⟩
    –      Confusion between class and group e.g., new_testament%1:10:00
•       a synset is not attached to the right place in the taxonomy
    –      Confusion between a property and a physical entity having that property (shape, quantity or
           measure, location) or between a relation and a physical entity being an argument in that relation
           e.g., coin%1:21:02, hay_mow%1:23:00 - calyx%1:20:00, mothball%1:06:00
•       a synset mixes two senses, and the missing sense should be attached elsewhere in the
        taxonomy or this missing sense is an individual, not a class
    –      Confusion between 2 senses of a word, amounting to a missing sense e.g.
           ⟨ethiopian%1:18:00, ethiopia%1:15:00⟩
•     the meronymy relation is wrong
    –    Confusion between meronymy and other relations (location, participation, etc.):
           •   “is located in” - ⟨balkan_wars%1:04:00, balkan_peninsula%1:15:00⟩
           •   “participates in” - ⟨feminist%1:18:00,feminist_movement%1:04:00⟩




                                                                                             Master HLTI 2009-2010
Summary and future work

•       An automatic query system based on ontological
        principles and semantic constraints is effective to build
        semi-automatic methods to spot errors in WordNet
•       Increase the number and type of experiments
•       Exploit the results of this study to:
    –       Develop a semi-automatic tool for ”cleaning-up” WordNet
    –       Design and develop guidelines to help lexicographers
            (Christiane Fellbaum from Princeton WordNet Group) to
            prevent classical ontological mistakes
    –       Evaluation for NLP applications




                                                          Master HLTI 2009-2010
THANK YOU


Nervo Verdezoto D.
                     Master HLTI 2009-2010

More Related Content

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Application of formal ontology and semantic techniques to improve the coherence and usability of lexical resources

  • 1. Nervo Verdezoto University of Trento nervo.verdezoto@studenti.unitn.it Prof. Laure Vieu and Prof. Alessandro Oltramari Tutors Application of formal ontology and semantic techniques to improve the coherence and usability of lexical resources Master HLTI 2009-2010
  • 2. Outline  Objectives and Tasks – Data – Ontological Principles – Experiments – Results  Manual Analysis and discussion  Summary Master HLTI 2009-2010
  • 3. Objectives • Get familiar with Ontology-driven Conceptual Modeling • Develop semi-automatic methods to spot semantic/ontological problems in WordNet at lower levels • Get familiar with scientific reporting Master HLTI 2009-2010
  • 4. Tasks  Study WordNet semantic relations to spot ontological problems  Applications:  RTE  Automatic detection of part-whole relations e.g. (atmospheric phenomenon, communication), (shape, artifact), (shape, physical phenomenon) Master HLTI 2009-2010
  • 5. The Data  WordNet: 82115 synsets were examined to collect the initial data, 22187 were involved in meronyms and holonyms relations (50% meronyms – 50% holonyms)  Semeval 2007: 89 pairs relations were extracted.  Additionally, we eliminated the redundant pairs from initial data. MERONYMS 14000 12000 10000 8000 # PAIRS – MERONYMS 6000 4000 2000 0 MEMBER PART SUBSTANCE Master HLTI 2009-2010
  • 6. Ontological Principles • Constraints: part and whole should be of a similar nature. • DOLCE-ontological distinctions between: – endurants (ED) or physical entities (like a dog, a table, a cave, etc.) – perdurants (PD) or eventualities (like a lecture, a sleep, a raining, etc.) – abstract (AB, entities like a number, the content of a text, etc.). Master HLTI 2009-2010
  • 7. Experiments – Tests [defining queries] • Semantic Constraints – Test 0: Individual – Class pairs: • (great_divide%1:15:00,continental_divide%1:15:00) – Test 4: Meronymy – Member and Member–Collection pairs: • (coronal%1:06:00, rose%1:20:00) • Ontological Constraints – Test 1: ED–AB (test 1.1) or AB–ED (test 1.2) • Test 1.1: physical entity 1:03:00 (but not process 1:03:00) / abstraction 1:03:00 (but not event 1:03:00 + state 1:03:00. (head%1:06:04::,coin%1:21:02::) – Test 2: ED–PD (test 2.1) or PD–ED (test 2.2) • Test 2.1 , physical entity 1:03:00 (but not process 1:03:00) / process 1:03:00 + event 1:03:00 + state 1:03:00. ⟨air%1:27:00, wind%1:19:00⟩ – Test 3: PD–AB (test 3.1) or AB–PD (test 3.2) • Test 3.1 , abstraction 1:03:00 – but not event 1:03:00 + state 1:03:00(first all and then without group) / event 1:03:00 + state 1:03:00 + process 1:03:00. ⟨regulation time%1:28:00, athletic game%1:04:00⟩ Master HLTI 2009-2010
  • 8. Results Ontological Problems 180 163 160 140 120 108 100 WORDNET SEMEVAL 80 60 45 40 20 2 2 0 Test 1 Test 2 Test 3 Ontological Problems 180 163 160 140 120 108 100 W ORDNET SEMEVAL 80 60 45 40 20 2 2 0 Test 1 Test 2 Test 3 Master HLTI 2009-2010
  • 9. Manual Analysis and discussion General Errors • a synset is considered as a class but should be an individual – Confusion between class and an instance of this class for which the term is used with a specific sense e.g., ⟨great_divide%1:15:00,continental_divide%1:15:00⟩ – Confusion between class and group e.g., new_testament%1:10:00 • a synset is not attached to the right place in the taxonomy – Confusion between a property and a physical entity having that property (shape, quantity or measure, location) or between a relation and a physical entity being an argument in that relation e.g., coin%1:21:02, hay_mow%1:23:00 - calyx%1:20:00, mothball%1:06:00 • a synset mixes two senses, and the missing sense should be attached elsewhere in the taxonomy or this missing sense is an individual, not a class – Confusion between 2 senses of a word, amounting to a missing sense e.g. ⟨ethiopian%1:18:00, ethiopia%1:15:00⟩ • the meronymy relation is wrong – Confusion between meronymy and other relations (location, participation, etc.): • “is located in” - ⟨balkan_wars%1:04:00, balkan_peninsula%1:15:00⟩ • “participates in” - ⟨feminist%1:18:00,feminist_movement%1:04:00⟩ Master HLTI 2009-2010
  • 10. Summary and future work • An automatic query system based on ontological principles and semantic constraints is effective to build semi-automatic methods to spot errors in WordNet • Increase the number and type of experiments • Exploit the results of this study to: – Develop a semi-automatic tool for ”cleaning-up” WordNet – Design and develop guidelines to help lexicographers (Christiane Fellbaum from Princeton WordNet Group) to prevent classical ontological mistakes – Evaluation for NLP applications Master HLTI 2009-2010
  • 11. THANK YOU Nervo Verdezoto D. Master HLTI 2009-2010