SlideShare una empresa de Scribd logo
1 de 20
Improving Automatic Semantic Tag Recommendation
            through Fuzzy Ontologies


                 Panos Alexopoulos, Manolis Wallace

 7th International Workshop Semantic and Social Media Adaptation and
                            Personalization

                  Luxembourg, December 3-4, 2012
Agenda

   Introduction
       Problem Definition and Paper
         Focus
       Approach Overview and Rationale

   Proposed Framework
      Tagging Evidence Model
      Tagging Process

   Framework Evaluation
      Evaluation Process
      Evaluation Results

   Conclusions and Future Work

                                          2
Introduction
                                                                        Problem Definition
Semantic Tagging

 ● Semantic tagging involves identifying and assigning to texts appropriate entities that
   reflect what the document actually talks about.

 ● One important challenge in this task is the correct distinction between the entities that
   play a central role to the document’s meaning and those that are just complementary
   to it.

 ● For example, consider the following text:

     ● “Annie Hall is a much better movie than Deconstructing Harry, mainly
       because Alvy Singer is such a well formed character and Diane Keaton gives
       the performance of her life”.

     ● The text mentions two films, yet the one it actually talks about is only “Annie
       Hall”, meaning that only this is an appropriate tag.



                                                                                               3
Introduction
                                                                       Problem Definition
Semantic Tagging

 ● A second challenge is the inference of appropriate tags even when these are not
   explicitly mentioned within the text.

 ● For example:

     ● “In June 1863, Colonel James Montgomery commanded a brigade in operations
       along the coast resembling his earlier Jayhawk raids. The most famous of his
       controversial operations was the Raid at Combahee Ferry in which 800 slaves
       were liberated with the help of Harriett Tubman”.

     ● The text describes a historical battle which took place in Beaufort County,
       South Carolina.

     ● This means that this location is an important geographical tag for the text, yet it
       is not explicitly mentioned within it.



                                                                                             4
Introduction
                                                                                Paper Focus

● In a previous work we have already proposed a framework for semantic tagging
  through the exploitation of domain ontologies.

● The ontologies describe the domain(s) of the texts to be tagged and their entities
  serve as a source of possible tags for them.

● The key idea is that a given ontological entity is more likely to represent the text’s
  meaning (and thus be an accurate tag) when there are many ontologically related
  to it entities in the text.

● In this paper we revise and extend the above framework so as to enable it to exploit
  also fuzzy ontological information.

● Our assumption is that the fuzziness that may characterize some of the ontology’s
  relations can increase the evidential power of its entities and consequently the
  effectiveness of the tag recommendation process



                                                                                             5
Proposed Framework
                                                   Approach Overview and Rationale

● Our existing framework targets the task of semantic tagging based on the intuition
  that a given ontological entity is more likely to represent the meaning the text when
  there are many ontologically related to it entities in the text.

● E.g. in the example text the entities “Alvy Singer” and “Diane Keaton” indicate
  that the text is about the film “Annie Hall”.

● That is because Alvy Singer is a character of this film and Diane Keaton an actor of
  it.

● All these entities and the relations between them are derived from one or more
  domain ontologies.




                                                                                          6
Proposed Framework
                                                  Approach Overview and Rationale

● The extension we propose to this framework has to do with considering, where
  possible, fuzzy relations between entities rather than crisp ones.

● Fuzzy relations allow the assignment of truth degrees to vague ontological relations
  in an effort to quantify their vagueness.

● E.g. Instead of ‘‘Annie Hall is a comedy” we may say that “Annie Hall is a
  comedy to a degree of 0.7”

● Similarly, we may say that “Woody Allen is an expert director at human
  relations to a degree of 0.8”.

● Thus, by using a fuzzy ontology one can represent useful semantic information for
  the tag recommendation task in a higher level of granularity than with a crisp
  ontology.




                                                                                         7
Proposed Framework
                                                   Approach Overview and Rationale

● For example, in the film domain, instead of having just the relation
  hasPlayedInFilm it is more useful to have the fuzzy relation
  wasAnImportantActorInFilm.

    ● E.g. “Robert Duvall was an important actor in Apocalypse Now to a degree of
      0.6”.

● To see why this is useful consider the text “Robert Duvall’s brilliant performance in
  the film showed that his choice by Francis Ford Copola was wise”.

● If Duvall and Copola have collaborated in more than one film but in only one of
  them Duval had a major role (as captured by the fuzzy degree of his relation to the
  film) then this film is more likely to be the subject of this text.




                                                                                          8
Proposed Framework
                                                              Framework Components

● Our proposed framework assumes the availability of a fuzzy ontology for the domain
  of the texts to be tagged and defines two components:

    ● A Tag Fuzzy Ontological Evidence Model that contains entities that may
      serve as tag-related evidence for the application scenario and domain at hand.

             ● Each entity is assigned evidential power degrees which denote its
               usefulness as evidence for the tag recommendation task.


    ● A Tag Recommendation Process that uses the evidence model to determine,
      for a given text, the ontological entities that potentially represent its content.

             ● A confidence score for each entity is used to denote the most probable
               tags.




                                                                                           9
Proposed Framework
                                                                 Tagging Evidence Model

● Defines for each ontology entity which other instances and to what extent should be
  used as evidence towards the correct determination of the texts’ tags.

● It consists of entity pairs where a particular entity provides quantified evidence for a
  another one.




                                                                                             10
Proposed Framework
                                                         Evidence Model Construction

● Construction of the evidence model depends on the characteristics of the domain and
  the texts.

● The first step of the construction is manual and involves:

    ● The identification of the concepts whose instances are expected to be used as
      tags (e.g. military conflicts, films etc.)

    ● The determination, for each of these concepts, of the related to them concepts
      whose instances may serve as tag evidence:

              ● For example, in texts that review films, some concepts whose instances
                may act as tag evidence are related directors, actors and characters.

    ● The identification, for each pair of evidence and target concept, of the fuzzy
      relation paths that links them.



                                                                                         11
Proposed Framework
                                                           Evidence Model Construction

● The result of this first step is a tag evidence concept mapping like the following:




● This mapping is typically small so its manual construction is not difficult.




                                                                                           12
Proposed Framework
                                                         Evidence Model Construction

● Based on such mappings, the second step of the construction is automatic and
  involves the generation of the tag-evidence entity pairs along with a tag evidential
  strength.

● This strength is:
    ● Proportional to the fuzzy degree of the relation linking the evidence entity with
        the tag.
    ● Inversely proportional to the evidential entity’s own ambiguity as well as to the
        number and fuzzy degrees of the other tags it provides evidence for.

● For example, “Woody Allen” provides evidence for the film “Annie Hall” to a strength
  of 0.02 because he has directed many other films while the character “Alvy Singer”
  has evidential strength of 1 as it appears only on this film.




                                                                                          13
Proposed Framework
                                                        Tag Recommendation Process

● Step 1: We extract from the text the terms that possibly refer to tag entities.

● Step 2: We extract from the text the terms that possibly refer to evidential entities

● Step 3: We consider as candidate tag entities not only those found within the text but
  practically all those that are related to instances of the evidential concepts in the
  ontology.

    ● E.g. If we find the term “Woody Allen” then all his films are candidate tag
      entities.

● Step 4: Using the evidence model of the previous slide we compute for each
  candidate tag entity the confidence that it actually represents the text’s meaning.

● Note: The evidence model is assumed to have been calculated offline and stored in
  an index, so as to make the above process more efficient.



                                                                                           14
Framework Evaluation
                                                                    Evaluation Process
Description

 ● Two tagging scenarios:
     ● Film reviews.
     ● Texts describing military conflicts.

 ● Fuzzy ontologies for both domains, based on the manual fuzzification of a small
   portion of DBPedia and Freebase semantic data.

 ● Effectiveness was measured by determining the number of correctly tagged texts,
   namely texts whose highest ranked tags were the correct ones




                                                                                       15
Framework Evaluation
                                                                    Evaluation Process
Film Reviews Scenario                         Miltary Conflicts Scenario

 ● 100 texts describing 20 distinct films      ● 100 texts describing 100 miltary
   that were similar to each other in terms      conflicts that were similar to each other
   of genre, actors and directors and thus       in terms of participants and places
   more difficult to distinguish between
   them in a given review.

 ● Fuzzy Film Ontology:                        ● Fuzzy Conflict Ontology:

     ● Concepts: Film, Actor, Director,             ● Concepts: Location, Military
       Character                                      Conflict, Military Person

     ● Relations:                                   ● Relations:
       wasAnImportantActorInFilm,                     tookPlaceNearLocation,
       isFamousForDirectingFilm,                      wasAnImportantPartOfConflict,
       wasCharacterInFilm.                            playedMajorRoleInConflict,
                                                      isNearToLocation

                                                                                         16
Framework Evaluation
                                                                 Evaluation Results

● We measured tagging effectiveness by determining the number of correctly tagged
  texts, namely texts whose highest ranked films were the correct ones.

● For comparison purposes, we performed the same process using a crisp version of
  the ontologies.

● Results:




                                                                                    17
Conclusions and Future Work
                                                                                 Key Points

● We proposed a novel framework that exploits fuzzy semantic information for
  automatically generating semantic tags for text documents

● This had two challenges:
    ● Distinguishing correctly between the entities that play a central role to the
        document’s meaning and those that are just complementary to it.
    ● Inferring appropriate tags even when these are not explicitly mentioned within
        the text.

● Our approach has been based on the customized utilization of fuzzy domain-
  specific ontological relations for extracting and evaluating tag “evidence” from within
  the text

● The added value that the consideration and exploitation of fuzziness brought to the
  tag recommendation task was experimentally verified through experiments in
  different domains



                                                                                            18
Conclusions and Future Work
                                                                 Framework Extensions


● One important obstacle for the wider applicability of our approach is the bottleneck of
  acquiring (through development or reuse) the required fuzzy ontological information
  for the domain at hand.

● For that reason, our future work will focus on determining automated methods for
  fuzzifying crisp ontological facts:

    ● Data mining
    ● Social network analysis
    ● Crowdsourcing




                                                                                            19
Thank you!
                                                                                       Contact iSOCO


                                            Dr. Panos Alexopoulos
                                               Senior Researcher
                                           palexopoulos@isoco.com




                                                Questions?


Barcelona                   Madrid                          Pamplona              Valencia
Tel +34 935 677 200         Tel +34 913 349 797             Tel +34 948 102 408   Tel +34 963 467 143
Edificio Testa A            Av. del Partenón, 16-18, 1º7ª   Parque Tomás          Oficina 107
C/ Alcalde Barnils, 64-68   Campo de las Naciones           Caballero, 2, 6º-4ª   C/ Prof. Beltrán Báguena, 4
St. Cugat del Vallès        28042 Madrid                    31006 Pamplona        46009 Valencia
08174 Barcelona




                                                                                                                20

Más contenido relacionado

Similar a Improving Automatic Semantic Tag Recommendation through Fuzzy Ontologies

Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligenceVikasGhelani1
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSsipij
 
Object And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesObject And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesJessica Deakin
 
Conceptual Interoperability and Biomedical Data
Conceptual Interoperability and Biomedical DataConceptual Interoperability and Biomedical Data
Conceptual Interoperability and Biomedical DataJim McCusker
 
Question answer template
Question answer templateQuestion answer template
Question answer templateThanuw Chaks
 
Formalization and implementation of BFO 2 with a focus on the OWL implementation
Formalization and implementation of BFO 2 with a focus on the OWL implementationFormalization and implementation of BFO 2 with a focus on the OWL implementation
Formalization and implementation of BFO 2 with a focus on the OWL implementationgolpedegato2
 
A Survey of Object Oriented Programming LanguagesMaya Hris.docx
A Survey of Object Oriented Programming LanguagesMaya Hris.docxA Survey of Object Oriented Programming LanguagesMaya Hris.docx
A Survey of Object Oriented Programming LanguagesMaya Hris.docxdaniahendric
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsAndre Freitas
 
Utterance Topic Model for Generating Coherent Summaries
Utterance Topic Model for Generating Coherent SummariesUtterance Topic Model for Generating Coherent Summaries
Utterance Topic Model for Generating Coherent SummariesContent Savvy
 
Ontology quality, ontology design patterns, and competency questions
Ontology quality, ontology design patterns, and competency questionsOntology quality, ontology design patterns, and competency questions
Ontology quality, ontology design patterns, and competency questionsNicola Guarino
 
types of analyses of research
types of analyses of researchtypes of analyses of research
types of analyses of researchssuser1ee781
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviewsfirzhan naqash
 
Contextualized Scene Knowledge Graphs for XAI Benchmarking
Contextualized Scene Knowledge Graphs for XAI BenchmarkingContextualized Scene Knowledge Graphs for XAI Benchmarking
Contextualized Scene Knowledge Graphs for XAI BenchmarkingKnowledgeGraph
 

Similar a Improving Automatic Semantic Tag Recommendation through Fuzzy Ontologies (17)

Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
 
Object And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesObject And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) Languages
 
Conceptual Interoperability and Biomedical Data
Conceptual Interoperability and Biomedical DataConceptual Interoperability and Biomedical Data
Conceptual Interoperability and Biomedical Data
 
Models and Ontologies: differences
Models and Ontologies: differencesModels and Ontologies: differences
Models and Ontologies: differences
 
Question answer template
Question answer templateQuestion answer template
Question answer template
 
Formalization and implementation of BFO 2 with a focus on the OWL implementation
Formalization and implementation of BFO 2 with a focus on the OWL implementationFormalization and implementation of BFO 2 with a focus on the OWL implementation
Formalization and implementation of BFO 2 with a focus on the OWL implementation
 
A Survey of Object Oriented Programming LanguagesMaya Hris.docx
A Survey of Object Oriented Programming LanguagesMaya Hris.docxA Survey of Object Oriented Programming LanguagesMaya Hris.docx
A Survey of Object Oriented Programming LanguagesMaya Hris.docx
 
Effective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP SystemsEffective Semantics for Engineering NLP Systems
Effective Semantics for Engineering NLP Systems
 
Utterance Topic Model for Generating Coherent Summaries
Utterance Topic Model for Generating Coherent SummariesUtterance Topic Model for Generating Coherent Summaries
Utterance Topic Model for Generating Coherent Summaries
 
Narrative and genre
Narrative and genreNarrative and genre
Narrative and genre
 
Ontology quality, ontology design patterns, and competency questions
Ontology quality, ontology design patterns, and competency questionsOntology quality, ontology design patterns, and competency questions
Ontology quality, ontology design patterns, and competency questions
 
types of analyses of research
types of analyses of researchtypes of analyses of research
types of analyses of research
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
 
Contextualized Scene Knowledge Graphs for XAI Benchmarking
Contextualized Scene Knowledge Graphs for XAI BenchmarkingContextualized Scene Knowledge Graphs for XAI Benchmarking
Contextualized Scene Knowledge Graphs for XAI Benchmarking
 
2.7 Text Connections
2.7 Text Connections2.7 Text Connections
2.7 Text Connections
 
Ma
MaMa
Ma
 

Más de Panos Alexopoulos

How many truths can you handle?
How many truths can you handle?How many truths can you handle?
How many truths can you handle?Panos Alexopoulos
 
Troubleshooting and Optimizing Named Entity Resolution Systems in the Industry
Troubleshooting and Optimizing Named Entity Resolution Systems in the IndustryTroubleshooting and Optimizing Named Entity Resolution Systems in the Industry
Troubleshooting and Optimizing Named Entity Resolution Systems in the IndustryPanos Alexopoulos
 
Detecting, Measuring and Representing Vagueness in Ontologies
Detecting, Measuring and Representing Vagueness in OntologiesDetecting, Measuring and Representing Vagueness in Ontologies
Detecting, Measuring and Representing Vagueness in OntologiesPanos Alexopoulos
 
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in OntologiesA Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in OntologiesPanos Alexopoulos
 
Towards Vagueness-Aware Semantic Data
Towards Vagueness-Aware Semantic DataTowards Vagueness-Aware Semantic Data
Towards Vagueness-Aware Semantic DataPanos Alexopoulos
 
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven SummarizationTowards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven SummarizationPanos Alexopoulos
 
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...Panos Alexopoulos
 
Vagueness in Semantic Information Management
Vagueness in Semantic Information ManagementVagueness in Semantic Information Management
Vagueness in Semantic Information ManagementPanos Alexopoulos
 

Más de Panos Alexopoulos (9)

How many truths can you handle?
How many truths can you handle?How many truths can you handle?
How many truths can you handle?
 
Mind the Semantic Gap
Mind the Semantic GapMind the Semantic Gap
Mind the Semantic Gap
 
Troubleshooting and Optimizing Named Entity Resolution Systems in the Industry
Troubleshooting and Optimizing Named Entity Resolution Systems in the IndustryTroubleshooting and Optimizing Named Entity Resolution Systems in the Industry
Troubleshooting and Optimizing Named Entity Resolution Systems in the Industry
 
Detecting, Measuring and Representing Vagueness in Ontologies
Detecting, Measuring and Representing Vagueness in OntologiesDetecting, Measuring and Representing Vagueness in Ontologies
Detecting, Measuring and Representing Vagueness in Ontologies
 
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in OntologiesA Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
 
Towards Vagueness-Aware Semantic Data
Towards Vagueness-Aware Semantic DataTowards Vagueness-Aware Semantic Data
Towards Vagueness-Aware Semantic Data
 
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven SummarizationTowards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
 
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
 
Vagueness in Semantic Information Management
Vagueness in Semantic Information ManagementVagueness in Semantic Information Management
Vagueness in Semantic Information Management
 

Último

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Último (20)

Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Improving Automatic Semantic Tag Recommendation through Fuzzy Ontologies

  • 1. Improving Automatic Semantic Tag Recommendation through Fuzzy Ontologies Panos Alexopoulos, Manolis Wallace 7th International Workshop Semantic and Social Media Adaptation and Personalization Luxembourg, December 3-4, 2012
  • 2. Agenda  Introduction  Problem Definition and Paper Focus  Approach Overview and Rationale  Proposed Framework  Tagging Evidence Model  Tagging Process  Framework Evaluation  Evaluation Process  Evaluation Results  Conclusions and Future Work 2
  • 3. Introduction Problem Definition Semantic Tagging ● Semantic tagging involves identifying and assigning to texts appropriate entities that reflect what the document actually talks about. ● One important challenge in this task is the correct distinction between the entities that play a central role to the document’s meaning and those that are just complementary to it. ● For example, consider the following text: ● “Annie Hall is a much better movie than Deconstructing Harry, mainly because Alvy Singer is such a well formed character and Diane Keaton gives the performance of her life”. ● The text mentions two films, yet the one it actually talks about is only “Annie Hall”, meaning that only this is an appropriate tag. 3
  • 4. Introduction Problem Definition Semantic Tagging ● A second challenge is the inference of appropriate tags even when these are not explicitly mentioned within the text. ● For example: ● “In June 1863, Colonel James Montgomery commanded a brigade in operations along the coast resembling his earlier Jayhawk raids. The most famous of his controversial operations was the Raid at Combahee Ferry in which 800 slaves were liberated with the help of Harriett Tubman”. ● The text describes a historical battle which took place in Beaufort County, South Carolina. ● This means that this location is an important geographical tag for the text, yet it is not explicitly mentioned within it. 4
  • 5. Introduction Paper Focus ● In a previous work we have already proposed a framework for semantic tagging through the exploitation of domain ontologies. ● The ontologies describe the domain(s) of the texts to be tagged and their entities serve as a source of possible tags for them. ● The key idea is that a given ontological entity is more likely to represent the text’s meaning (and thus be an accurate tag) when there are many ontologically related to it entities in the text. ● In this paper we revise and extend the above framework so as to enable it to exploit also fuzzy ontological information. ● Our assumption is that the fuzziness that may characterize some of the ontology’s relations can increase the evidential power of its entities and consequently the effectiveness of the tag recommendation process 5
  • 6. Proposed Framework Approach Overview and Rationale ● Our existing framework targets the task of semantic tagging based on the intuition that a given ontological entity is more likely to represent the meaning the text when there are many ontologically related to it entities in the text. ● E.g. in the example text the entities “Alvy Singer” and “Diane Keaton” indicate that the text is about the film “Annie Hall”. ● That is because Alvy Singer is a character of this film and Diane Keaton an actor of it. ● All these entities and the relations between them are derived from one or more domain ontologies. 6
  • 7. Proposed Framework Approach Overview and Rationale ● The extension we propose to this framework has to do with considering, where possible, fuzzy relations between entities rather than crisp ones. ● Fuzzy relations allow the assignment of truth degrees to vague ontological relations in an effort to quantify their vagueness. ● E.g. Instead of ‘‘Annie Hall is a comedy” we may say that “Annie Hall is a comedy to a degree of 0.7” ● Similarly, we may say that “Woody Allen is an expert director at human relations to a degree of 0.8”. ● Thus, by using a fuzzy ontology one can represent useful semantic information for the tag recommendation task in a higher level of granularity than with a crisp ontology. 7
  • 8. Proposed Framework Approach Overview and Rationale ● For example, in the film domain, instead of having just the relation hasPlayedInFilm it is more useful to have the fuzzy relation wasAnImportantActorInFilm. ● E.g. “Robert Duvall was an important actor in Apocalypse Now to a degree of 0.6”. ● To see why this is useful consider the text “Robert Duvall’s brilliant performance in the film showed that his choice by Francis Ford Copola was wise”. ● If Duvall and Copola have collaborated in more than one film but in only one of them Duval had a major role (as captured by the fuzzy degree of his relation to the film) then this film is more likely to be the subject of this text. 8
  • 9. Proposed Framework Framework Components ● Our proposed framework assumes the availability of a fuzzy ontology for the domain of the texts to be tagged and defines two components: ● A Tag Fuzzy Ontological Evidence Model that contains entities that may serve as tag-related evidence for the application scenario and domain at hand. ● Each entity is assigned evidential power degrees which denote its usefulness as evidence for the tag recommendation task. ● A Tag Recommendation Process that uses the evidence model to determine, for a given text, the ontological entities that potentially represent its content. ● A confidence score for each entity is used to denote the most probable tags. 9
  • 10. Proposed Framework Tagging Evidence Model ● Defines for each ontology entity which other instances and to what extent should be used as evidence towards the correct determination of the texts’ tags. ● It consists of entity pairs where a particular entity provides quantified evidence for a another one. 10
  • 11. Proposed Framework Evidence Model Construction ● Construction of the evidence model depends on the characteristics of the domain and the texts. ● The first step of the construction is manual and involves: ● The identification of the concepts whose instances are expected to be used as tags (e.g. military conflicts, films etc.) ● The determination, for each of these concepts, of the related to them concepts whose instances may serve as tag evidence: ● For example, in texts that review films, some concepts whose instances may act as tag evidence are related directors, actors and characters. ● The identification, for each pair of evidence and target concept, of the fuzzy relation paths that links them. 11
  • 12. Proposed Framework Evidence Model Construction ● The result of this first step is a tag evidence concept mapping like the following: ● This mapping is typically small so its manual construction is not difficult. 12
  • 13. Proposed Framework Evidence Model Construction ● Based on such mappings, the second step of the construction is automatic and involves the generation of the tag-evidence entity pairs along with a tag evidential strength. ● This strength is: ● Proportional to the fuzzy degree of the relation linking the evidence entity with the tag. ● Inversely proportional to the evidential entity’s own ambiguity as well as to the number and fuzzy degrees of the other tags it provides evidence for. ● For example, “Woody Allen” provides evidence for the film “Annie Hall” to a strength of 0.02 because he has directed many other films while the character “Alvy Singer” has evidential strength of 1 as it appears only on this film. 13
  • 14. Proposed Framework Tag Recommendation Process ● Step 1: We extract from the text the terms that possibly refer to tag entities. ● Step 2: We extract from the text the terms that possibly refer to evidential entities ● Step 3: We consider as candidate tag entities not only those found within the text but practically all those that are related to instances of the evidential concepts in the ontology. ● E.g. If we find the term “Woody Allen” then all his films are candidate tag entities. ● Step 4: Using the evidence model of the previous slide we compute for each candidate tag entity the confidence that it actually represents the text’s meaning. ● Note: The evidence model is assumed to have been calculated offline and stored in an index, so as to make the above process more efficient. 14
  • 15. Framework Evaluation Evaluation Process Description ● Two tagging scenarios: ● Film reviews. ● Texts describing military conflicts. ● Fuzzy ontologies for both domains, based on the manual fuzzification of a small portion of DBPedia and Freebase semantic data. ● Effectiveness was measured by determining the number of correctly tagged texts, namely texts whose highest ranked tags were the correct ones 15
  • 16. Framework Evaluation Evaluation Process Film Reviews Scenario Miltary Conflicts Scenario ● 100 texts describing 20 distinct films ● 100 texts describing 100 miltary that were similar to each other in terms conflicts that were similar to each other of genre, actors and directors and thus in terms of participants and places more difficult to distinguish between them in a given review. ● Fuzzy Film Ontology: ● Fuzzy Conflict Ontology: ● Concepts: Film, Actor, Director, ● Concepts: Location, Military Character Conflict, Military Person ● Relations: ● Relations: wasAnImportantActorInFilm, tookPlaceNearLocation, isFamousForDirectingFilm, wasAnImportantPartOfConflict, wasCharacterInFilm. playedMajorRoleInConflict, isNearToLocation 16
  • 17. Framework Evaluation Evaluation Results ● We measured tagging effectiveness by determining the number of correctly tagged texts, namely texts whose highest ranked films were the correct ones. ● For comparison purposes, we performed the same process using a crisp version of the ontologies. ● Results: 17
  • 18. Conclusions and Future Work Key Points ● We proposed a novel framework that exploits fuzzy semantic information for automatically generating semantic tags for text documents ● This had two challenges: ● Distinguishing correctly between the entities that play a central role to the document’s meaning and those that are just complementary to it. ● Inferring appropriate tags even when these are not explicitly mentioned within the text. ● Our approach has been based on the customized utilization of fuzzy domain- specific ontological relations for extracting and evaluating tag “evidence” from within the text ● The added value that the consideration and exploitation of fuzziness brought to the tag recommendation task was experimentally verified through experiments in different domains 18
  • 19. Conclusions and Future Work Framework Extensions ● One important obstacle for the wider applicability of our approach is the bottleneck of acquiring (through development or reuse) the required fuzzy ontological information for the domain at hand. ● For that reason, our future work will focus on determining automated methods for fuzzifying crisp ontological facts: ● Data mining ● Social network analysis ● Crowdsourcing 19
  • 20. Thank you! Contact iSOCO Dr. Panos Alexopoulos Senior Researcher palexopoulos@isoco.com Questions? Barcelona Madrid Pamplona Valencia Tel +34 935 677 200 Tel +34 913 349 797 Tel +34 948 102 408 Tel +34 963 467 143 Edificio Testa A Av. del Partenón, 16-18, 1º7ª Parque Tomás Oficina 107 C/ Alcalde Barnils, 64-68 Campo de las Naciones Caballero, 2, 6º-4ª C/ Prof. Beltrán Báguena, 4 St. Cugat del Vallès 28042 Madrid 31006 Pamplona 46009 Valencia 08174 Barcelona 20