SlideShare una empresa de Scribd logo
1 de 19
Towards Vagueness-Aware Semantic Data
Panos Alexopoulos, Boris Villazon-Terrazas, Jeff Z. Pan
9th International Workshop in Uncertainty Reasoning for the Semantic Web
Sydney, Australia, October 21st, 2013
Vagueness & Semantic Data

Vagueness

●Semantic data and ontologies
● Semantic data: data annotated with ontological vocabulary
● Ontological vocabulary is defined in ontologies
● Ontologies meant to capture shared understanding
●What happens when vocabularies have blurred boundaries:
● What’s the threshold number of years separating old and not old
films?
● What are the exact criteria that distinguish modern restaurants
from non-modern?

2
Vagueness & Semantic Data

Examples of Vague Semantic Data

●Vague categorization relations where entities are assigned to
categories with no clear applicability criteria.
● “hasFilmGenre” (LinkedMDB and Freebase).
● “dbpedia-owl:ideology“ and “dbpedia-owl:movement“ (DBPedia)

●Specializations of concepts according to some vague property of
them.
● “Famous Person" and “Big Building" (Cyc Ontology)
● “Managerial Role” and “Competitor“ (Business Role Ontology)

3
Vagueness & Semantic Data

Vagueness

●Two main types:
● Quantitative: Lack of crisp applicability boundaries for the
predicate in one or more dimension (e.g., Tall, Rich, Recent etc.)
● Qualitative: Inability to define sufficient applicability criteria for
the predicate (e.g., Modern, Expert, Religion etc.)
●Additional characteristics:
● Subjectiveness: The same vague term can be differently
interpreted and/or applied by different users.
● Context dependence: The same vague term can be differently
interpreted or applied in different contexts (even if the user is the
same).
4
Vagueness & Semantic Data

Our Threefold Position

1. Vagueness may be found not only within isolated applicationspecific semantic data but also in public datasets that are meant
to be shareable and reusable (e.g. Linked Open Data).
2. This vagueness may hamper the comprehensibility and
shareability of these datasets and cause problems in a number of
different use case scenarios.

3. The negative effects vagueness may cause, can be partially tackled
by annotating the vague data elements with metainformation
that explicitly describes the vagueness's nature and characteristics.

5
Vagueness & Semantic Data

Vagueness Consequences

●The problem with vague terms in semantic data is that they may
cause disagreements among the people who develop it, maintain it or
use it.
●E.g., when we asked domain experts to provide instances of the
concept Critical Business Process, there were certain processes for
which there was a dispute among them about whether they should be
regarded as critical or not
●The problem was that different experts had different criteria of process
criticality and could not decide which of those criteria were sufficient
to classify a process as critical.
●In other words, qualitative vagueness!.
6
Vagueness & Semantic Data

Vagueness in Data Creation and Exploitation Scenarios

1. Structuring Data with a Vague Ontology: When domain experts
are asked to define instances of vague concepts and relations, then
disagreements may occur on whether particular entities are actually
instances of them or not.

2. Utilizing Vague Facts in Ontology-Based Systems: When
knowledge-based systems utilize vague facts as part of their
reasoning, then their output might not be optimal for its users when
the latter disagree with these facts.

7
Vagueness & Semantic Data

Vagueness Consequences in Use Case Scenarios

3. Integrating Vague Semantic Information: When semantic data
from different sources need to be merged in a single dataset then
the merging of particular vague elements can lead to data that will
not be valid for all its users.

4. Evaluating Vague Semantic Datasets for Reuse: When data
practitioners need to decide whether a particular dataset is suitable
for their needs, then the existence of vague elements in it can make
it difficult to assess a priori whether the data related to these
elements are valid for their application context.

8
Vagueness & Semantic Data

Rationale for Vagueness-Aware Data

●Vagueness-aware semantic data is “data whose vague ontological
elements are accompanied by comprehensive metainformation that
describes the nature and characteristics of their vagueness”.
●E.g., a useful piece of metainformation is the set of applicability
criteria that the element creator had in mind when defining the
element .
●When two datasets have the same vague concept, the knowledge of
these criteria can:
● Prevent their merging in case these criteria are different.
● Help a data practitioner decide which of these two concepts'
associated instances are more suitable for his/her application.
9
A Metaontology for Vagueness

Proposed Metainformation

Vagueness Type

E.g. the concept “Low Budget” has
quantitative vagueness while the
concept “Expert Consultant” qualitative.

Vagueness dimensions
and applicability criteria

“E.g. the term “Strategic Client" is
vague in the dimension of the
revenue it generates”

Applicability Context

E.g. “Company X is a strategic
client” only for the purposes of R&D
collaboration.

Element Creator

E.g. “Company X is a strategic
client” is asserted by the Financial
Manager.

10
A Metaontology for Vagueness

Metamodel Classes

11
A Metaontology for Vagueness

Metamodel Properties

12
Vagueness-Aware Semantic Data

Production & Consumption

●Our metamodel is meant to be used by both producers and
consumers of ontologies and semantic data.
● The former to annotate the vague part of their produced
ontologies with relevant vagueness metainformation
● The latter to query this metainformation and use it to make a
better use of the data.
●The annotation task is primarily manual and should take place in the
course of the ontology's construction and evolution process.
●Future research should focus on making production of vaguenessaware data easier and more efficient.
13
Vagueness-Aware Semantic Data

Examples of Annotated Vague Semantic Data
Vague Class with Type and Dimensions

Vague Relation with Type, Dimension and Context

Vague Axioms with Context and Creator

14
Vagueness-Aware Semantic Data

Supported Competency Questions

●What elements have been defined as vague?
●What elements have qualitative vagueness?
●What elements have quantitative vagueness, in what contexts and in
what dimensions?
●How many different applicability contexts does the relation
“isExpertAtResearch” have?

●Who asserts that “Accenture is a Competitor” and in what context?

15
Vagueness-Aware Semantic Data

Usage and Benefits
Metamodel Usage
• Communicate the meaning of the vague
elements to the domain experts.

Structuring Data with a Vague
Ontology

• Use the metamodel to characterize the created
data's vagueness.

Expected Benefits
• Make the job of the experts easier
and faster and reduce
disagreements among them.

• Check which data is vague.

Utilizing Vague Semantic Data
in an Ontology-Based System

• Use the properties of the vague elements to
provide vagueness-related explanations to the
users.

• Know a priori which data may
affect the system's effectiveness.

Integrating Vague Semantic
Datasets

• Compare same vague elements across
datasets according to their vagueness type
and dimensions

• Know a priori which data may
affect the system's effectiveness.

Evaluating Vague Semantic
Datasets for Reuse

• Query the metamodel to check the vagueness
compatibility of the dataset with the intended
domain and application scenario.

• Avoid re-using (parts of) datasets
that are not compatible to own
interpretation of vagueness.

16
Vagueness-Aware Semantic Data

Concluding Remarks

●The overall idea behind our proposed approach is that the
availability of vagueness metadata in ontologies and semantic data
will manage to reduce the high level of disagreement and increase
the low level of comprehensibility it may cause.
●In that sense, it can be a major step towards achieving better
common understanding, and thus shareability, of vague semantic
information in the Semantic Web.
●Moreover, our approach is complementary to any fuzzy ontology
related work, in the sense that it may be used to provide better
explanations on the intended meaning of fuzzy degrees.

17
Vagueness-Aware Semantic Data

Future Work

●We are currently working towards three directions:
● Express the metamodel in a more formal way.
● Evaluate its usefulness by means of qualtitative experiments that
will measure the level of increased semantic data
comprehensibility and shareability it may achieve.
● Facilitate and encourage the practical use of the metamodel
within the Semantic Data Lifecycle by means of relevant
guidelines, methods and tools.

18
Thank you!

Questions?

Dr. Panos Alexopoulos
Semantic Applications Research
Manager

Quieres
innovar?

palexopoulos@isoco.com
(t) +34 913 349 797

iSOCO Barcelona

iSOCO Madrid

iSOCO Pamplona

iSOCO Valencia

iSOCO Colombia

Av. Torre Blanca, 57
Edificio ESADE CREAPOLIS
Oficina 3C 15
08172 Sant Cugat del Vallès
Barcelona, España
(t) +34 935 677 200

Av. del Partenón, 16-18, 1º7ª
Campo de las Naciones
28042 Madrid
España
(t) +34 913 349 797

Parque Tomás
Caballero, 2, 6º4ª
31006 Pamplona
España
(t) +34 948 102 408

C/ Prof. Beltrán Báguena, 4
Oficina 107
46009 Valencia
España
(t) +34 963 467 143

Complejo Ruta N
Calle 67, 52-20
Piso 3, Torre A
Medellín
Colombia
(t) +57 516 7770 ext. 1132

Key Vendor
Virtual Assistant 2013

19

Más contenido relacionado

La actualidad más candente

295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
Zahid Azam
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
Ashish Kulkarni
 
Bba q&a study final white
Bba q&a study final whiteBba q&a study final white
Bba q&a study final white
Greg Sterling
 
Sentiment Analysis on Amazon Movie Reviews Dataset
Sentiment Analysis on Amazon Movie Reviews DatasetSentiment Analysis on Amazon Movie Reviews Dataset
Sentiment Analysis on Amazon Movie Reviews Dataset
Maham F'Rajput
 

La actualidad más candente (20)

NLP Ecosystem
NLP EcosystemNLP Ecosystem
NLP Ecosystem
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
ACM Hypertext and Social Media Conference Tutorial on Knowledge-infused Deep ...
ACM Hypertext and Social Media Conference Tutorial on Knowledge-infused Deep ...ACM Hypertext and Social Media Conference Tutorial on Knowledge-infused Deep ...
ACM Hypertext and Social Media Conference Tutorial on Knowledge-infused Deep ...
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
 
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentimentComparative Study on Lexicon-based sentiment analysers over Negative sentiment
Comparative Study on Lexicon-based sentiment analysers over Negative sentiment
 
Sentiment Analysis of Feedback Data
Sentiment Analysis of Feedback DataSentiment Analysis of Feedback Data
Sentiment Analysis of Feedback Data
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Methods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature StudyMethods for Sentiment Analysis: A Literature Study
Methods for Sentiment Analysis: A Literature Study
 
SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs ...
SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs ...SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs ...
SemEval-2017 Task 5: Fine-Grained Sentiment Analysis on Financial Microblogs ...
 
Supervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithmSupervised Sentiment Classification using DTDP algorithm
Supervised Sentiment Classification using DTDP algorithm
 
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
IRJET - Sentiment Analysis for Marketing and Product Review using a Hybrid Ap...
 
A review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigA review of sentiment analysis approaches in big
A review of sentiment analysis approaches in big
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
Sentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online ReviewsSentiment Features based Analysis of Online Reviews
Sentiment Features based Analysis of Online Reviews
 
Bba q&a study final white
Bba q&a study final whiteBba q&a study final white
Bba q&a study final white
 
Amazon Product Sentiment review
Amazon Product Sentiment reviewAmazon Product Sentiment review
Amazon Product Sentiment review
 
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
Deciphering AI - Unlocking the Black Box of AIML with State-of-the-Art Techno...
 
Project report
Project reportProject report
Project report
 
Sentiment Analysis on Amazon Movie Reviews Dataset
Sentiment Analysis on Amazon Movie Reviews DatasetSentiment Analysis on Amazon Movie Reviews Dataset
Sentiment Analysis on Amazon Movie Reviews Dataset
 

Similar a Towards Vagueness-Aware Semantic Data

Nlp and semantic_web_for_competitive_int
Nlp and semantic_web_for_competitive_intNlp and semantic_web_for_competitive_int
Nlp and semantic_web_for_competitive_int
KarenVacca
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
Alan Morrison
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
William Gunn
 
Mining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
Mining in Ontology with Multi Agent System in Semantic Web : A Novel ApproachMining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
Mining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
ijma
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
priestmanmable
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
sodhi3
 
Running Head Data Mining in The Cloud .docx
Running Head Data Mining in The Cloud                            .docxRunning Head Data Mining in The Cloud                            .docx
Running Head Data Mining in The Cloud .docx
healdkathaleen
 
If You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository ManagementIf You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository Management
Sarah Currier
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
butest
 

Similar a Towards Vagueness-Aware Semantic Data (20)

Nlp and semantic_web_for_competitive_int
Nlp and semantic_web_for_competitive_intNlp and semantic_web_for_competitive_int
Nlp and semantic_web_for_competitive_int
 
The FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdfThe FAIR data movement and 22 Feb 2023.pdf
The FAIR data movement and 22 Feb 2023.pdf
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Model evaluation in the land of deep learning
Model evaluation in the land of deep learningModel evaluation in the land of deep learning
Model evaluation in the land of deep learning
 
Knowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific SystemKnowledge Management in the AI Driven Scintific System
Knowledge Management in the AI Driven Scintific System
 
Mining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
Mining in Ontology with Multi Agent System in Semantic Web : A Novel ApproachMining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
Mining in Ontology with Multi Agent System in Semantic Web : A Novel Approach
 
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 
6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx6711SafeAssign Originality Report69Total S.docx
6711SafeAssign Originality Report69Total S.docx
 
A Survey On Ontology Agent Based Distributed Data Mining
A Survey On Ontology Agent Based Distributed Data MiningA Survey On Ontology Agent Based Distributed Data Mining
A Survey On Ontology Agent Based Distributed Data Mining
 
Extreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific CollaborationsExtreme-scale Identity Management for Scientific Collaborations
Extreme-scale Identity Management for Scientific Collaborations
 
data mining and data warehousing
data mining and data warehousingdata mining and data warehousing
data mining and data warehousing
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
 
dissertation proposal writing service
dissertation proposal writing servicedissertation proposal writing service
dissertation proposal writing service
 
Running Head Data Mining in The Cloud .docx
Running Head Data Mining in The Cloud                            .docxRunning Head Data Mining in The Cloud                            .docx
Running Head Data Mining in The Cloud .docx
 
Trajectory Data Fuzzy Modeling : Ambulances Management Use Case
Trajectory Data Fuzzy Modeling : Ambulances Management Use CaseTrajectory Data Fuzzy Modeling : Ambulances Management Use Case
Trajectory Data Fuzzy Modeling : Ambulances Management Use Case
 
If You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository ManagementIf You Tag it, Will They Come? Metadata Quality and Repository Management
If You Tag it, Will They Come? Metadata Quality and Repository Management
 
Poster Abstracts
Poster AbstractsPoster Abstracts
Poster Abstracts
 
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
 
Causal networks, learning and inference - Introduction
Causal networks, learning and inference - IntroductionCausal networks, learning and inference - Introduction
Causal networks, learning and inference - Introduction
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 

Towards Vagueness-Aware Semantic Data

  • 1. Towards Vagueness-Aware Semantic Data Panos Alexopoulos, Boris Villazon-Terrazas, Jeff Z. Pan 9th International Workshop in Uncertainty Reasoning for the Semantic Web Sydney, Australia, October 21st, 2013
  • 2. Vagueness & Semantic Data Vagueness ●Semantic data and ontologies ● Semantic data: data annotated with ontological vocabulary ● Ontological vocabulary is defined in ontologies ● Ontologies meant to capture shared understanding ●What happens when vocabularies have blurred boundaries: ● What’s the threshold number of years separating old and not old films? ● What are the exact criteria that distinguish modern restaurants from non-modern? 2
  • 3. Vagueness & Semantic Data Examples of Vague Semantic Data ●Vague categorization relations where entities are assigned to categories with no clear applicability criteria. ● “hasFilmGenre” (LinkedMDB and Freebase). ● “dbpedia-owl:ideology“ and “dbpedia-owl:movement“ (DBPedia) ●Specializations of concepts according to some vague property of them. ● “Famous Person" and “Big Building" (Cyc Ontology) ● “Managerial Role” and “Competitor“ (Business Role Ontology) 3
  • 4. Vagueness & Semantic Data Vagueness ●Two main types: ● Quantitative: Lack of crisp applicability boundaries for the predicate in one or more dimension (e.g., Tall, Rich, Recent etc.) ● Qualitative: Inability to define sufficient applicability criteria for the predicate (e.g., Modern, Expert, Religion etc.) ●Additional characteristics: ● Subjectiveness: The same vague term can be differently interpreted and/or applied by different users. ● Context dependence: The same vague term can be differently interpreted or applied in different contexts (even if the user is the same). 4
  • 5. Vagueness & Semantic Data Our Threefold Position 1. Vagueness may be found not only within isolated applicationspecific semantic data but also in public datasets that are meant to be shareable and reusable (e.g. Linked Open Data). 2. This vagueness may hamper the comprehensibility and shareability of these datasets and cause problems in a number of different use case scenarios. 3. The negative effects vagueness may cause, can be partially tackled by annotating the vague data elements with metainformation that explicitly describes the vagueness's nature and characteristics. 5
  • 6. Vagueness & Semantic Data Vagueness Consequences ●The problem with vague terms in semantic data is that they may cause disagreements among the people who develop it, maintain it or use it. ●E.g., when we asked domain experts to provide instances of the concept Critical Business Process, there were certain processes for which there was a dispute among them about whether they should be regarded as critical or not ●The problem was that different experts had different criteria of process criticality and could not decide which of those criteria were sufficient to classify a process as critical. ●In other words, qualitative vagueness!. 6
  • 7. Vagueness & Semantic Data Vagueness in Data Creation and Exploitation Scenarios 1. Structuring Data with a Vague Ontology: When domain experts are asked to define instances of vague concepts and relations, then disagreements may occur on whether particular entities are actually instances of them or not. 2. Utilizing Vague Facts in Ontology-Based Systems: When knowledge-based systems utilize vague facts as part of their reasoning, then their output might not be optimal for its users when the latter disagree with these facts. 7
  • 8. Vagueness & Semantic Data Vagueness Consequences in Use Case Scenarios 3. Integrating Vague Semantic Information: When semantic data from different sources need to be merged in a single dataset then the merging of particular vague elements can lead to data that will not be valid for all its users. 4. Evaluating Vague Semantic Datasets for Reuse: When data practitioners need to decide whether a particular dataset is suitable for their needs, then the existence of vague elements in it can make it difficult to assess a priori whether the data related to these elements are valid for their application context. 8
  • 9. Vagueness & Semantic Data Rationale for Vagueness-Aware Data ●Vagueness-aware semantic data is “data whose vague ontological elements are accompanied by comprehensive metainformation that describes the nature and characteristics of their vagueness”. ●E.g., a useful piece of metainformation is the set of applicability criteria that the element creator had in mind when defining the element . ●When two datasets have the same vague concept, the knowledge of these criteria can: ● Prevent their merging in case these criteria are different. ● Help a data practitioner decide which of these two concepts' associated instances are more suitable for his/her application. 9
  • 10. A Metaontology for Vagueness Proposed Metainformation Vagueness Type E.g. the concept “Low Budget” has quantitative vagueness while the concept “Expert Consultant” qualitative. Vagueness dimensions and applicability criteria “E.g. the term “Strategic Client" is vague in the dimension of the revenue it generates” Applicability Context E.g. “Company X is a strategic client” only for the purposes of R&D collaboration. Element Creator E.g. “Company X is a strategic client” is asserted by the Financial Manager. 10
  • 11. A Metaontology for Vagueness Metamodel Classes 11
  • 12. A Metaontology for Vagueness Metamodel Properties 12
  • 13. Vagueness-Aware Semantic Data Production & Consumption ●Our metamodel is meant to be used by both producers and consumers of ontologies and semantic data. ● The former to annotate the vague part of their produced ontologies with relevant vagueness metainformation ● The latter to query this metainformation and use it to make a better use of the data. ●The annotation task is primarily manual and should take place in the course of the ontology's construction and evolution process. ●Future research should focus on making production of vaguenessaware data easier and more efficient. 13
  • 14. Vagueness-Aware Semantic Data Examples of Annotated Vague Semantic Data Vague Class with Type and Dimensions Vague Relation with Type, Dimension and Context Vague Axioms with Context and Creator 14
  • 15. Vagueness-Aware Semantic Data Supported Competency Questions ●What elements have been defined as vague? ●What elements have qualitative vagueness? ●What elements have quantitative vagueness, in what contexts and in what dimensions? ●How many different applicability contexts does the relation “isExpertAtResearch” have? ●Who asserts that “Accenture is a Competitor” and in what context? 15
  • 16. Vagueness-Aware Semantic Data Usage and Benefits Metamodel Usage • Communicate the meaning of the vague elements to the domain experts. Structuring Data with a Vague Ontology • Use the metamodel to characterize the created data's vagueness. Expected Benefits • Make the job of the experts easier and faster and reduce disagreements among them. • Check which data is vague. Utilizing Vague Semantic Data in an Ontology-Based System • Use the properties of the vague elements to provide vagueness-related explanations to the users. • Know a priori which data may affect the system's effectiveness. Integrating Vague Semantic Datasets • Compare same vague elements across datasets according to their vagueness type and dimensions • Know a priori which data may affect the system's effectiveness. Evaluating Vague Semantic Datasets for Reuse • Query the metamodel to check the vagueness compatibility of the dataset with the intended domain and application scenario. • Avoid re-using (parts of) datasets that are not compatible to own interpretation of vagueness. 16
  • 17. Vagueness-Aware Semantic Data Concluding Remarks ●The overall idea behind our proposed approach is that the availability of vagueness metadata in ontologies and semantic data will manage to reduce the high level of disagreement and increase the low level of comprehensibility it may cause. ●In that sense, it can be a major step towards achieving better common understanding, and thus shareability, of vague semantic information in the Semantic Web. ●Moreover, our approach is complementary to any fuzzy ontology related work, in the sense that it may be used to provide better explanations on the intended meaning of fuzzy degrees. 17
  • 18. Vagueness-Aware Semantic Data Future Work ●We are currently working towards three directions: ● Express the metamodel in a more formal way. ● Evaluate its usefulness by means of qualtitative experiments that will measure the level of increased semantic data comprehensibility and shareability it may achieve. ● Facilitate and encourage the practical use of the metamodel within the Semantic Data Lifecycle by means of relevant guidelines, methods and tools. 18
  • 19. Thank you! Questions? Dr. Panos Alexopoulos Semantic Applications Research Manager Quieres innovar? palexopoulos@isoco.com (t) +34 913 349 797 iSOCO Barcelona iSOCO Madrid iSOCO Pamplona iSOCO Valencia iSOCO Colombia Av. Torre Blanca, 57 Edificio ESADE CREAPOLIS Oficina 3C 15 08172 Sant Cugat del Vallès Barcelona, España (t) +34 935 677 200 Av. del Partenón, 16-18, 1º7ª Campo de las Naciones 28042 Madrid España (t) +34 913 349 797 Parque Tomás Caballero, 2, 6º4ª 31006 Pamplona España (t) +34 948 102 408 C/ Prof. Beltrán Báguena, 4 Oficina 107 46009 Valencia España (t) +34 963 467 143 Complejo Ruta N Calle 67, 52-20 Piso 3, Torre A Medellín Colombia (t) +57 516 7770 ext. 1132 Key Vendor Virtual Assistant 2013 19