SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
Reusing XML Schemas' Information as a Foundation for Designing
                             Domain Ontologies
                                                      Thomas Bosch (M.Sc.)
                                      thomas.bosch@gesis.org | http://boschthomas.blogspot.com

Problem                                                                Main Research Question
• Traditionally, ontology engineers work in close collaboration with   How to accelerate the time-consuming process designing DOs
  domain experts to design domain ontologies (DOs) which requires      based on already available XSDs?
  lots of time and effort
• DOs as well as XSDs describe domain data models                      Hypothesis
• In many cases, XSDs are already defined and can therefore be         The effort and the time delivering high quality DOs using the
  reused to design DOs                                                 proposed approach is much less than creating DOs completely
                                                                       manual

Proposed Approach                                                      Derive DOs using SWRL rules




                                                                       Benefits
XSD and OWL follow different modeling goals, the mapping transports    • Process designing DOs from scratch is sped up significantly
only XSDs' information, and generated ontologies (GOs) are not         • All XSDs' information (terminology, syntactic structure of XML docs)
conform to the highest quality requirements of DOs                       is reused in GOs
 GOs are not immediately useful                                       • GOs' RDF representations can be published in the LOD cloud and
 domain experts and ontology engineers enrich GOs with additional       linked to other RDF datasets
   domain-specific semantic information in form of DOs                 • All XML data conforming to XSDs can be imported automatically as
                                                                         DOs' instances
Map XSDs to GOs                                                        • GOs and DOs can be maintained in a fast way
• <xs:element name="VariableName" ... />                               • Detect technical and content-related data models' weaknesses
   VariableName ⊑ Element
• <xs:element name= "VariableName" ... />
                                                                       Novelty of Approach
                                                                       •   Based on XSD meta-model
   VariableName ⊑ name_Element_String.{'VariableName'}
                                                                       •   Does not extract semantics out of XSDs
• <xs:attribute ref="lang"/>
                                                                       •   Transformation on terminological and assertional knowledge level
   Lang-Reference ⊑ ref_Attribute_Attribute.Lang
                                                                       •   Automatic transformation of XSDs and XML docs
• <xs:element name="VariableName" type="NameType"/>
                                                                       •   More expressive power of OWL instead of RDFS GOs
   VariableName ⊑ type_Element_Type.NameType
• <xs:extension><xs:attribute name="translated"/><xs:attribute         Limitations
  name= "translatable"/></xs:extension>  Extension1 ⊑                 • Prerequisite: XSDs
    contains_Extension_Attribute.(Translated ⊔ Translatable)           • Not suitable use cases (e.g. when XSDs do not represent the
                                                                         domain knowledge correctly or when XSDs are technically not well
Use Cases                                                                designed)
• To proof approach's generality: any XSDs and corresp. XML docs
  can be converted to GOs and their RDF representations, as all XSD    Evaluation
  meta-model's components are covered                                  • To verify the hypothesis
• Generic test cases: derived from XSD meta-model                      • User study to compare traditional manual and proposed approach
• Domain-specific use cases: Data Documentation Initiative (DDI)         (define measurement methods)
  ontology; projects: MISSY, da|ra, LOD pilot project, SOFISwiki       • Derive DOs of multiple and differing domains

Más contenido relacionado

Último

CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
Wonjun Hwang
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
Navigating the Large Language Model choices_Ravi Daparthi
Navigating the Large Language Model choices_Ravi DaparthiNavigating the Large Language Model choices_Ravi Daparthi
Navigating the Large Language Model choices_Ravi Daparthi
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 

Destacado

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Destacado (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies by Thomas Bosch

  • 1. Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies Thomas Bosch (M.Sc.) thomas.bosch@gesis.org | http://boschthomas.blogspot.com Problem Main Research Question • Traditionally, ontology engineers work in close collaboration with How to accelerate the time-consuming process designing DOs domain experts to design domain ontologies (DOs) which requires based on already available XSDs? lots of time and effort • DOs as well as XSDs describe domain data models Hypothesis • In many cases, XSDs are already defined and can therefore be The effort and the time delivering high quality DOs using the reused to design DOs proposed approach is much less than creating DOs completely manual Proposed Approach Derive DOs using SWRL rules Benefits XSD and OWL follow different modeling goals, the mapping transports • Process designing DOs from scratch is sped up significantly only XSDs' information, and generated ontologies (GOs) are not • All XSDs' information (terminology, syntactic structure of XML docs) conform to the highest quality requirements of DOs is reused in GOs  GOs are not immediately useful • GOs' RDF representations can be published in the LOD cloud and  domain experts and ontology engineers enrich GOs with additional linked to other RDF datasets domain-specific semantic information in form of DOs • All XML data conforming to XSDs can be imported automatically as DOs' instances Map XSDs to GOs • GOs and DOs can be maintained in a fast way • <xs:element name="VariableName" ... /> • Detect technical and content-related data models' weaknesses  VariableName ⊑ Element • <xs:element name= "VariableName" ... /> Novelty of Approach • Based on XSD meta-model  VariableName ⊑ name_Element_String.{'VariableName'} • Does not extract semantics out of XSDs • <xs:attribute ref="lang"/> • Transformation on terminological and assertional knowledge level  Lang-Reference ⊑ ref_Attribute_Attribute.Lang • Automatic transformation of XSDs and XML docs • <xs:element name="VariableName" type="NameType"/> • More expressive power of OWL instead of RDFS GOs  VariableName ⊑ type_Element_Type.NameType • <xs:extension><xs:attribute name="translated"/><xs:attribute Limitations name= "translatable"/></xs:extension>  Extension1 ⊑ • Prerequisite: XSDs contains_Extension_Attribute.(Translated ⊔ Translatable) • Not suitable use cases (e.g. when XSDs do not represent the domain knowledge correctly or when XSDs are technically not well Use Cases designed) • To proof approach's generality: any XSDs and corresp. XML docs can be converted to GOs and their RDF representations, as all XSD Evaluation meta-model's components are covered • To verify the hypothesis • Generic test cases: derived from XSD meta-model • User study to compare traditional manual and proposed approach • Domain-specific use cases: Data Documentation Initiative (DDI) (define measurement methods) ontology; projects: MISSY, da|ra, LOD pilot project, SOFISwiki • Derive DOs of multiple and differing domains