SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
Reusing XML Schemas' Information as a Foundation for Designing
                             Domain Ontologies
                                                      Thomas Bosch (M.Sc.)
                                      thomas.bosch@gesis.org | http://boschthomas.blogspot.com

Problem                                                                Main Research Question
• Traditionally, ontology engineers work in close collaboration with   How to accelerate the time-consuming process designing DOs
  domain experts to design domain ontologies (DOs) which requires      based on already available XSDs?
  lots of time and effort
• DOs as well as XSDs describe domain data models                      Hypothesis
• In many cases, XSDs are already defined and can therefore be         The effort and the time delivering high quality DOs using the
  reused to design DOs                                                 proposed approach is much less than creating DOs completely
                                                                       manual

Proposed Approach                                                      Derive DOs using SWRL rules




                                                                       Benefits
XSD and OWL follow different modeling goals, the mapping transports    • Process designing DOs from scratch is sped up significantly
only XSDs' information, and generated ontologies (GOs) are not         • All XSDs' information (terminology, syntactic structure of XML docs)
conform to the highest quality requirements of DOs                       is reused in GOs
 GOs are not immediately useful                                       • GOs' RDF representations can be published in the LOD cloud and
 domain experts and ontology engineers enrich GOs with additional       linked to other RDF datasets
   domain-specific semantic information in form of DOs                 • All XML data conforming to XSDs can be imported automatically as
                                                                         DOs' instances
Map XSDs to GOs                                                        • GOs and DOs can be maintained in a fast way
• <xs:element name="VariableName" ... />                               • Detect technical and content-related data models' weaknesses
   VariableName ⊑ Element
• <xs:element name= "VariableName" ... />
                                                                       Novelty of Approach
                                                                       •   Based on XSD meta-model
   VariableName ⊑ name_Element_String.{'VariableName'}
                                                                       •   Does not extract semantics out of XSDs
• <xs:attribute ref="lang"/>
                                                                       •   Transformation on terminological and assertional knowledge level
   Lang-Reference ⊑ ref_Attribute_Attribute.Lang
                                                                       •   Automatic transformation of XSDs and XML docs
• <xs:element name="VariableName" type="NameType"/>
                                                                       •   More expressive power of OWL instead of RDFS GOs
   VariableName ⊑ type_Element_Type.NameType
• <xs:extension><xs:attribute name="translated"/><xs:attribute         Limitations
  name= "translatable"/></xs:extension>  Extension1 ⊑                 • Prerequisite: XSDs
    contains_Extension_Attribute.(Translated ⊔ Translatable)           • Not suitable use cases (e.g. when XSDs do not represent the
                                                                         domain knowledge correctly or when XSDs are technically not well
Use Cases                                                                designed)
• To proof approach's generality: any XSDs and corresp. XML docs
  can be converted to GOs and their RDF representations, as all XSD    Evaluation
  meta-model's components are covered                                  • To verify the hypothesis
• Generic test cases: derived from XSD meta-model                      • User study to compare traditional manual and proposed approach
• Domain-specific use cases: Data Documentation Initiative (DDI)         (define measurement methods)
  ontology; projects: MISSY, da|ra, LOD pilot project, SOFISwiki       • Derive DOs of multiple and differing domains

Más contenido relacionado

La actualidad más candente

Mdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-modelsMdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-models
Rafael Alvarado
 

La actualidad más candente (13)

SHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data MudSHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data Mud
 
RDF validation tutorial
RDF validation tutorialRDF validation tutorial
RDF validation tutorial
 
RDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL PlatformsRDF Graph Data Management in Oracle Database and NoSQL Platforms
RDF Graph Data Management in Oracle Database and NoSQL Platforms
 
Data shapes-test-suite
Data shapes-test-suiteData shapes-test-suite
Data shapes-test-suite
 
SHACL Specification Draft
SHACL Specification DraftSHACL Specification Draft
SHACL Specification Draft
 
The Web Ontology Language
The Web Ontology LanguageThe Web Ontology Language
The Web Ontology Language
 
Core Java Training in Bangalore | Best Core Java Class in Bangalore
Core Java Training in Bangalore | Best Core Java Class in BangaloreCore Java Training in Bangalore | Best Core Java Class in Bangalore
Core Java Training in Bangalore | Best Core Java Class in Bangalore
 
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
Graph Databases and Web Frameworks (NodeJS, AngularJS, GridFS, OpenLink Virtu...
 
Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)Jarrar: OWL (Web Ontology Language)
Jarrar: OWL (Web Ontology Language)
 
SHACL Overview
SHACL OverviewSHACL Overview
SHACL Overview
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache Solr
 
Mdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-modelsMdst3703 2013-09-17-text-models
Mdst3703 2013-09-17-text-models
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
 

Destacado (12)

Fogão a Lenha
Fogão a LenhaFogão a Lenha
Fogão a Lenha
 
Fumicold
FumicoldFumicold
Fumicold
 
Hill Resume
Hill ResumeHill Resume
Hill Resume
 
Poster Presentation Final Draft
Poster Presentation Final DraftPoster Presentation Final Draft
Poster Presentation Final Draft
 
CC2 Tower Crane
CC2 Tower CraneCC2 Tower Crane
CC2 Tower Crane
 
Ideen für sichere und gesunde Arbeit gesucht 2013
Ideen für sichere und gesunde Arbeit gesucht 2013Ideen für sichere und gesunde Arbeit gesucht 2013
Ideen für sichere und gesunde Arbeit gesucht 2013
 
FST recognition letter -- Mike E.PDF
FST recognition letter -- Mike E.PDFFST recognition letter -- Mike E.PDF
FST recognition letter -- Mike E.PDF
 
Queue 7
Queue 7Queue 7
Queue 7
 
PIANTE NELLO SPAZIO
PIANTE NELLO SPAZIOPIANTE NELLO SPAZIO
PIANTE NELLO SPAZIO
 
Manzano. maria. actvidad no1
Manzano. maria. actvidad no1Manzano. maria. actvidad no1
Manzano. maria. actvidad no1
 
Ashley Bell Case Part 2
Ashley Bell Case Part 2Ashley Bell Case Part 2
Ashley Bell Case Part 2
 
Terminos psu
Terminos psuTerminos psu
Terminos psu
 

Similar a SSSW 2012 - Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies

ravenbenweb xml and its application .PPT
ravenbenweb xml and its application .PPTravenbenweb xml and its application .PPT
ravenbenweb xml and its application .PPT
ubaidullah75790
 
Drill Bay Area HUG 2012-09-19
Drill Bay Area HUG 2012-09-19Drill Bay Area HUG 2012-09-19
Drill Bay Area HUG 2012-09-19
jasonfrantz
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Chimezie Ogbuji
 

Similar a SSSW 2012 - Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies (20)

2012.11 - ISWC 2012 - DC - 2
2012.11 - ISWC 2012 - DC -  22012.11 - ISWC 2012 - DC -  2
2012.11 - ISWC 2012 - DC - 2
 
2012.11 - ISWC 2012 - DC - 1
2012.11 - ISWC 2012 - DC - 12012.11 - ISWC 2012 - DC - 1
2012.11 - ISWC 2012 - DC - 1
 
Agile xml
Agile xmlAgile xml
Agile xml
 
Evolution of the Graph Schema
Evolution of the Graph SchemaEvolution of the Graph Schema
Evolution of the Graph Schema
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Unit iv xml dom
Unit iv xml domUnit iv xml dom
Unit iv xml dom
 
The state of KOS in the Linked Data movement
The state of KOS in the Linked Data movementThe state of KOS in the Linked Data movement
The state of KOS in the Linked Data movement
 
Poster
PosterPoster
Poster
 
XML - SAX
XML - SAXXML - SAX
XML - SAX
 
ravenbenweb xml and its application .PPT
ravenbenweb xml and its application .PPTravenbenweb xml and its application .PPT
ravenbenweb xml and its application .PPT
 
MongoDB 2.4 and spring data
MongoDB 2.4 and spring dataMongoDB 2.4 and spring data
MongoDB 2.4 and spring data
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Oracle Week 2016 - Modern Data Architecture
Oracle Week 2016 - Modern Data ArchitectureOracle Week 2016 - Modern Data Architecture
Oracle Week 2016 - Modern Data Architecture
 
sw owl
 sw owl sw owl
sw owl
 
NoSQL and MongoDB
NoSQL and MongoDBNoSQL and MongoDB
NoSQL and MongoDB
 
Drill dchug-29 nov2012
Drill dchug-29 nov2012Drill dchug-29 nov2012
Drill dchug-29 nov2012
 
NOsql Presentation.pdf
NOsql Presentation.pdfNOsql Presentation.pdf
NOsql Presentation.pdf
 
Drill Bay Area HUG 2012-09-19
Drill Bay Area HUG 2012-09-19Drill Bay Area HUG 2012-09-19
Drill Bay Area HUG 2012-09-19
 
Sep 2012 HUG: Apache Drill for Interactive Analysis
Sep 2012 HUG: Apache Drill for Interactive Analysis Sep 2012 HUG: Apache Drill for Interactive Analysis
Sep 2012 HUG: Apache Drill for Interactive Analysis
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
 

Más de Dr.-Ing. Thomas Hartmann

KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016
Dr.-Ing. Thomas Hartmann
 
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
Dr.-Ing. Thomas Hartmann
 
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Dr.-Ing. Thomas Hartmann
 
2013.02 - 7th Workshop of German Panel Surveys
2013.02 - 7th Workshop of German Panel Surveys2013.02 - 7th Workshop of German Panel Surveys
2013.02 - 7th Workshop of German Panel Surveys
Dr.-Ing. Thomas Hartmann
 

Más de Dr.-Ing. Thomas Hartmann (20)

Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)
Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)
Doctoral Examination at the Karlsruhe Institute of Technology (08.07.2016)
 
KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016
 
2016.02 - Validating RDF Data Quality using Constraints to Direct the Develop...
2016.02 - Validating RDF Data Quality using Constraints to Direct the Develop...2016.02 - Validating RDF Data Quality using Constraints to Direct the Develop...
2016.02 - Validating RDF Data Quality using Constraints to Direct the Develop...
 
2015.09. - The Role of Reasoning for RDF Validation (SEMANTiCS 2015)
2015.09. - The Role of Reasoning for RDF Validation (SEMANTiCS 2015)2015.09. - The Role of Reasoning for RDF Validation (SEMANTiCS 2015)
2015.09. - The Role of Reasoning for RDF Validation (SEMANTiCS 2015)
 
2015.09 - Guidance, Please! Towards a Framework for RDF-Based Constraint Lang...
2015.09 - Guidance, Please! Towards a Framework for RDF-Based Constraint Lang...2015.09 - Guidance, Please! Towards a Framework for RDF-Based Constraint Lang...
2015.09 - Guidance, Please! Towards a Framework for RDF-Based Constraint Lang...
 
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
2015.03 - The RDF Validator - A Tool to Validate RDF Data (KIM)
 
2014.12 - Let's Disco - 2 (EDDI 2014)
2014.12 - Let's Disco - 2 (EDDI 2014)2014.12 - Let's Disco - 2 (EDDI 2014)
2014.12 - Let's Disco - 2 (EDDI 2014)
 
2014.12 - Let's Disco (EDDI 2014)
2014.12 - Let's Disco (EDDI 2014)2014.12 - Let's Disco (EDDI 2014)
2014.12 - Let's Disco (EDDI 2014)
 
2014.10 - How to Formulate and Validate Constraints (DC 2014)
2014.10 - How to Formulate and Validate Constraints (DC 2014)2014.10 - How to Formulate and Validate Constraints (DC 2014)
2014.10 - How to Formulate and Validate Constraints (DC 2014)
 
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
 
2014.10 - Requirements on RDF Constraint Formulation and Validation (DC 2014)
2014.10 - Requirements on RDF Constraint Formulation and Validation (DC 2014)2014.10 - Requirements on RDF Constraint Formulation and Validation (DC 2014)
2014.10 - Requirements on RDF Constraint Formulation and Validation (DC 2014)
 
The Next Generation of the Microdata Information System MISSY - An Integrated...
The Next Generation of the Microdata Information System MISSY - An Integrated...The Next Generation of the Microdata Information System MISSY - An Integrated...
The Next Generation of the Microdata Information System MISSY - An Integrated...
 
The New Microdata Information System (MISSY) - Integration of DDI-based Data ...
The New Microdata Information System (MISSY) - Integration of DDI-based Data ...The New Microdata Information System (MISSY) - Integration of DDI-based Data ...
The New Microdata Information System (MISSY) - Integration of DDI-based Data ...
 
Use Cases and Vocabularies Related to the DDI-RDF Discovery Vocabulary (EDDI ...
Use Cases and Vocabularies Related to the DDI-RDF Discovery Vocabulary (EDDI ...Use Cases and Vocabularies Related to the DDI-RDF Discovery Vocabulary (EDDI ...
Use Cases and Vocabularies Related to the DDI-RDF Discovery Vocabulary (EDDI ...
 
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
Towards the Discovery of Person-Level Data (SemStats, ISWC 2013) [2013.10]
 
2013.05 - IASSIST 2013 - 3
2013.05 - IASSIST 2013 - 32013.05 - IASSIST 2013 - 3
2013.05 - IASSIST 2013 - 3
 
2013.05 - IASSIST 2013 - 2
2013.05 - IASSIST 2013 - 22013.05 - IASSIST 2013 - 2
2013.05 - IASSIST 2013 - 2
 
2013.05 - IASSIST 2013
2013.05 - IASSIST 20132013.05 - IASSIST 2013
2013.05 - IASSIST 2013
 
2013.05 - LDOW 2013 @ WWW 2013
2013.05 - LDOW 2013 @ WWW 20132013.05 - LDOW 2013 @ WWW 2013
2013.05 - LDOW 2013 @ WWW 2013
 
2013.02 - 7th Workshop of German Panel Surveys
2013.02 - 7th Workshop of German Panel Surveys2013.02 - 7th Workshop of German Panel Surveys
2013.02 - 7th Workshop of German Panel Surveys
 

SSSW 2012 - Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies

  • 1. Reusing XML Schemas' Information as a Foundation for Designing Domain Ontologies Thomas Bosch (M.Sc.) thomas.bosch@gesis.org | http://boschthomas.blogspot.com Problem Main Research Question • Traditionally, ontology engineers work in close collaboration with How to accelerate the time-consuming process designing DOs domain experts to design domain ontologies (DOs) which requires based on already available XSDs? lots of time and effort • DOs as well as XSDs describe domain data models Hypothesis • In many cases, XSDs are already defined and can therefore be The effort and the time delivering high quality DOs using the reused to design DOs proposed approach is much less than creating DOs completely manual Proposed Approach Derive DOs using SWRL rules Benefits XSD and OWL follow different modeling goals, the mapping transports • Process designing DOs from scratch is sped up significantly only XSDs' information, and generated ontologies (GOs) are not • All XSDs' information (terminology, syntactic structure of XML docs) conform to the highest quality requirements of DOs is reused in GOs  GOs are not immediately useful • GOs' RDF representations can be published in the LOD cloud and  domain experts and ontology engineers enrich GOs with additional linked to other RDF datasets domain-specific semantic information in form of DOs • All XML data conforming to XSDs can be imported automatically as DOs' instances Map XSDs to GOs • GOs and DOs can be maintained in a fast way • <xs:element name="VariableName" ... /> • Detect technical and content-related data models' weaknesses  VariableName ⊑ Element • <xs:element name= "VariableName" ... /> Novelty of Approach • Based on XSD meta-model  VariableName ⊑ name_Element_String.{'VariableName'} • Does not extract semantics out of XSDs • <xs:attribute ref="lang"/> • Transformation on terminological and assertional knowledge level  Lang-Reference ⊑ ref_Attribute_Attribute.Lang • Automatic transformation of XSDs and XML docs • <xs:element name="VariableName" type="NameType"/> • More expressive power of OWL instead of RDFS GOs  VariableName ⊑ type_Element_Type.NameType • <xs:extension><xs:attribute name="translated"/><xs:attribute Limitations name= "translatable"/></xs:extension>  Extension1 ⊑ • Prerequisite: XSDs contains_Extension_Attribute.(Translated ⊔ Translatable) • Not suitable use cases (e.g. when XSDs do not represent the domain knowledge correctly or when XSDs are technically not well Use Cases designed) • To proof approach's generality: any XSDs and corresp. XML docs can be converted to GOs and their RDF representations, as all XSD Evaluation meta-model's components are covered • To verify the hypothesis • Generic test cases: derived from XSD meta-model • User study to compare traditional manual and proposed approach • Domain-specific use cases: Data Documentation Initiative (DDI) (define measurement methods) ontology; projects: MISSY, da|ra, LOD pilot project, SOFISwiki • Derive DOs of multiple and differing domains