SlideShare una empresa de Scribd logo
1 de 42
keynote presentation at
DC-2010 Conference
Pittsburgh, PA
October 22, 2010
Bridging the Gaps:
Adaptive Approaches to Data Interoperability
Michael K. Bergman
2
The Iconoclast Cometh
3
Outline of Talk
Linked Data
Data Web, Structured Data and Semantic Web
Players and Roles  DCMI
Conclusions
4
Three Overall Assertions
<LinkedData> <isA> <ValuableTechnique>
<DataWeb> <hasNeedOf> <Semantics>
<DCMI> <hasRole> <Unique>
Linked Data
6
Three Linked Data Assertions
<LinkedData> <isA> <PreferredTechnique>
<Techniques> <doNotSolve> <RootChallenges>
<RDF> <hasBestRoleAs> <CanonicalDataModel>
7
Three More Linked Data Assertions
<LinkedData> <hasGrowing> <Triples>
<LDUsers> <wronglyUse> <ManyPredicates>
<LinkedData> <hasLack> <MajorUptake>
8
25 Billion Linked Data Triples
9
Bad Results from sameAs Misuse
10
The State of Linked Data
 Growing, but not as fast as promise would suggest
 Not used much, except curated settings
 Few actual dataset linkages
 NO true interoperability, except curated (life science,
some others)
 Difficult to publish
 If done right, best form to consume
Data, Structure and Semantic Web
12
Three Structured Data Web Assertions
<Heterogeneity> <isA> <Reality>
<LinkedData> <isOnly> <TinyContributor>
<Semantics> <isThe> <MissingLink>
13
Hundreds of Formats in the Wild
14
How to Aliquot the Firehose ?
15
Three Semantics Assertions (+ Axiom)
<ReferenceVocabs> <organize> <MassiveContent>
<LinkingPredicates> <gather> <RelatedContent>
<intersectionOf>
<SemanticContent> <enables> <MeaningfulWork>
16
Fixed References Help Orient
17
Concepts are the Fixed References
18
Design Aspects of Reference Concepts
 Truly are concepts, the idea of a thing
 Labels are language independent (à la SKOS):
 Preferred, human-readable label (prefLabel)
 Many, alternate synonyms, jargon, etc. (altLabel)
 Misspellings (hiddenLabel)
 all combined for tagging, IE purposes
 MUST have definition: what does this concept mean ?
 Organized into coherent structures (graphs)
 Inferencing
 Discovery and navigation
 Act as both classes and instances (RDF / OWL-speak)
 MUST have persistent URIs
19
Mappings Get Stuff into the Right Room
20
Many Mappings Should be Approximate
 skos:broadMatch
 skos:related
 ore:similarTo
 umbel:isAbout
 vmf:isInVocabulary
 skos:closeMatch
 lvont:nearlySameAs
 umbel:isLike
 umbel:hasCharacteristic
 lvont:somewhatSameAs
 rdfs:seeAlso
 ore:describes
 map:narrowerThan
 skos:narrower
 map:broaderThan
 skos:broader
 dc:subject
 link:uri
 foaf:isPrimaryTopicOf
21
Some Conditions for Interoperability
<Interoperability> <needsMapping> <Predicates>
<Interoperability> <needsReference> <Nouns>
Three Major Players
23
World Role
<World> <hasRole> <ContentAndStructure>
24
W3C Role
<W3C> <hasRole> <Standards>
25
DCMI Role
<DCMI> <hasRole> <ReferenceMetadata>
26
Three Going Forward Assertions
<LinkedData> <hasNeedOf> <MapPredicates>
<DataWeb> <hasNeedOf> <ReferenceConcepts>
<DCMI> <hasUniqueRole> <BothRequirements>
27
DCMI: the Unique Franchise
 DCMI already has unique authority in:
1. dc:subject
2. dc:subject qualifiers
3. initial Open Registry effort
4. core foundational properties
 DCMI has unique experience in:
1. diverse vocabularies
2. cataloging and classification
3. semantics
28
Reference Authority - Needed DCMI Role
<RefMetadata> <notSameAs> <OneRingRulesAll>
29
Reference Metadata is Not a Third Rail
30
The Web is Parched for Semantics
 Reference vocabularies
 Persistent URIs
 Re-use of vocabs
 Vetting + ranking
 Alignment services
 Annotation services
 RDFa injection
 Open source frameworks
31
We’re also Ready to Help
+
+ + + ???
32
A First Exemplar: FactForge
 A “reason-able” view to linked open data
 Pre-loaded semantic repository: reasoning, querying, exploration
 Ontologies
 Dublin Core, SKOS, RSS, FOAF
 Datasets
 DBpedia, Freebase, Geonames, UMBEL, MusicBrainz, Wordnet, CIA
World Factbook, Lingvoj
 Very large scale
 1.2B explicit + 0.9B inferred  10B retrievable statements
 Managed by BigOWLIM
 Free public service with many features:
 Auto-suggest
 Query and explore through Forest, RelFinder and Tabulator
 RDF search
 SPARQL end-point
33
Next Step, RENDER
 New EU project
 Large-scale LOD interoperability, methods
 Players:
 Karlsruher Institut fuer Technologie (DE)
 Ontotext (BG)
 Institut Jozef Stefan (SI)
 Telefonica (ES)
 Google (IE)
 Wikimedia (DE)
 STI Innsbruck (AT)
 Testbed for possible follow-ons ??
34
Possible Ontotext + SD Contributions
1. Mapping services to all comers (“vocabulary neutrality”)
2. Tagging services
3. Software + systems for other tagging services
4. Possible technical support for Metadata Registry
5. Lead / support for possible EU grant-seeking efforts
↓↓↓
If DCMI willing to partner, Ontotext + SD willing to
contribute in a neutral, open source manner
35
Ontotext + SD Links
 FactForge
http://www.factforge.net
 PROTON
http://proton.semanticweb.com
 Ontotext
http://www.ontotext.com
 RENDER
http://render-project.eu
 UMBEL
http://www.umbel.org
 Structured Dynamics
http://structureddynamics.com
Conclusion
37
Main Assertions Re-visited
 Interoperability on the Web not working:
1. Not (generally) fulfilled by linked data in current state
2. Predicates for approximate mappings lacking
3. Reference vocabularies essential as connecting nodes
 DCMI is the best (only?) player to plug these gaps
 We are willing to help find the resources + right
process to help plug the interoperability gap
38
DCMI Interoperability Services ?
Q & A
41
Contacts & Information
Michael K. Bergman
CEO
319.621.5225
mike@structureddynamics.com
blog: www.mkbergman.com
Web Sites
structureddynamics.com
citizen-dan.org (community indicator
systems)
openstructs.org (open source software)
techwiki.openstructs.org (open license
technical documentation)
umbel.org
umbel.structureddynamics.com (UMBEL
Web services)
DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Más contenido relacionado

La actualidad más candente

Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
Srinath Srinivasa
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
Connected Data World
 

La actualidad más candente (20)

Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
Linked data for Enterprise Data Integration
Linked data for Enterprise Data IntegrationLinked data for Enterprise Data Integration
Linked data for Enterprise Data Integration
 
External CV support in Dataverse 5.7
External CV support in Dataverse 5.7External CV support in Dataverse 5.7
External CV support in Dataverse 5.7
 
The Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of LeipzigThe Semantic Data Web, Sören Auer, University of Leipzig
The Semantic Data Web, Sören Auer, University of Leipzig
 
Linking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process DescriptionsLinking Big Data to Rich Process Descriptions
Linking Big Data to Rich Process Descriptions
 
Semantic Web Nature
Semantic Web NatureSemantic Web Nature
Semantic Web Nature
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
 
Linked data life cycles
Linked data life cyclesLinked data life cycles
Linked data life cycles
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
Cognitive data
Cognitive dataCognitive data
Cognitive data
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data Challenges
 
Structured Data for the Financial Industry
Structured Data for the Financial Industry Structured Data for the Financial Industry
Structured Data for the Financial Industry
 
Going for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked MetadataGoing for GOLD - Adventures in Open Linked Metadata
Going for GOLD - Adventures in Open Linked Metadata
 
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistEthics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Semantics for Big Data Integration and Analysis
Semantics for Big Data Integration and AnalysisSemantics for Big Data Integration and Analysis
Semantics for Big Data Integration and Analysis
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Creating Linked Data from Relational Databases
Creating Linked Data from Relational DatabasesCreating Linked Data from Relational Databases
Creating Linked Data from Relational Databases
 

Destacado

Context & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab AppContext & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab App
Joshua Underwood
 
Ticclassiques
TicclassiquesTicclassiques
Ticclassiques
iesrb4
 
Coretta Scott King Shihab
Coretta Scott King ShihabCoretta Scott King Shihab
Coretta Scott King Shihab
anaq
 
Programas De
Programas DeProgramas De
Programas De
tat
 
Ch14 OS
Ch14 OSCh14 OS
Ch14 OS
C.U
 
Tales of an Open Scholar
Tales of an Open ScholarTales of an Open Scholar
Tales of an Open Scholar
ethan.watrall
 

Destacado (20)

Context & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab AppContext & Connections: Designing a Vocab App
Context & Connections: Designing a Vocab App
 
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
Livinbrand 2016 - Jakub Michl, Beneš & Michl: Jak prosazujeme branding ve fir...
 
miLexicon @ Eurocall2010
miLexicon @ Eurocall2010miLexicon @ Eurocall2010
miLexicon @ Eurocall2010
 
Foundations of Open Source Economic Development Presentation 2 Curve 1
Foundations of Open Source Economic Development Presentation 2 Curve 1Foundations of Open Source Economic Development Presentation 2 Curve 1
Foundations of Open Source Economic Development Presentation 2 Curve 1
 
Ticclassiques
TicclassiquesTicclassiques
Ticclassiques
 
Salzburg
SalzburgSalzburg
Salzburg
 
User Testing Tactics
User Testing TacticsUser Testing Tactics
User Testing Tactics
 
Coretta Scott King Shihab
Coretta Scott King ShihabCoretta Scott King Shihab
Coretta Scott King Shihab
 
Social and business activities alignment
Social and business activities alignmentSocial and business activities alignment
Social and business activities alignment
 
The commoditization and fragmentation of the ia community
The commoditization and fragmentation of the ia communityThe commoditization and fragmentation of the ia community
The commoditization and fragmentation of the ia community
 
Intj0808pdf
Intj0808pdfIntj0808pdf
Intj0808pdf
 
Top Reasons To Recycle
Top Reasons To RecycleTop Reasons To Recycle
Top Reasons To Recycle
 
Programas De
Programas DeProgramas De
Programas De
 
Is This Clickable? - Change how you look at the web
Is This Clickable? - Change how you look at the webIs This Clickable? - Change how you look at the web
Is This Clickable? - Change how you look at the web
 
Ch14 OS
Ch14 OSCh14 OS
Ch14 OS
 
Tales of an Open Scholar
Tales of an Open ScholarTales of an Open Scholar
Tales of an Open Scholar
 
Data-driven Applications with conStruct
Data-driven Applications with conStructData-driven Applications with conStruct
Data-driven Applications with conStruct
 
Publicness
PublicnessPublicness
Publicness
 
Googley Family Philanthropy
Googley Family PhilanthropyGoogley Family Philanthropy
Googley Family Philanthropy
 
User Experience Utopia (Ad Club Seattle)
User Experience Utopia (Ad Club Seattle)User Experience Utopia (Ad Club Seattle)
User Experience Utopia (Ad Club Seattle)
 

Similar a DCMI Keynote: Bridging the Semantic Gaps and Interoperability

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
Tomek Pluskiewicz
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
Mediabistro
 

Similar a DCMI Keynote: Bridging the Semantic Gaps and Interoperability (20)

The Future of LOD
The Future of LODThe Future of LOD
The Future of LOD
 
OSLC & The Future of Interoperability
OSLC & The Future of InteroperabilityOSLC & The Future of Interoperability
OSLC & The Future of Interoperability
 
Web Data Management in the RDF Age
Web Data Management in the RDF AgeWeb Data Management in the RDF Age
Web Data Management in the RDF Age
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Decentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic WebDecentralized Data Management for the Semantic Web
Decentralized Data Management for the Semantic Web
 
Linking Open Data
Linking Open DataLinking Open Data
Linking Open Data
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107鏈結資料在圖書館的應用20131107
鏈結資料在圖書館的應用20131107
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
 
Information Extraction and Linked Data Cloud
Information Extraction and Linked Data CloudInformation Extraction and Linked Data Cloud
Information Extraction and Linked Data Cloud
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
Towards Virtual Knowledge Graphs over Web APIs
Towards Virtual Knowledge Graphs over Web APIsTowards Virtual Knowledge Graphs over Web APIs
Towards Virtual Knowledge Graphs over Web APIs
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 PittsburghDCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
 
Map of the CETIS metadata and digital repository interoperability domain
Map of the CETIS metadata and digital repository interoperability domainMap of the CETIS metadata and digital repository interoperability domain
Map of the CETIS metadata and digital repository interoperability domain
 
Web 3 Mark Greaves
Web 3 Mark GreavesWeb 3 Mark Greaves
Web 3 Mark Greaves
 
Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDF
 
The Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge RepresentationThe Empirical Turn in Knowledge Representation
The Empirical Turn in Knowledge Representation
 
Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021Hala skafkeynote@conferencedata2021
Hala skafkeynote@conferencedata2021
 

Más de Mike Bergman

Más de Mike Bergman (6)

Context, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge OntologyContext, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge Ontology
 
Seven Arguments for Semantic Technologies
Seven Arguments for Semantic TechnologiesSeven Arguments for Semantic Technologies
Seven Arguments for Semantic Technologies
 
The Rationale for Semantic Technologies
The Rationale for Semantic TechnologiesThe Rationale for Semantic Technologies
The Rationale for Semantic Technologies
 
Structured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product StackStructured Dynamics' Semantic Technologies Product Stack
Structured Dynamics' Semantic Technologies Product Stack
 
UMBEL: Subject Concepts Layer for the Web
UMBEL: Subject Concepts Layer for the WebUMBEL: Subject Concepts Layer for the Web
UMBEL: Subject Concepts Layer for the Web
 
UMBEL Semantic Web Services
UMBEL Semantic Web ServicesUMBEL Semantic Web Services
UMBEL Semantic Web Services
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Último (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 

DCMI Keynote: Bridging the Semantic Gaps and Interoperability

  • 1. keynote presentation at DC-2010 Conference Pittsburgh, PA October 22, 2010 Bridging the Gaps: Adaptive Approaches to Data Interoperability Michael K. Bergman
  • 3. 3 Outline of Talk Linked Data Data Web, Structured Data and Semantic Web Players and Roles  DCMI Conclusions
  • 4. 4 Three Overall Assertions <LinkedData> <isA> <ValuableTechnique> <DataWeb> <hasNeedOf> <Semantics> <DCMI> <hasRole> <Unique>
  • 6. 6 Three Linked Data Assertions <LinkedData> <isA> <PreferredTechnique> <Techniques> <doNotSolve> <RootChallenges> <RDF> <hasBestRoleAs> <CanonicalDataModel>
  • 7. 7 Three More Linked Data Assertions <LinkedData> <hasGrowing> <Triples> <LDUsers> <wronglyUse> <ManyPredicates> <LinkedData> <hasLack> <MajorUptake>
  • 8. 8 25 Billion Linked Data Triples
  • 9. 9 Bad Results from sameAs Misuse
  • 10. 10 The State of Linked Data  Growing, but not as fast as promise would suggest  Not used much, except curated settings  Few actual dataset linkages  NO true interoperability, except curated (life science, some others)  Difficult to publish  If done right, best form to consume
  • 11. Data, Structure and Semantic Web
  • 12. 12 Three Structured Data Web Assertions <Heterogeneity> <isA> <Reality> <LinkedData> <isOnly> <TinyContributor> <Semantics> <isThe> <MissingLink>
  • 13. 13 Hundreds of Formats in the Wild
  • 14. 14 How to Aliquot the Firehose ?
  • 15. 15 Three Semantics Assertions (+ Axiom) <ReferenceVocabs> <organize> <MassiveContent> <LinkingPredicates> <gather> <RelatedContent> <intersectionOf> <SemanticContent> <enables> <MeaningfulWork>
  • 17. 17 Concepts are the Fixed References
  • 18. 18 Design Aspects of Reference Concepts  Truly are concepts, the idea of a thing  Labels are language independent (à la SKOS):  Preferred, human-readable label (prefLabel)  Many, alternate synonyms, jargon, etc. (altLabel)  Misspellings (hiddenLabel)  all combined for tagging, IE purposes  MUST have definition: what does this concept mean ?  Organized into coherent structures (graphs)  Inferencing  Discovery and navigation  Act as both classes and instances (RDF / OWL-speak)  MUST have persistent URIs
  • 19. 19 Mappings Get Stuff into the Right Room
  • 20. 20 Many Mappings Should be Approximate  skos:broadMatch  skos:related  ore:similarTo  umbel:isAbout  vmf:isInVocabulary  skos:closeMatch  lvont:nearlySameAs  umbel:isLike  umbel:hasCharacteristic  lvont:somewhatSameAs  rdfs:seeAlso  ore:describes  map:narrowerThan  skos:narrower  map:broaderThan  skos:broader  dc:subject  link:uri  foaf:isPrimaryTopicOf
  • 21. 21 Some Conditions for Interoperability <Interoperability> <needsMapping> <Predicates> <Interoperability> <needsReference> <Nouns>
  • 23. 23 World Role <World> <hasRole> <ContentAndStructure>
  • 25. 25 DCMI Role <DCMI> <hasRole> <ReferenceMetadata>
  • 26. 26 Three Going Forward Assertions <LinkedData> <hasNeedOf> <MapPredicates> <DataWeb> <hasNeedOf> <ReferenceConcepts> <DCMI> <hasUniqueRole> <BothRequirements>
  • 27. 27 DCMI: the Unique Franchise  DCMI already has unique authority in: 1. dc:subject 2. dc:subject qualifiers 3. initial Open Registry effort 4. core foundational properties  DCMI has unique experience in: 1. diverse vocabularies 2. cataloging and classification 3. semantics
  • 28. 28 Reference Authority - Needed DCMI Role <RefMetadata> <notSameAs> <OneRingRulesAll>
  • 29. 29 Reference Metadata is Not a Third Rail
  • 30. 30 The Web is Parched for Semantics  Reference vocabularies  Persistent URIs  Re-use of vocabs  Vetting + ranking  Alignment services  Annotation services  RDFa injection  Open source frameworks
  • 31. 31 We’re also Ready to Help + + + + ???
  • 32. 32 A First Exemplar: FactForge  A “reason-able” view to linked open data  Pre-loaded semantic repository: reasoning, querying, exploration  Ontologies  Dublin Core, SKOS, RSS, FOAF  Datasets  DBpedia, Freebase, Geonames, UMBEL, MusicBrainz, Wordnet, CIA World Factbook, Lingvoj  Very large scale  1.2B explicit + 0.9B inferred  10B retrievable statements  Managed by BigOWLIM  Free public service with many features:  Auto-suggest  Query and explore through Forest, RelFinder and Tabulator  RDF search  SPARQL end-point
  • 33. 33 Next Step, RENDER  New EU project  Large-scale LOD interoperability, methods  Players:  Karlsruher Institut fuer Technologie (DE)  Ontotext (BG)  Institut Jozef Stefan (SI)  Telefonica (ES)  Google (IE)  Wikimedia (DE)  STI Innsbruck (AT)  Testbed for possible follow-ons ??
  • 34. 34 Possible Ontotext + SD Contributions 1. Mapping services to all comers (“vocabulary neutrality”) 2. Tagging services 3. Software + systems for other tagging services 4. Possible technical support for Metadata Registry 5. Lead / support for possible EU grant-seeking efforts ↓↓↓ If DCMI willing to partner, Ontotext + SD willing to contribute in a neutral, open source manner
  • 35. 35 Ontotext + SD Links  FactForge http://www.factforge.net  PROTON http://proton.semanticweb.com  Ontotext http://www.ontotext.com  RENDER http://render-project.eu  UMBEL http://www.umbel.org  Structured Dynamics http://structureddynamics.com
  • 37. 37 Main Assertions Re-visited  Interoperability on the Web not working: 1. Not (generally) fulfilled by linked data in current state 2. Predicates for approximate mappings lacking 3. Reference vocabularies essential as connecting nodes  DCMI is the best (only?) player to plug these gaps  We are willing to help find the resources + right process to help plug the interoperability gap
  • 39.
  • 40. Q & A
  • 41. 41 Contacts & Information Michael K. Bergman CEO 319.621.5225 mike@structureddynamics.com blog: www.mkbergman.com Web Sites structureddynamics.com citizen-dan.org (community indicator systems) openstructs.org (open source software) techwiki.openstructs.org (open license technical documentation) umbel.org umbel.structureddynamics.com (UMBEL Web services)

Notas del editor

  1. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  2. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  3. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  4. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  5. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  6. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated
  7. Predicate Role Reference Concepts (“is About”) Role DCMI PerfectlySituated