SlideShare una empresa de Scribd logo
1 de 45
SchemEX – Building an Index
        for Linked Open Data
  Ansgar Scherp, Thomas Gottron, Mathias Konrath
  University of Koblenz-Landau, Germany



  Oslo, Norway
  August 2012
SchemEX – Building an Index for LOD   Slide 1 of 44
Learning Goals

• Understand the motivation and
  fundamentals of Linked Open Data (LOD).
• Qualify in why an index for LOD is needed
  and how to efficiently create such an index.




SchemEX – Building an Index for LOD   Slide 2 of 44
Scenario

• Tim plans to travel
  – from London
  – to a customer in Cologne




SchemEX – Building an Index for LOD   Slide 3 of 44
Website of the German Railway




It works, why bother…?
SchemEX – Building an Index for LOD   Slide 4 of 44
Let„s Try Different Queries

 Bottlenecks in public transportation?
 Compare the connections with flights?
 Visualize on a map?
…


 All these queries cannot be answered,
  because the data …


SchemEX – Building an Index for LOD   Slide 5 of 44
… locked in Silos!


 – High Integration Effort
 – Lack in Reuse of Data
SchemEX – Building an Index for LOD   Slide 6Jagendorf, http://www.flickr.com/photos/bobjagendorf/, CC-BY
                                          B. of 44
Linked Data
• Publishing and interlinking of data
• Different quality and purpose
• From different sources in the Web

          World Wide Web                       Linked Data
        Documents                          Data
        Hyperlinks                         Typed Links
        HTML                               RDF
        Addresses (URIs)                   Addresses (URIs)

               Example: http://www.uio.no/
SchemEX – Building an Index for LOD   Slide 7 of 44
Relevance of Linked Data?




SchemEX – Building an Index for LOD   Slide 8 of 44
Linked Data: May „07                                                Sept. „11
                                                Web 2.0


                                      Media



                                                                                  Publications

    eGovernment

                                      Cross-Domain



                                                                 Life
                Geographic                                     Sciences



SchemEX – Building an Index for LOD
< 31 Billion Triples                           Slide 9 of 44              Source: http://lod-cloud.net
Linked Data Principles


1.        Identification
2.        Interlinkage
3.        Dereferencing
4.        Description




SchemEX – Building an Index for LOD   Slide 10 of 44
Example: Big Lynx
                                 Matt Briggs




                                Scott Miller
                                                     ?
                                                     Big Lynx
                                                     Company




SchemEX – Building an Index
< 31 Milliarde Triple for LOD      Slide 11 of 44   Source: http://lod-cloud.net
1. Use URIs for Identification




 Matt Briggs


                                                                                Scott Miller
         http://biglynx.co.uk/
         people/matt-briggs
                                                                           http://biglynx.co.uk/
                                                                           people/scott-miller

SchemEX – Building B. Gazen,http://www.flickr.com/photos/bayat/,12 of 44
                   an Index for LOD                       Slide CC-BY
Example: Big Lynx
                                       Matt Briggs




                                      Scott Miller
                                                          Big Lynx
                                                          Company



 How to model relationships like knows?

SchemEX – Building an Index for LOD      Slide 13 of 44
Resource DescriptionFramework (RDF)
• Description of Ressources with RDF triple
            Matt Briggs                   is a         Person


                  Subject             Predicate         Object

 @prefix rdf:<http://w3.org/1999/02/22-rdf-
       syntax-ns#> .
 @prefix foaf:<http://xmlns.com/foaf/0.1/> .
 <http://biglynx.co.uk/people/matt-briggs>
     rdf:type foaf:Person .
SchemEX – Building an Index for LOD   Slide 14 of 44
1. Use URIs also for Relations




        http://biglynx.co.uk/
        people/matt-briggs

                                                                           http://biglynx.co.uk/
                                                                           people/scott-miller

SchemEX – Building B. Gazen,http://www.flickr.com/photos/bayat/,15 of 44
                   an Index for LOD                       Slide CC-BY
Example: Big Lynx
                                                             Dave Smith
         London
                                      „lives here―

                                        Matt Briggs

                                         „same
                                        Scott Miller
                                                            Big Lynx
                          …               person―
                                                            Company

           DBpedia                                           Matt Briggs

                               Matts private
                               Webseite
SchemEX – Building an Index for LOD        Slide 16 of 44
2. Establishing Interlinkage
• Relation links between ressources
       <http://biglynx.co.uk/people/dave-smith>
           foaf:based_near
           <http://dbpedia.org/resource/London> .


 Identity links between ressources
    <http://biglynx.co.uk/people/matt-briggs>
        owl:sameAs
         <http://www.matt-briggs.eg.uk#me> .
SchemEX – Building an Index for LOD   Slide 17 of 44
Example: Big Lynx
                                                               Dave Smith
         London
                                        „lives here―
                                      foaf:based_near


                                          Matt Briggs

                                           „same
                                          owl:sameAs
                                           Person―            Big Lynx
                                                              Company

           DBpedia                                             Matt Briggs

                               Matts private
                               Webseite
SchemEX – Building an Index for LOD          Slide 18 of 44
3. Dereferencing of URIs

• Looking up of web documents

• How can we ―look up‖ things of the real world?




                                 http://biglynx.co.uk/
                                 people/matt-briggs


SchemEX – Building an Index for LOD          Slide 19 of 44
Two Approaches
1. Hash URIs
   – URI contains a part separated by #, e.g.,
    http://biglynx.co.uk/vocab/sme#Team

2. Negotiation via „303 See Other― request
      http://biglynx.co.uk/people/matt-briggs
      Response: „Look here:―
      http://biglynx.co.uk/people/matt-briggs.rdf


SchemEX – Building an Index for LOD   Slide 20 of 44
Example: Big Lynx
                                                                Dave Smith
         London
                                      foaf:based_near


                               Description of
                                     Matt Briggs
                               Matt?
                                          owl:sameAs
                                                               Big Lynx
                                                               Company

           DBpedia                                              Matt Briggs

                               Matts private
                               Webseite
SchemEX – Building an Index for LOD           Slide 21 of 44
4. Description of URIs
                  foaf:Person                                                   …
…                                                   dp:Birmingham
                               rdf:type
                                                 foaf:based_near                …

             biglynx:matt-briggs                  ex:loc
                                                                 _:point
                               foaf:knows
                                                                             wgs84:
                                                             wgs84:            long
            biglynx:dave-smith
                                                             lat
                                                                           ―-0.118‖
                               foaf:based_near
                                                                ―51.509‖
                   dp:London

        …                                         …
SchemEX – Building an Index for LOD         Slide 22 of 44
RDF / RDF Schema Vocabulary
•    Set of URIs defined in rdf:/rdfs: namespace
•    rdf:type               • rdfs:domain
•    rdf:Property           • rdfs:range
•    rdf:XMLLiteral         • rdfs:Resource
•    rdf:List               • rdfs:Literal
•    rdf:first              • rdfs:Datatype
•    rdf:rest               • rdfs:Class
•    rdf:Seq                • rdfs:subClassOf
•    rdf:Bag                • rdfs:subPropertyOf
•    rdf:Alt                • rdfs:comment
•    ...                    • …
•    rdf:value              • rdfs:label
SchemEX – Building an Index for LOD   Slide 23 of 44
Semantic Web Layer Cake (Simplified)




SchemEX – Building an Index for LOD   Slide 24 of 44
Learning Goals

• Understand the motivation and
  fundamentals of Linked Open Data (LOD).
• Qualify in why an index for LOD is needed
  and how to efficiently create such an index.




SchemEX – Building an Index for LOD   Slide 25 of 44
Scenario
• People who are politicians and actors




• Who else?
• Where do they live?
• Whom do they know? …are they married with?
SchemEX – Building an Index for LOD   Slide 26 of 44
Problem
• No single federated query interface provided
• Execute those queries on the LOD cloud
SELECT ?x
FROM …
WHERE {
 ?x rdf:type ex:Actor .
 ?x rdf:type ex:Politician .
}




      “politicians
      and actors”
SchemEX – Building an Index for LOD   Slide 27 of 44
Principle Solution
• Suitable index structure for looking up sources




       “politicians
       and actors”

SchemEX – Building an Index for LOD   Slide 28 of 44
The Naive Approach
1.     Download the entire LOD cloud
2.     Put it into a (really) large triple store
3.     Process the data and extract schema
4.     Provide lookup

- Big machinery
- Late in processing the data
- High effort to scale with LOD cloud



SchemEX – Building an Index for LOD   Slide 29 of 44
Idea
 Schema-level index
   Define families of graph patterns
   Assign instances to graph patterns
   Map graph patterns to context (source URI)
 Construction
   Stream-based for scalability
   Little loss of accuracy
 Note
   Index defined over instances
   But stores the context
SchemEX – Building an Index for LOD   Slide 30 of 44
Input Data
 n-Quads
         <subject> <predicate> <object> <context>
 Example:
            <http://www.w3.org/People/Connolly/#me>
            <http://www.w3.org/1999/02/22-rdf-syntax-ns#
            <http://xmlns.com/foaf/0.1/Person>
            <http://dig.csail.mit.edu/2008/webdav/timbl/
                             http://dig.csail.mit.edu/2008/
                             webdav/timbl/foaf.rdf
                          w3p:
                          #me
                                                           foaf:
                                                          Person



SchemEX – Building an Index for LOD           Slide 31 of 44
Building the Schema and Index
                                                                     RDF
      C1                C2            C3         …           Ck
                                                                    classes
                                      consistsOf
                                                                     Type
         TC1                    TC2              …          TCm     clusters
hasEQ
Class                  p1                   p2
       EQC1                   EQC2               … EQCn           Equivalence
                                                                    classes
                                       hasDataSource

                                                 …                  Data
  DS1 DS2 DS3 DS4 DS5                                       DSx    sources
SchemEX – Building an Index for LOD        Slide 32 of 44
Layer 1: RDF Classes
 All instances of a                                                 C1
  particular type
                                                          DS 1      DS 2        DS 3

 SELECT ?x
 FROM …
 WHERE {
    ?x rdfs:type foaf:Person .
                           foaf:Person
 }

                                                                 http://dig.csail.mit.edu/2008/...
                                 foaf:
 timbl:                         Person
 card#i                                    http://www.w3.org/People/Berners-Lee/card



SchemEX – Building an Index for LOD      Slide 33 of 44
Layer 2: Type Clusters
 All instances belonging                                       C1         C2

  to exactly the same set
                                                                     TC1
  of types
 SELECT ?x                     DS 1      DS 2    DS 3
 FROM …
 WHERE {
                            foaf:Person       pim:Male
    ?x rdfs:type foaf:Person .
    ?x rdfs:type pim:Male .           tc4711
 }
                       pim:
                       Male
                                                 http://www.w3.org/People/Berners-Lee/card
                                       foaf:
 timbl:
                                      Person
 card#i
SchemEX – Building an Index for LOD            Slide 34 of 44
Layer 3: Equivalence Classes
 Two instances are                                     C1           C2         C3

  equivalent iff:
    They are in the same TC                                  TC1               TC2

    They have the same                                                   p
     properties
                                                              EQC1
    The property targets are
     in the same TC                                    DS 1     DS 2          DS 3




  Similar to 1-Bisimulation
SchemEX – Building an Index for LOD   Slide 35 of 44
Layer 3: Equivalence Classes
SELECT ?x
WHERE {
   ?x rdfs:type foaf:Person foaf:Person
                            .
   ?x rdfs:type pim:Male .            pim:Male foaf:PPD
   ?x foaf:maker ?y .
   ?y rdfs:type
      foaf:PersonalProfileDocument .
                                 tc4711         tc1234
}                                       eqc0815
                                                                           -maker-
 pim:           foaf:                   foaf:                               tc1234
 Male          Person                   PPD
                                                                 eqc0815
                                                                                foaf:maker


                                      timbl:     http://www.w3.org/People/Berners-Lee/card
      timbl:                           card
      card#i
SchemEX – Building an Index for LOD             Slide 36 of 44
The SchemEX Approach
• Stream-based schema extraction
• While crawling the data


                                      FIFO
LOD-Crawler                                    Instance-
 RDF-Dump                                        Cache       RDF
 Triple Store                                               RDBMS
                              NxParser

     Nquad-                                     Schema-     Schema-
                                Parser
     Stream                                     Extractor    Level
                                                             Index
SchemEX – Building an Index for LOD      Slide 37 of 44
Building the Index from a Stream
 Stream of n-quads (coming from a LD crawler)
      … Q16, Q15, Q14, Q13, Q12, Q11, Q10, Q9, Q8, Q7, Q6, Q5, Q4, Q3, Q2, Q1



                                                       FiFo
                                                                     1
                                             C3         4
                                                                     6
                                             C2         3
                                                                     4
                                                        2
                                             C2                      2
                                                        1                3
                                             C1                      5



• Linear runtime complexity wrt # of input triples
SchemEX – Building an Index for LOD   Slide 38 of 44
Computing SchemEX: TimBL Data Set
• Analysis of a smaller data set
• 11 M triples, TimBL’s FOAF profile
• LDspider with ~ 2k triples / sec


•    Different cache sizes: 100, 1k, 10k, 50k, 100k
•    Compared SchemEX with reference schema
•    Index queries on all Types, TCs, EQCs
•    Good precision/recall ratio at 50k+

SchemEX – Building an Index for LOD   Slide 39 of 44
Quality of Stream-based Index
Construction




• Runtime increases hardly with window size
• Memory consumption scales with window size
SchemEX – Building an Index for LOD   Slide 40 of 44
Computing SchemEX: Full BTC 2011 Data




Cache size: 50 k
SchemEX – Building an Index for LOD   Slide 41 of 44
Billion Triple Challenge 2011




  …




SchemEX – Building an Index for LOD   Slide 42 of 44
Conclusions: SchemEX
• Linked Open Data (LOD) approach
   • Publishing and interlinking data on the web

• SchemEX
   • Stream-based approach to LOD schema
     extraction
   • Scalable to arbitrary amount of Linked Data
   • Applicable on commodity hardware
     (4GB RAM, single CPU)

SchemEX – Building an Index for LOD   Slide 43 of 44
Learning Goals

• Understand the motivation and
  fundamentals of Linked Open Data (LOD).
• Qualify in why an index for LOD is needed
  and how to efficiently create such an index.




SchemEX – Building an Index for LOD   Slide 44 of 44
Recommended Readings
• Maciej Janik, Ansgar Scherp, Steffen Staab: The Semantic Web:
  Collective Intelligence on the Web. Informatik Spektrum 34(5): 469-483
  (2011)
  URL: http://dx.doi.org/10.1007/s00287-011-0535-x

• Mathias Konrath, Thomas Gottron, Steffen Staab, Ansgar Scherp:
  SchemEX — Efficient construction of a data catalogue by stream-based
  indexing of linked data, J. of Web Semantics: Science, Services and
  Agents on the World Wide Web, Available online 23 June 2012
  URL: http://www.sciencedirect.com/science/article/pii/S1570826812000716

• Tom Heath, Christian Bizer: Linked Data: Evolving the Web into a Global
  Data Space, Morgan & Claypool Publishers, 2011
  URL: http://dx.doi.org/10.2200/S00334ED1V01Y201102WBE001


SchemEX – Building an Index for LOD   Slide 45 of 44

Más contenido relacionado

La actualidad más candente

Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Herbert Van de Sompel
 
Place graphs are the new social graphs
Place graphs are the new social graphsPlace graphs are the new social graphs
Place graphs are the new social graphsMatt Biddulph
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsChris Bizer
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueHerbert Van de Sompel
 
Context is King: On Semantic Publishing
Context is King: On Semantic PublishingContext is King: On Semantic Publishing
Context is King: On Semantic PublishingStefan Gradmann
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DoneHerbert Van de Sompel
 

La actualidad más candente (7)

Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)Signposting Overview (Version November 2017)
Signposting Overview (Version November 2017)
 
Place graphs are the new social graphs
Place graphs are the new social graphsPlace graphs are the new social graphs
Place graphs are the new social graphs
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and Applications
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Context is King: On Semantic Publishing
Context is King: On Semantic PublishingContext is King: On Semantic Publishing
Context is King: On Semantic Publishing
 
Linked Data
Linked DataLinked Data
Linked Data
 
Persistent Identification: Easier Said than Done
Persistent Identification: Easier Said than DonePersistent Identification: Easier Said than Done
Persistent Identification: Easier Said than Done
 

Destacado

وثيقة النموذج المرجعي للتطبيقات الوطنية
وثيقة النموذج المرجعي للتطبيقات الوطنيةوثيقة النموذج المرجعي للتطبيقات الوطنية
وثيقة النموذج المرجعي للتطبيقات الوطنيةYesserProgram
 
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجي
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجيواقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجي
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجيSaeed Al Dhaheri
 
Using Minecraft for community engagement and public space design
Using Minecraft for community engagement and public space designUsing Minecraft for community engagement and public space design
Using Minecraft for community engagement and public space designmysociety
 
الدليل التعريفي بالبيانات المفتوحة
الدليل التعريفي بالبيانات المفتوحةالدليل التعريفي بالبيانات المفتوحة
الدليل التعريفي بالبيانات المفتوحةYesserProgram
 
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016YesserProgram
 
واقع الحكومة الالكترونية العربية
واقع الحكومة الالكترونية العربيةواقع الحكومة الالكترونية العربية
واقع الحكومة الالكترونية العربيةAbbas Badran
 
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟YesserProgram
 
نظم التشغيل المفتوحة والتطبيقات الحرة
نظم التشغيل المفتوحة والتطبيقات الحرةنظم التشغيل المفتوحة والتطبيقات الحرة
نظم التشغيل المفتوحة والتطبيقات الحرةissa alsumri
 
الإدارة الإلكترونية
الإدارة الإلكترونيةالإدارة الإلكترونية
الإدارة الإلكترونيةProf. Sherif Shaheen
 
الحكومة الالكترونية الامكانات و المعوقات
الحكومة الالكترونية الامكانات و المعوقاتالحكومة الالكترونية الامكانات و المعوقات
الحكومة الالكترونية الامكانات و المعوقاتMajid eddan
 

Destacado (12)

وثيقة النموذج المرجعي للتطبيقات الوطنية
وثيقة النموذج المرجعي للتطبيقات الوطنيةوثيقة النموذج المرجعي للتطبيقات الوطنية
وثيقة النموذج المرجعي للتطبيقات الوطنية
 
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجي
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجيواقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجي
واقع البوابات المفتوحة في الحكومات الالكترونية في دول مجلس التعاون الخليجي
 
Using Minecraft for community engagement and public space design
Using Minecraft for community engagement and public space designUsing Minecraft for community engagement and public space design
Using Minecraft for community engagement and public space design
 
الدليل التعريفي بالبيانات المفتوحة
الدليل التعريفي بالبيانات المفتوحةالدليل التعريفي بالبيانات المفتوحة
الدليل التعريفي بالبيانات المفتوحة
 
البيانات المفتوحة المترابطة
البيانات المفتوحة المترابطة البيانات المفتوحة المترابطة
البيانات المفتوحة المترابطة
 
Global - Local Open Data Index
Global - Local Open Data Index Global - Local Open Data Index
Global - Local Open Data Index
 
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016
تقرير الامم المتحدة لتطور الحكومة الالكترونية لعام 2016
 
واقع الحكومة الالكترونية العربية
واقع الحكومة الالكترونية العربيةواقع الحكومة الالكترونية العربية
واقع الحكومة الالكترونية العربية
 
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟
ماذا يتعين على الجهات الحكومية لمواكبة ‏رؤية السعودية 2030‬ ؟
 
نظم التشغيل المفتوحة والتطبيقات الحرة
نظم التشغيل المفتوحة والتطبيقات الحرةنظم التشغيل المفتوحة والتطبيقات الحرة
نظم التشغيل المفتوحة والتطبيقات الحرة
 
الإدارة الإلكترونية
الإدارة الإلكترونيةالإدارة الإلكترونية
الإدارة الإلكترونية
 
الحكومة الالكترونية الامكانات و المعوقات
الحكومة الالكترونية الامكانات و المعوقاتالحكومة الالكترونية الامكانات و المعوقات
الحكومة الالكترونية الامكانات و المعوقات
 

Similar a SchemEX -- Building an Index for Linked Open Data

Skb web2.0
Skb web2.0Skb web2.0
Skb web2.0animove
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategySemantic Web Company
 
Beautifying Data in the real world
Beautifying Data in the real worldBeautifying Data in the real world
Beautifying Data in the real worldTan Tran
 
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)Emil Eifrem
 
Linked Data an Introduction
Linked Data an IntroductionLinked Data an Introduction
Linked Data an IntroductionTalis Consulting
 
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?Martin Hepp
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Artificial Intelligence Institute at UofSC
 
Science and Web2.0
Science and Web2.0Science and Web2.0
Science and Web2.0Ian Mulvany
 
NoSQL intro for YaJUG / NoSQL UG Luxembourg
NoSQL intro for YaJUG / NoSQL UG LuxembourgNoSQL intro for YaJUG / NoSQL UG Luxembourg
NoSQL intro for YaJUG / NoSQL UG LuxembourgNGDATA
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Towards Interoperable Metadata Provenance
Towards Interoperable Metadata ProvenanceTowards Interoperable Metadata Provenance
Towards Interoperable Metadata ProvenanceKai Eckert
 
Provenance for Multimedia
Provenance for MultimediaProvenance for Multimedia
Provenance for MultimediaRaphael Troncy
 
Semantic web and Linked Data
Semantic web and Linked DataSemantic web and Linked Data
Semantic web and Linked DataHyun Namgoong
 
A Study of Internet RFC Authors using NetDraw and yEd
A Study of Internet RFC Authors using NetDraw and yEdA Study of Internet RFC Authors using NetDraw and yEd
A Study of Internet RFC Authors using NetDraw and yEdOlivier MJ Crépin-Leblond
 
The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic webTony Dobaj
 
Twitter case study
Twitter case studyTwitter case study
Twitter case studydivya_binu
 
Jarrar: The Next Generation of the Web 3.0: The Semantic Web
Jarrar: The Next Generation of the Web 3.0: The Semantic WebJarrar: The Next Generation of the Web 3.0: The Semantic Web
Jarrar: The Next Generation of the Web 3.0: The Semantic WebMustafa Jarrar
 

Similar a SchemEX -- Building an Index for Linked Open Data (20)

Skb web2.0
Skb web2.0Skb web2.0
Skb web2.0
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data Strategy
 
Beautifying Data in the real world
Beautifying Data in the real worldBeautifying Data in the real world
Beautifying Data in the real world
 
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)
A NOSQL Overview And The Benefits Of Graph Databases (nosql east 2009)
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 
Linked Data an Introduction
Linked Data an IntroductionLinked Data an Introduction
Linked Data an Introduction
 
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
Science and Web2.0
Science and Web2.0Science and Web2.0
Science and Web2.0
 
NoSQL intro for YaJUG / NoSQL UG Luxembourg
NoSQL intro for YaJUG / NoSQL UG LuxembourgNoSQL intro for YaJUG / NoSQL UG Luxembourg
NoSQL intro for YaJUG / NoSQL UG Luxembourg
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Towards Interoperable Metadata Provenance
Towards Interoperable Metadata ProvenanceTowards Interoperable Metadata Provenance
Towards Interoperable Metadata Provenance
 
Provenance for Multimedia
Provenance for MultimediaProvenance for Multimedia
Provenance for Multimedia
 
Semantic web and Linked Data
Semantic web and Linked DataSemantic web and Linked Data
Semantic web and Linked Data
 
A Study of Internet RFC Authors using NetDraw and yEd
A Study of Internet RFC Authors using NetDraw and yEdA Study of Internet RFC Authors using NetDraw and yEd
A Study of Internet RFC Authors using NetDraw and yEd
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 
The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic web
 
Unlocking Doors: recent initiatives in open and linked data at the National L...
Unlocking Doors: recent initiatives in open and linked data at the National L...Unlocking Doors: recent initiatives in open and linked data at the National L...
Unlocking Doors: recent initiatives in open and linked data at the National L...
 
Twitter case study
Twitter case studyTwitter case study
Twitter case study
 
Jarrar: The Next Generation of the Web 3.0: The Semantic Web
Jarrar: The Next Generation of the Web 3.0: The Semantic WebJarrar: The Next Generation of the Web 3.0: The Semantic Web
Jarrar: The Next Generation of the Web 3.0: The Semantic Web
 

Más de Ansgar Scherp

Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...
Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...
Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...Ansgar Scherp
 
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...Ansgar Scherp
 
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...Text Localization in Scientific Figures using Fully Convolutional Neural Netw...
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...Ansgar Scherp
 
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresA Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresAnsgar Scherp
 
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...Ansgar Scherp
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataAnsgar Scherp
 
Knowledge Discovery in Social Media and Scientific Digital Libraries
Knowledge Discovery in Social Media and Scientific Digital LibrariesKnowledge Discovery in Social Media and Scientific Digital Libraries
Knowledge Discovery in Social Media and Scientific Digital LibrariesAnsgar Scherp
 
A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationAnsgar Scherp
 
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...Ansgar Scherp
 
A Framework for Iterative Signing of Graph Data on the Web
A Framework for Iterative Signing of Graph Data on the WebA Framework for Iterative Signing of Graph Data on the Web
A Framework for Iterative Signing of Graph Data on the WebAnsgar Scherp
 
Smart photo selection: interpret gaze as personal interest
Smart photo selection: interpret gaze as personal interestSmart photo selection: interpret gaze as personal interest
Smart photo selection: interpret gaze as personal interestAnsgar Scherp
 
Events in Multimedia - Theory, Model, Application
Events in Multimedia - Theory, Model, ApplicationEvents in Multimedia - Theory, Model, Application
Events in Multimedia - Theory, Model, ApplicationAnsgar Scherp
 
Can you see it? Annotating Image Regions based on Users' Gaze Information
Can you see it? Annotating Image Regions based on Users' Gaze InformationCan you see it? Annotating Image Regions based on Users' Gaze Information
Can you see it? Annotating Image Regions based on Users' Gaze InformationAnsgar Scherp
 
A Model of Events for Integrating Event-based Information in Complex Socio-te...
A Model of Events for Integrating Event-based Information in Complex Socio-te...A Model of Events for Integrating Event-based Information in Complex Socio-te...
A Model of Events for Integrating Event-based Information in Complex Socio-te...Ansgar Scherp
 
SchemEX - Creating the Yellow Pages for the Linked Open Data Cloud
SchemEX - Creating the Yellow Pages for the Linked Open Data CloudSchemEX - Creating the Yellow Pages for the Linked Open Data Cloud
SchemEX - Creating the Yellow Pages for the Linked Open Data CloudAnsgar Scherp
 
strukt - A Pattern System for Integrating Individual and Organizational Knowl...
strukt - A Pattern System for Integrating Individual and Organizational Knowl...strukt - A Pattern System for Integrating Individual and Organizational Knowl...
strukt - A Pattern System for Integrating Individual and Organizational Knowl...Ansgar Scherp
 
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...Ansgar Scherp
 
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)Ansgar Scherp
 

Más de Ansgar Scherp (18)

Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...
Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...
Analysis of GraphSum's Attention Weights to Improve the Explainability of Mul...
 
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...
STEREO: A Pipeline for Extracting Experiment Statistics, Conditions, and Topi...
 
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...Text Localization in Scientific Figures using Fully Convolutional Neural Netw...
Text Localization in Scientific Figures using Fully Convolutional Neural Netw...
 
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly FiguresA Comparison of Approaches for Automated Text Extraction from Scholarly Figures
A Comparison of Approaches for Automated Text Extraction from Scholarly Figures
 
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...
About Multimedia Presentation Generation and Multimedia Metadata: From Synthe...
 
Mining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open DataMining and Managing Large-scale Linked Open Data
Mining and Managing Large-scale Linked Open Data
 
Knowledge Discovery in Social Media and Scientific Digital Libraries
Knowledge Discovery in Social Media and Scientific Digital LibrariesKnowledge Discovery in Social Media and Scientific Digital Libraries
Knowledge Discovery in Social Media and Scientific Digital Libraries
 
A Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document AnnotationA Comparison of Different Strategies for Automated Semantic Document Annotation
A Comparison of Different Strategies for Automated Semantic Document Annotation
 
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...
Formalization and Preliminary Evaluation of a Pipeline for Text Extraction Fr...
 
A Framework for Iterative Signing of Graph Data on the Web
A Framework for Iterative Signing of Graph Data on the WebA Framework for Iterative Signing of Graph Data on the Web
A Framework for Iterative Signing of Graph Data on the Web
 
Smart photo selection: interpret gaze as personal interest
Smart photo selection: interpret gaze as personal interestSmart photo selection: interpret gaze as personal interest
Smart photo selection: interpret gaze as personal interest
 
Events in Multimedia - Theory, Model, Application
Events in Multimedia - Theory, Model, ApplicationEvents in Multimedia - Theory, Model, Application
Events in Multimedia - Theory, Model, Application
 
Can you see it? Annotating Image Regions based on Users' Gaze Information
Can you see it? Annotating Image Regions based on Users' Gaze InformationCan you see it? Annotating Image Regions based on Users' Gaze Information
Can you see it? Annotating Image Regions based on Users' Gaze Information
 
A Model of Events for Integrating Event-based Information in Complex Socio-te...
A Model of Events for Integrating Event-based Information in Complex Socio-te...A Model of Events for Integrating Event-based Information in Complex Socio-te...
A Model of Events for Integrating Event-based Information in Complex Socio-te...
 
SchemEX - Creating the Yellow Pages for the Linked Open Data Cloud
SchemEX - Creating the Yellow Pages for the Linked Open Data CloudSchemEX - Creating the Yellow Pages for the Linked Open Data Cloud
SchemEX - Creating the Yellow Pages for the Linked Open Data Cloud
 
strukt - A Pattern System for Integrating Individual and Organizational Knowl...
strukt - A Pattern System for Integrating Individual and Organizational Knowl...strukt - A Pattern System for Integrating Individual and Organizational Knowl...
strukt - A Pattern System for Integrating Individual and Organizational Knowl...
 
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...
Identifying Objects in Images from Analyzing the User‘s Gaze Movements for Pr...
 
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)
Linked Open Data (Entwurfsprinzipien und Muster für vernetzte Daten)
 

Último

Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 

Último (20)

Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 

SchemEX -- Building an Index for Linked Open Data

  • 1. SchemEX – Building an Index for Linked Open Data Ansgar Scherp, Thomas Gottron, Mathias Konrath University of Koblenz-Landau, Germany Oslo, Norway August 2012 SchemEX – Building an Index for LOD Slide 1 of 44
  • 2. Learning Goals • Understand the motivation and fundamentals of Linked Open Data (LOD). • Qualify in why an index for LOD is needed and how to efficiently create such an index. SchemEX – Building an Index for LOD Slide 2 of 44
  • 3. Scenario • Tim plans to travel – from London – to a customer in Cologne SchemEX – Building an Index for LOD Slide 3 of 44
  • 4. Website of the German Railway It works, why bother…? SchemEX – Building an Index for LOD Slide 4 of 44
  • 5. Let„s Try Different Queries  Bottlenecks in public transportation?  Compare the connections with flights?  Visualize on a map? …  All these queries cannot be answered, because the data … SchemEX – Building an Index for LOD Slide 5 of 44
  • 6. … locked in Silos! – High Integration Effort – Lack in Reuse of Data SchemEX – Building an Index for LOD Slide 6Jagendorf, http://www.flickr.com/photos/bobjagendorf/, CC-BY B. of 44
  • 7. Linked Data • Publishing and interlinking of data • Different quality and purpose • From different sources in the Web World Wide Web Linked Data Documents Data Hyperlinks Typed Links HTML RDF Addresses (URIs) Addresses (URIs) Example: http://www.uio.no/ SchemEX – Building an Index for LOD Slide 7 of 44
  • 8. Relevance of Linked Data? SchemEX – Building an Index for LOD Slide 8 of 44
  • 9. Linked Data: May „07  Sept. „11 Web 2.0 Media Publications eGovernment Cross-Domain Life Geographic Sciences SchemEX – Building an Index for LOD < 31 Billion Triples Slide 9 of 44 Source: http://lod-cloud.net
  • 10. Linked Data Principles 1. Identification 2. Interlinkage 3. Dereferencing 4. Description SchemEX – Building an Index for LOD Slide 10 of 44
  • 11. Example: Big Lynx Matt Briggs Scott Miller ? Big Lynx Company SchemEX – Building an Index < 31 Milliarde Triple for LOD Slide 11 of 44 Source: http://lod-cloud.net
  • 12. 1. Use URIs for Identification Matt Briggs Scott Miller http://biglynx.co.uk/ people/matt-briggs http://biglynx.co.uk/ people/scott-miller SchemEX – Building B. Gazen,http://www.flickr.com/photos/bayat/,12 of 44 an Index for LOD Slide CC-BY
  • 13. Example: Big Lynx Matt Briggs Scott Miller Big Lynx Company  How to model relationships like knows? SchemEX – Building an Index for LOD Slide 13 of 44
  • 14. Resource DescriptionFramework (RDF) • Description of Ressources with RDF triple Matt Briggs is a Person Subject Predicate Object @prefix rdf:<http://w3.org/1999/02/22-rdf- syntax-ns#> . @prefix foaf:<http://xmlns.com/foaf/0.1/> . <http://biglynx.co.uk/people/matt-briggs> rdf:type foaf:Person . SchemEX – Building an Index for LOD Slide 14 of 44
  • 15. 1. Use URIs also for Relations http://biglynx.co.uk/ people/matt-briggs http://biglynx.co.uk/ people/scott-miller SchemEX – Building B. Gazen,http://www.flickr.com/photos/bayat/,15 of 44 an Index for LOD Slide CC-BY
  • 16. Example: Big Lynx Dave Smith London „lives here― Matt Briggs „same Scott Miller Big Lynx … person― Company DBpedia Matt Briggs Matts private Webseite SchemEX – Building an Index for LOD Slide 16 of 44
  • 17. 2. Establishing Interlinkage • Relation links between ressources <http://biglynx.co.uk/people/dave-smith> foaf:based_near <http://dbpedia.org/resource/London> .  Identity links between ressources <http://biglynx.co.uk/people/matt-briggs> owl:sameAs <http://www.matt-briggs.eg.uk#me> . SchemEX – Building an Index for LOD Slide 17 of 44
  • 18. Example: Big Lynx Dave Smith London „lives here― foaf:based_near Matt Briggs „same owl:sameAs Person― Big Lynx Company DBpedia Matt Briggs Matts private Webseite SchemEX – Building an Index for LOD Slide 18 of 44
  • 19. 3. Dereferencing of URIs • Looking up of web documents • How can we ―look up‖ things of the real world? http://biglynx.co.uk/ people/matt-briggs SchemEX – Building an Index for LOD Slide 19 of 44
  • 20. Two Approaches 1. Hash URIs – URI contains a part separated by #, e.g., http://biglynx.co.uk/vocab/sme#Team 2. Negotiation via „303 See Other― request http://biglynx.co.uk/people/matt-briggs Response: „Look here:― http://biglynx.co.uk/people/matt-briggs.rdf SchemEX – Building an Index for LOD Slide 20 of 44
  • 21. Example: Big Lynx Dave Smith London foaf:based_near Description of Matt Briggs Matt? owl:sameAs Big Lynx Company DBpedia Matt Briggs Matts private Webseite SchemEX – Building an Index for LOD Slide 21 of 44
  • 22. 4. Description of URIs foaf:Person … … dp:Birmingham rdf:type foaf:based_near … biglynx:matt-briggs ex:loc _:point foaf:knows wgs84: wgs84: long biglynx:dave-smith lat ―-0.118‖ foaf:based_near ―51.509‖ dp:London … … SchemEX – Building an Index for LOD Slide 22 of 44
  • 23. RDF / RDF Schema Vocabulary • Set of URIs defined in rdf:/rdfs: namespace • rdf:type • rdfs:domain • rdf:Property • rdfs:range • rdf:XMLLiteral • rdfs:Resource • rdf:List • rdfs:Literal • rdf:first • rdfs:Datatype • rdf:rest • rdfs:Class • rdf:Seq • rdfs:subClassOf • rdf:Bag • rdfs:subPropertyOf • rdf:Alt • rdfs:comment • ... • … • rdf:value • rdfs:label SchemEX – Building an Index for LOD Slide 23 of 44
  • 24. Semantic Web Layer Cake (Simplified) SchemEX – Building an Index for LOD Slide 24 of 44
  • 25. Learning Goals • Understand the motivation and fundamentals of Linked Open Data (LOD). • Qualify in why an index for LOD is needed and how to efficiently create such an index. SchemEX – Building an Index for LOD Slide 25 of 44
  • 26. Scenario • People who are politicians and actors • Who else? • Where do they live? • Whom do they know? …are they married with? SchemEX – Building an Index for LOD Slide 26 of 44
  • 27. Problem • No single federated query interface provided • Execute those queries on the LOD cloud SELECT ?x FROM … WHERE { ?x rdf:type ex:Actor . ?x rdf:type ex:Politician . } “politicians and actors” SchemEX – Building an Index for LOD Slide 27 of 44
  • 28. Principle Solution • Suitable index structure for looking up sources “politicians and actors” SchemEX – Building an Index for LOD Slide 28 of 44
  • 29. The Naive Approach 1. Download the entire LOD cloud 2. Put it into a (really) large triple store 3. Process the data and extract schema 4. Provide lookup - Big machinery - Late in processing the data - High effort to scale with LOD cloud SchemEX – Building an Index for LOD Slide 29 of 44
  • 30. Idea  Schema-level index  Define families of graph patterns  Assign instances to graph patterns  Map graph patterns to context (source URI)  Construction  Stream-based for scalability  Little loss of accuracy  Note  Index defined over instances  But stores the context SchemEX – Building an Index for LOD Slide 30 of 44
  • 31. Input Data  n-Quads <subject> <predicate> <object> <context>  Example: <http://www.w3.org/People/Connolly/#me> <http://www.w3.org/1999/02/22-rdf-syntax-ns# <http://xmlns.com/foaf/0.1/Person> <http://dig.csail.mit.edu/2008/webdav/timbl/ http://dig.csail.mit.edu/2008/ webdav/timbl/foaf.rdf w3p: #me foaf: Person SchemEX – Building an Index for LOD Slide 31 of 44
  • 32. Building the Schema and Index RDF C1 C2 C3 … Ck classes consistsOf Type TC1 TC2 … TCm clusters hasEQ Class p1 p2 EQC1 EQC2 … EQCn Equivalence classes hasDataSource … Data DS1 DS2 DS3 DS4 DS5 DSx sources SchemEX – Building an Index for LOD Slide 32 of 44
  • 33. Layer 1: RDF Classes  All instances of a C1 particular type DS 1 DS 2 DS 3 SELECT ?x FROM … WHERE { ?x rdfs:type foaf:Person . foaf:Person } http://dig.csail.mit.edu/2008/... foaf: timbl: Person card#i http://www.w3.org/People/Berners-Lee/card SchemEX – Building an Index for LOD Slide 33 of 44
  • 34. Layer 2: Type Clusters  All instances belonging C1 C2 to exactly the same set TC1 of types SELECT ?x DS 1 DS 2 DS 3 FROM … WHERE { foaf:Person pim:Male ?x rdfs:type foaf:Person . ?x rdfs:type pim:Male . tc4711 } pim: Male http://www.w3.org/People/Berners-Lee/card foaf: timbl: Person card#i SchemEX – Building an Index for LOD Slide 34 of 44
  • 35. Layer 3: Equivalence Classes  Two instances are C1 C2 C3 equivalent iff:  They are in the same TC TC1 TC2  They have the same p properties EQC1  The property targets are in the same TC DS 1 DS 2 DS 3  Similar to 1-Bisimulation SchemEX – Building an Index for LOD Slide 35 of 44
  • 36. Layer 3: Equivalence Classes SELECT ?x WHERE { ?x rdfs:type foaf:Person foaf:Person . ?x rdfs:type pim:Male . pim:Male foaf:PPD ?x foaf:maker ?y . ?y rdfs:type foaf:PersonalProfileDocument . tc4711 tc1234 } eqc0815 -maker- pim: foaf: foaf: tc1234 Male Person PPD eqc0815 foaf:maker timbl: http://www.w3.org/People/Berners-Lee/card timbl: card card#i SchemEX – Building an Index for LOD Slide 36 of 44
  • 37. The SchemEX Approach • Stream-based schema extraction • While crawling the data FIFO LOD-Crawler Instance- RDF-Dump Cache RDF Triple Store RDBMS NxParser Nquad- Schema- Schema- Parser Stream Extractor Level Index SchemEX – Building an Index for LOD Slide 37 of 44
  • 38. Building the Index from a Stream  Stream of n-quads (coming from a LD crawler) … Q16, Q15, Q14, Q13, Q12, Q11, Q10, Q9, Q8, Q7, Q6, Q5, Q4, Q3, Q2, Q1 FiFo 1 C3 4 6 C2 3 4 2 C2 2 1 3 C1 5 • Linear runtime complexity wrt # of input triples SchemEX – Building an Index for LOD Slide 38 of 44
  • 39. Computing SchemEX: TimBL Data Set • Analysis of a smaller data set • 11 M triples, TimBL’s FOAF profile • LDspider with ~ 2k triples / sec • Different cache sizes: 100, 1k, 10k, 50k, 100k • Compared SchemEX with reference schema • Index queries on all Types, TCs, EQCs • Good precision/recall ratio at 50k+ SchemEX – Building an Index for LOD Slide 39 of 44
  • 40. Quality of Stream-based Index Construction • Runtime increases hardly with window size • Memory consumption scales with window size SchemEX – Building an Index for LOD Slide 40 of 44
  • 41. Computing SchemEX: Full BTC 2011 Data Cache size: 50 k SchemEX – Building an Index for LOD Slide 41 of 44
  • 42. Billion Triple Challenge 2011 … SchemEX – Building an Index for LOD Slide 42 of 44
  • 43. Conclusions: SchemEX • Linked Open Data (LOD) approach • Publishing and interlinking data on the web • SchemEX • Stream-based approach to LOD schema extraction • Scalable to arbitrary amount of Linked Data • Applicable on commodity hardware (4GB RAM, single CPU) SchemEX – Building an Index for LOD Slide 43 of 44
  • 44. Learning Goals • Understand the motivation and fundamentals of Linked Open Data (LOD). • Qualify in why an index for LOD is needed and how to efficiently create such an index. SchemEX – Building an Index for LOD Slide 44 of 44
  • 45. Recommended Readings • Maciej Janik, Ansgar Scherp, Steffen Staab: The Semantic Web: Collective Intelligence on the Web. Informatik Spektrum 34(5): 469-483 (2011) URL: http://dx.doi.org/10.1007/s00287-011-0535-x • Mathias Konrath, Thomas Gottron, Steffen Staab, Ansgar Scherp: SchemEX — Efficient construction of a data catalogue by stream-based indexing of linked data, J. of Web Semantics: Science, Services and Agents on the World Wide Web, Available online 23 June 2012 URL: http://www.sciencedirect.com/science/article/pii/S1570826812000716 • Tom Heath, Christian Bizer: Linked Data: Evolving the Web into a Global Data Space, Morgan & Claypool Publishers, 2011 URL: http://dx.doi.org/10.2200/S00334ED1V01Y201102WBE001 SchemEX – Building an Index for LOD Slide 45 of 44