SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
LECTURE 7:
     LINKED (OPEN) DATA

                     Marieke van Erp
(with slides from Victor de Boer and Christophe Guéret)
TODAY’S LECTURE
•   Why Linked (Open) Data?

•   What is Linked (Open) Data?

•   The story of Linked Open Data

•   Contributing to Linked Data

•   Standards and best practices

•   Consuming Linked Data

•   Drawbacks and problems
WHY LINKED DATA (1/2)




Slide stolen from Christophe Guéret
WHY LINKED DATA (2/2)




Slide stolen from Christophe Guéret
``Sharable, spreadable and nerd-friendly’’




                  -- Charlotte S H Jensen,
WHAT IS LINKED DATA?

•   Linked Data is a method to publish
    structured data for interlinking with
    other data sources

•   Standard Web technology (HTTP
    and URIs)

•   Making information more easily
    readable and shareable for machines

•   Linked Open Data is a W3C
    community project to extend the
    Web with open data sets
                                       http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
THE STORY




            May 2007
Oct
“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
CONTRIBUTING TO LINKED DATA
Yes, it may be scary to open up
your data but it may lead to:
  • Transparency

  • Participation

  • Improvement

  • Innovation

  • New knowledge & insights
   from combined data sources
STANDARDS AND BEST
     PRACTICES
LINKED OPEN DATA FIVE STAR SYSTEM
                       Available on the web (whatever
              ★
                       format), but with an open license

                       Available as machine-readable
              ★★       structured data (e.g. excel instead of
                       image scan of a table)

                       as (2) plus non-proprietary format (e.g.
              ★★★
                       CSV instead of excel)

                       All the above plus, Use open standards
                       from W3C (RDF and SPARQL) to
              ★★★★
                       identify things, so that people can point
                       at your stuff

                       All the above, plus: Link your data to
              ★★★★★
                       other people’s data to provide context




                     www.w3.org/designissues/
FOUR RULES OF LINKED DATA

1. Use URIs as names for things (Resources)

2. Use HTTP URIs so that people can look up those names.
   (Dereferencing)

3. When someone looks up a URI, provide useful information,
   using the standards (RDF*, SPARQL)

4. Include links to other URIs. so that they can discover more
   things.
                              http://www.w3.org/DesignIssues/
FOUR RULES OF LINKED DATA

1. Use URIs as names for things (Resources)

2. Use HTTP URIs so that people can look up those names.
   (Dereferencing)

3. When someone looks up a URI, provide useful information,
   using the standards (RDF*, SPARQL)

4. Include links to other URIs. so that they can discover more
   things.
                              http://www.w3.org/DesignIssues/
HOW TO MAKE COOL URI’S

•   Use HTTP://

•   Use a namespace you control

•   Unique, stable and persistent



•   Don’t use:

    •   Author name, subject, status, access, file name extension, software mechanism

        C://MyDisk/awesome/MvanErp/latest/cgi_bin/rembrandt.html
FOUR RULES OF LINKED DATA

1. Use URIs as names for things

2. Use HTTP URIs so that people can look up those names.
   (Dereferencing)

3. When someone looks up a URI, provide useful information,
   using the standards (RDF*, SPARQL)

4. Include links to other URIs. so that they can discover more
   things.
                              http://www.w3.org/DesignIssues/
RDF REMINDER
Subject	
      	
       Predicate	
   	
                  Object	
                                   Triples
am:Rembrandt	
          am:hasBirthdate	
                 “1651”

am:Rembrandt	
          foaf:knows	
                      am:PiterLastman


am:PiterLastman	
       am:wasBornIn	
                    geonames:Amsterdam




                                                   ate
                                     am:h asBir thd
                                                                “1651”              geonames:Amsterdam

              am:Rembrandt
                                             foaf:knows
                                                                                    am:w asBorn
                                                                                                In
                                                                                                     Graph
                                                                  am:PiterLastman
RDF CONVERSION
     <record priref="19319 “ >
         <date>1651</date>
<maker>Rembrandt (1606-1669)</maker>
  <object.type>etsplaat</object.type>
                 …                                                      priref                    “19319 ”
             </record>                                                             date            “1651”
                                                                 am:Record
                                                                   _:bn1                      maker
                                                                                                 “Rembrandt (1606-1669)”

                                                                       object.type                “etsplaat”

                                 “19319 ”
             iref
         :pr
      am          am:date             “1651”
                                                                                                   “1234”
                                                                        am:priref
am:Record                        am:maker                  am:Person
                                                                             am:birthdate
 am:proxy-19319                                                                               “1606”
                                                           am:p-1234
                                                                         rda:name                  “Rembrandt”
              am
                 :obje
                       ct.ty
                            pe
                                            skos:Concept                                  “etsplaat”
                                             am:etsplaat          skos:prefLabel
ARCHITECTURE
            SPARQL-app           Browser

                                            Purl.org
                                            redirect


               SPARQL        Web interface
                      HTTP server
                                    Logic
        a




            RDF(s) storage
    tri
  pa
clio




                        Prolog

                                              http://
HOW TO ACCESS THE DATA
• PURL 303 redirect to VU semantic layer
   http://purl.org/collections/nl/am/proxy-63432
   è
   http://semanticweb.cs.vu.nl/europeana/browse/list_resource?r=http://
   purl.org/collections/nl/am/proxy-63432

• At our server: content negotiation
  • HTTP request text/html:
    • Local condensed view	

    • Local full view
  • HTTP request application/rdf+xml
    • rdf/xml “describe”


• SPARQL endpoint
TEXT/HTML
TEXT/HTML
APPLICATION/RDF+XML
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix ore: <http://www.openarchives.org/ore/terms/> .
@prefix ens: <http://www.europeana.eu/schemas/edm/> .
@prefix ahm: <http://purl.org/collections/nl/am/>


ahm:proxy-66970
       a ore:Proxy ;
       ahm:title "Zegelstempel Felix Meritis"@nl ;
       ahm:material ahm:t-12463 ,
                   ahm:t-5447 ;
       ahm:objectCategory ahm:t-5504 ;
       ahm:objectName ahm:t-13817 ,
                     ahm:t-8489 ;
       ahm:objectNumber "KA 7653.1" ;
       ahm:priref "66970" .

ahm:proxy-66972
       a ore:Proxy ;
       ahm:acquisitionDate "0000" ;
       ahm:title "Zegelstempel mogelijk van familiewapen"@nl .
SPARQL




http://semanticweb.cs.vu.nl/europeana/user/query
FOUR RULES OF LINKED DATA

1. Use URIs as names for things

2. Use HTTP URIs so that people can look up those names.

3. When someone looks up a URI, provide useful information,
   using the standards (RDF*, SPARQL)

4. Include links to other URIs. so that they can discover more
   things.

                              http://www.w3.org/DesignIssues/
LINK TO OTHER SOURCES
                         “19319 ”
          iref
      :pr
   am          am:date        “1651”
                                                                                   “1234”
                                                        am:priref
am:Record                am:maker      am:Person
                                                             am:birthdate
                                                                              “1606”
am:proxy-19319                         am:p-1234
                                                         rda:name                   “Rembrandt”




                                                 owl:sameAs (?)




                                                                            Viaf:nationality
                                                                                                      “Dutch”
                                            Viaf:Person
                                       Viaf:RebrandtvanRijn
                                                                                                “Rembrandt
                                                                                               Harmensz. Van
                                                                        rdfs:label                 Rijn”
CONSUMING LINKED DATA


• Generic Applications

   • Can   process any data from any domain

• Domain   specific applications

   • Covers   needs of specific user community
LINKED DATA BROWSERS
DOMAIN-SPECIFIC APPS
DRAWBACKS AND PROBLEMS
•   Extra burden on the data provider

•   Nerd-only (aka “SPARQL is hard”)

•   How do we build user-friendly systems?

•   Ranking, user-friendly information presentation

•   Scalability (how do you query a huge graph?)

•   Licenses

    •   Is Open always a good idea?

    •   Context?

•   Data quality
FURTHER READING

• Tom Heath and Christian Bizer (2011)
 Linked Data: Evolving the Web into a
 Global Data Space (1st edition).
 Synthesis Lectures on the Semantic
 Web: Theory and Technology, 1:1,
 1-136. Morgan & Claypool (available
 online for free)

Más contenido relacionado

Similar a KM Lecture 7 LOD

Linked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam MuseumLinked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam MuseumVictor de Boer
 
Amsterdam museum as five star linked data
Amsterdam museum as five star linked dataAmsterdam museum as five star linked data
Amsterdam museum as five star linked dataVictor de Boer
 
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Victor de Boer
 
Radically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebRadically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebJulie Allinson
 
Linked Data: principles and examples
Linked Data: principles and examples Linked Data: principles and examples
Linked Data: principles and examples Victor de Boer
 
Kilian Schmidtner/Klaus Thoden - The Missing Links: Defining the Mamluk Empi...
Kilian Schmidtner/Klaus Thoden - The Missing Links:  Defining the Mamluk Empi...Kilian Schmidtner/Klaus Thoden - The Missing Links:  Defining the Mamluk Empi...
Kilian Schmidtner/Klaus Thoden - The Missing Links: Defining the Mamluk Empi...Digitised Manuscripts to Europeana
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webhorvadam
 
It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011Ross Singer
 
Book of the Dead Project
Book of the Dead ProjectBook of the Dead Project
Book of the Dead ProjectBarry Norton
 
Linked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolLinked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolVictor de Boer
 
Presentatie for "Studiemiddag Linked Data Archieven"
Presentatie for "Studiemiddag Linked Data Archieven"Presentatie for "Studiemiddag Linked Data Archieven"
Presentatie for "Studiemiddag Linked Data Archieven"Victor de Boer
 
Two graph data models : RDF and Property Graphs
Two graph data models : RDF and Property GraphsTwo graph data models : RDF and Property Graphs
Two graph data models : RDF and Property Graphsandyseaborne
 
Amsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open DataAmsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open DataVictor de Boer
 
2016-02 Graphs - PG+RDF
2016-02 Graphs - PG+RDF2016-02 Graphs - PG+RDF
2016-02 Graphs - PG+RDFandyseaborne
 
Consuming linked data by machines
Consuming linked data by machinesConsuming linked data by machines
Consuming linked data by machinesPatrick Sinclair
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataAn introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataGabriela Agustini
 
W3C Tutorial on Semantic Web and Linked Data at WWW 2013
W3C Tutorial on Semantic Web and Linked Data at WWW 2013W3C Tutorial on Semantic Web and Linked Data at WWW 2013
W3C Tutorial on Semantic Web and Linked Data at WWW 2013Fabien Gandon
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic  Web and Linked DataAn introduction to Semantic  Web and Linked Data
An introduction to Semantic Web and Linked DataGabriela Agustini
 

Similar a KM Lecture 7 LOD (20)

Linked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam MuseumLinked data: Four rules and five stars for the Amsterdam Museum
Linked data: Four rules and five stars for the Amsterdam Museum
 
Amsterdam museum as five star linked data
Amsterdam museum as five star linked dataAmsterdam museum as five star linked data
Amsterdam museum as five star linked data
 
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
Eswc2012 presentation: Supporting Linked Data Production for Cultural Heritag...
 
Radically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the WebRadically Open Cultural Heritage Data on the Web
Radically Open Cultural Heritage Data on the Web
 
Linked Data: principles and examples
Linked Data: principles and examples Linked Data: principles and examples
Linked Data: principles and examples
 
Kilian Schmidtner/Klaus Thoden - The Missing Links: Defining the Mamluk Empi...
Kilian Schmidtner/Klaus Thoden - The Missing Links:  Defining the Mamluk Empi...Kilian Schmidtner/Klaus Thoden - The Missing Links:  Defining the Mamluk Empi...
Kilian Schmidtner/Klaus Thoden - The Missing Links: Defining the Mamluk Empi...
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic web
 
It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011
 
Book of the Dead Project
Book of the Dead ProjectBook of the Dead Project
Book of the Dead Project
 
Linked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data SummerschoolLinked Data for Digital Humanities - Big Data Summerschool
Linked Data for Digital Humanities - Big Data Summerschool
 
Presentatie for "Studiemiddag Linked Data Archieven"
Presentatie for "Studiemiddag Linked Data Archieven"Presentatie for "Studiemiddag Linked Data Archieven"
Presentatie for "Studiemiddag Linked Data Archieven"
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Two graph data models : RDF and Property Graphs
Two graph data models : RDF and Property GraphsTwo graph data models : RDF and Property Graphs
Two graph data models : RDF and Property Graphs
 
Slides
SlidesSlides
Slides
 
Amsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open DataAmsterdam Museum as EDM Linked Open Data
Amsterdam Museum as EDM Linked Open Data
 
2016-02 Graphs - PG+RDF
2016-02 Graphs - PG+RDF2016-02 Graphs - PG+RDF
2016-02 Graphs - PG+RDF
 
Consuming linked data by machines
Consuming linked data by machinesConsuming linked data by machines
Consuming linked data by machines
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked DataAn introduction to Semantic Web and Linked Data
An introduction to Semantic Web and Linked Data
 
W3C Tutorial on Semantic Web and Linked Data at WWW 2013
W3C Tutorial on Semantic Web and Linked Data at WWW 2013W3C Tutorial on Semantic Web and Linked Data at WWW 2013
W3C Tutorial on Semantic Web and Linked Data at WWW 2013
 
An introduction to Semantic Web and Linked Data
An introduction to Semantic  Web and Linked DataAn introduction to Semantic  Web and Linked Data
An introduction to Semantic Web and Linked Data
 

Más de Marieke van Erp

Towards Culturally Aware AI Systems - TSDH Symposium
Towards Culturally Aware AI Systems - TSDH SymposiumTowards Culturally Aware AI Systems - TSDH Symposium
Towards Culturally Aware AI Systems - TSDH SymposiumMarieke van Erp
 
A Polyvocal and Contextualised Semantic Web
A Polyvocal and Contextualised Semantic WebA Polyvocal and Contextualised Semantic Web
A Polyvocal and Contextualised Semantic WebMarieke van Erp
 
AI x Digital Humanities = > Inclusiviteit
AI x Digital Humanities = > Inclusiviteit AI x Digital Humanities = > Inclusiviteit
AI x Digital Humanities = > Inclusiviteit Marieke van Erp
 
Computationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceComputationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceMarieke van Erp
 
The Hitchhiker's Guide to the Future of Digital Humanities
The Hitchhiker's Guide to the Future of Digital HumanitiesThe Hitchhiker's Guide to the Future of Digital Humanities
The Hitchhiker's Guide to the Future of Digital HumanitiesMarieke van Erp
 
Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Marieke van Erp
 
(Beyond) Combining Text and Tables for qualitative and quantitative research
(Beyond) Combining Text and Tables for qualitative and quantitative research (Beyond) Combining Text and Tables for qualitative and quantitative research
(Beyond) Combining Text and Tables for qualitative and quantitative research Marieke van Erp
 
Finding common ground between text, maps, and tables for quantitative and qua...
Finding common ground between text, maps, and tables for quantitative and qua...Finding common ground between text, maps, and tables for quantitative and qua...
Finding common ground between text, maps, and tables for quantitative and qua...Marieke van Erp
 
Slicing and Dicing a Newspaper Corpus for Historical Ecology Research
Slicing and Dicing a Newspaper Corpus for Historical Ecology ResearchSlicing and Dicing a Newspaper Corpus for Historical Ecology Research
Slicing and Dicing a Newspaper Corpus for Historical Ecology ResearchMarieke van Erp
 
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...Marieke van Erp
 
Good Lynx, bad Lynx: Document enrichment for historical ecologists
Good Lynx, bad Lynx: Document enrichment for historical ecologistsGood Lynx, bad Lynx: Document enrichment for historical ecologists
Good Lynx, bad Lynx: Document enrichment for historical ecologistsMarieke van Erp
 
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case Marieke van Erp
 
Natural Language Processing en Named Entity Recognition
Natural Language Processing en Named Entity Recognition Natural Language Processing en Named Entity Recognition
Natural Language Processing en Named Entity Recognition Marieke van Erp
 
HuC lecture - Digital and Humanities: Continuing the Conversation
HuC lecture - Digital and Humanities: Continuing the ConversationHuC lecture - Digital and Humanities: Continuing the Conversation
HuC lecture - Digital and Humanities: Continuing the ConversationMarieke van Erp
 
Multilingual Fine-grained Entity Typing
Multilingual Fine-grained Entity Typing Multilingual Fine-grained Entity Typing
Multilingual Fine-grained Entity Typing Marieke van Erp
 
Entity Typing Using Distributional Semantics and DBpedia
Entity Typing Using Distributional Semantics and DBpedia Entity Typing Using Distributional Semantics and DBpedia
Entity Typing Using Distributional Semantics and DBpedia Marieke van Erp
 
Entity Typing and Event Extraction
Entity Typing and Event Extraction Entity Typing and Event Extraction
Entity Typing and Event Extraction Marieke van Erp
 
The domain as unifier, how focusing on social history can bring technical fie...
The domain as unifier, how focusing on social history can bring technical fie...The domain as unifier, how focusing on social history can bring technical fie...
The domain as unifier, how focusing on social history can bring technical fie...Marieke van Erp
 
Evaluating entity linking an analysis of current benchmark datasets and a ro...
Evaluating entity linking  an analysis of current benchmark datasets and a ro...Evaluating entity linking  an analysis of current benchmark datasets and a ro...
Evaluating entity linking an analysis of current benchmark datasets and a ro...Marieke van Erp
 
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...
Finding Stories in 1,784,532 Events:  Scaling up computational models of narr...Finding Stories in 1,784,532 Events:  Scaling up computational models of narr...
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...Marieke van Erp
 

Más de Marieke van Erp (20)

Towards Culturally Aware AI Systems - TSDH Symposium
Towards Culturally Aware AI Systems - TSDH SymposiumTowards Culturally Aware AI Systems - TSDH Symposium
Towards Culturally Aware AI Systems - TSDH Symposium
 
A Polyvocal and Contextualised Semantic Web
A Polyvocal and Contextualised Semantic WebA Polyvocal and Contextualised Semantic Web
A Polyvocal and Contextualised Semantic Web
 
AI x Digital Humanities = > Inclusiviteit
AI x Digital Humanities = > Inclusiviteit AI x Digital Humanities = > Inclusiviteit
AI x Digital Humanities = > Inclusiviteit
 
Computationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and SpaceComputationally Tracing Concepts Through Time and Space
Computationally Tracing Concepts Through Time and Space
 
The Hitchhiker's Guide to the Future of Digital Humanities
The Hitchhiker's Guide to the Future of Digital HumanitiesThe Hitchhiker's Guide to the Future of Digital Humanities
The Hitchhiker's Guide to the Future of Digital Humanities
 
Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)Why language technology can’t handle Game of Thrones (yet)
Why language technology can’t handle Game of Thrones (yet)
 
(Beyond) Combining Text and Tables for qualitative and quantitative research
(Beyond) Combining Text and Tables for qualitative and quantitative research (Beyond) Combining Text and Tables for qualitative and quantitative research
(Beyond) Combining Text and Tables for qualitative and quantitative research
 
Finding common ground between text, maps, and tables for quantitative and qua...
Finding common ground between text, maps, and tables for quantitative and qua...Finding common ground between text, maps, and tables for quantitative and qua...
Finding common ground between text, maps, and tables for quantitative and qua...
 
Slicing and Dicing a Newspaper Corpus for Historical Ecology Research
Slicing and Dicing a Newspaper Corpus for Historical Ecology ResearchSlicing and Dicing a Newspaper Corpus for Historical Ecology Research
Slicing and Dicing a Newspaper Corpus for Historical Ecology Research
 
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
 
Good Lynx, bad Lynx: Document enrichment for historical ecologists
Good Lynx, bad Lynx: Document enrichment for historical ecologistsGood Lynx, bad Lynx: Document enrichment for historical ecologists
Good Lynx, bad Lynx: Document enrichment for historical ecologists
 
Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case Towards Semantic Enrichment of Newspapers: a historical ecology use case
Towards Semantic Enrichment of Newspapers: a historical ecology use case
 
Natural Language Processing en Named Entity Recognition
Natural Language Processing en Named Entity Recognition Natural Language Processing en Named Entity Recognition
Natural Language Processing en Named Entity Recognition
 
HuC lecture - Digital and Humanities: Continuing the Conversation
HuC lecture - Digital and Humanities: Continuing the ConversationHuC lecture - Digital and Humanities: Continuing the Conversation
HuC lecture - Digital and Humanities: Continuing the Conversation
 
Multilingual Fine-grained Entity Typing
Multilingual Fine-grained Entity Typing Multilingual Fine-grained Entity Typing
Multilingual Fine-grained Entity Typing
 
Entity Typing Using Distributional Semantics and DBpedia
Entity Typing Using Distributional Semantics and DBpedia Entity Typing Using Distributional Semantics and DBpedia
Entity Typing Using Distributional Semantics and DBpedia
 
Entity Typing and Event Extraction
Entity Typing and Event Extraction Entity Typing and Event Extraction
Entity Typing and Event Extraction
 
The domain as unifier, how focusing on social history can bring technical fie...
The domain as unifier, how focusing on social history can bring technical fie...The domain as unifier, how focusing on social history can bring technical fie...
The domain as unifier, how focusing on social history can bring technical fie...
 
Evaluating entity linking an analysis of current benchmark datasets and a ro...
Evaluating entity linking  an analysis of current benchmark datasets and a ro...Evaluating entity linking  an analysis of current benchmark datasets and a ro...
Evaluating entity linking an analysis of current benchmark datasets and a ro...
 
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...
Finding Stories in 1,784,532 Events:  Scaling up computational models of narr...Finding Stories in 1,784,532 Events:  Scaling up computational models of narr...
Finding Stories in 1,784,532 Events: Scaling up computational models of narr...
 

KM Lecture 7 LOD

  • 1. LECTURE 7: LINKED (OPEN) DATA Marieke van Erp (with slides from Victor de Boer and Christophe Guéret)
  • 2. TODAY’S LECTURE • Why Linked (Open) Data? • What is Linked (Open) Data? • The story of Linked Open Data • Contributing to Linked Data • Standards and best practices • Consuming Linked Data • Drawbacks and problems
  • 3. WHY LINKED DATA (1/2) Slide stolen from Christophe Guéret
  • 4. WHY LINKED DATA (2/2) Slide stolen from Christophe Guéret
  • 5. ``Sharable, spreadable and nerd-friendly’’ -- Charlotte S H Jensen,
  • 6. WHAT IS LINKED DATA? • Linked Data is a method to publish structured data for interlinking with other data sources • Standard Web technology (HTTP and URIs) • Making information more easily readable and shareable for machines • Linked Open Data is a W3C community project to extend the Web with open data sets http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
  • 7. THE STORY May 2007
  • 8. Oct
  • 9.
  • 10. “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 11. “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 12. CONTRIBUTING TO LINKED DATA Yes, it may be scary to open up your data but it may lead to: • Transparency • Participation • Improvement • Innovation • New knowledge & insights from combined data sources
  • 13. STANDARDS AND BEST PRACTICES
  • 14. LINKED OPEN DATA FIVE STAR SYSTEM Available on the web (whatever ★ format), but with an open license Available as machine-readable ★★ structured data (e.g. excel instead of image scan of a table) as (2) plus non-proprietary format (e.g. ★★★ CSV instead of excel) All the above plus, Use open standards from W3C (RDF and SPARQL) to ★★★★ identify things, so that people can point at your stuff All the above, plus: Link your data to ★★★★★ other people’s data to provide context www.w3.org/designissues/
  • 15. FOUR RULES OF LINKED DATA 1. Use URIs as names for things (Resources) 2. Use HTTP URIs so that people can look up those names. (Dereferencing) 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/
  • 16. FOUR RULES OF LINKED DATA 1. Use URIs as names for things (Resources) 2. Use HTTP URIs so that people can look up those names. (Dereferencing) 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/
  • 17. HOW TO MAKE COOL URI’S • Use HTTP:// • Use a namespace you control • Unique, stable and persistent • Don’t use: • Author name, subject, status, access, file name extension, software mechanism C://MyDisk/awesome/MvanErp/latest/cgi_bin/rembrandt.html
  • 18. FOUR RULES OF LINKED DATA 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. (Dereferencing) 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/
  • 19. RDF REMINDER Subject     Predicate     Object   Triples am:Rembrandt   am:hasBirthdate   “1651” am:Rembrandt   foaf:knows   am:PiterLastman am:PiterLastman   am:wasBornIn   geonames:Amsterdam ate am:h asBir thd “1651” geonames:Amsterdam am:Rembrandt foaf:knows am:w asBorn In Graph am:PiterLastman
  • 20. RDF CONVERSION <record priref="19319 “ > <date>1651</date> <maker>Rembrandt (1606-1669)</maker> <object.type>etsplaat</object.type> … priref “19319 ” </record> date “1651” am:Record _:bn1 maker “Rembrandt (1606-1669)” object.type “etsplaat” “19319 ” iref :pr am am:date “1651” “1234” am:priref am:Record am:maker am:Person am:birthdate am:proxy-19319 “1606” am:p-1234 rda:name “Rembrandt” am :obje ct.ty pe skos:Concept “etsplaat” am:etsplaat skos:prefLabel
  • 21. ARCHITECTURE SPARQL-app Browser Purl.org redirect SPARQL Web interface HTTP server Logic a RDF(s) storage tri pa clio Prolog http://
  • 22. HOW TO ACCESS THE DATA • PURL 303 redirect to VU semantic layer http://purl.org/collections/nl/am/proxy-63432 è http://semanticweb.cs.vu.nl/europeana/browse/list_resource?r=http:// purl.org/collections/nl/am/proxy-63432 • At our server: content negotiation • HTTP request text/html: • Local condensed view • Local full view • HTTP request application/rdf+xml • rdf/xml “describe” • SPARQL endpoint
  • 25. APPLICATION/RDF+XML @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix ore: <http://www.openarchives.org/ore/terms/> . @prefix ens: <http://www.europeana.eu/schemas/edm/> . @prefix ahm: <http://purl.org/collections/nl/am/> ahm:proxy-66970 a ore:Proxy ; ahm:title "Zegelstempel Felix Meritis"@nl ; ahm:material ahm:t-12463 , ahm:t-5447 ; ahm:objectCategory ahm:t-5504 ; ahm:objectName ahm:t-13817 , ahm:t-8489 ; ahm:objectNumber "KA 7653.1" ; ahm:priref "66970" . ahm:proxy-66972 a ore:Proxy ; ahm:acquisitionDate "0000" ; ahm:title "Zegelstempel mogelijk van familiewapen"@nl .
  • 27. FOUR RULES OF LINKED DATA 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/
  • 28. LINK TO OTHER SOURCES “19319 ” iref :pr am am:date “1651” “1234” am:priref am:Record am:maker am:Person am:birthdate “1606” am:proxy-19319 am:p-1234 rda:name “Rembrandt” owl:sameAs (?) Viaf:nationality “Dutch” Viaf:Person Viaf:RebrandtvanRijn “Rembrandt Harmensz. Van rdfs:label Rijn”
  • 29. CONSUMING LINKED DATA • Generic Applications • Can process any data from any domain • Domain specific applications • Covers needs of specific user community
  • 32.
  • 33. DRAWBACKS AND PROBLEMS • Extra burden on the data provider • Nerd-only (aka “SPARQL is hard”) • How do we build user-friendly systems? • Ranking, user-friendly information presentation • Scalability (how do you query a huge graph?) • Licenses • Is Open always a good idea? • Context? • Data quality
  • 34. FURTHER READING • Tom Heath and Christian Bizer (2011) Linked Data: Evolving the Web into a Global Data Space (1st edition). Synthesis Lectures on the Semantic Web: Theory and Technology, 1:1, 1-136. Morgan & Claypool (available online for free)