SlideShare una empresa de Scribd logo
1 de 32
Adventures in Linked Data Land:
bringing RDF to the Wordsworth
Trust


Richard Light




CT Linked Data Meeting, 22 February 2010
Discovering Linked Data
Four principles of Linked Data (Tim B-L):

●   Use URIs to identify resources

●   Use HTTP URIs so that people can look them up

●   Provide useful information about the resource

●   Include links to other URIs in your data
Discovering dbPedia
●   Extraction of Linked Data from Wikipedia
●   Statements in info boxes (mainly) become RDF
    triples:
<rdf:Description
  rdf:about="http://dbpedia.org/resource/Ber
  lin_Marathon">
<dbpprop:location
  rdf:resource="http://dbpedia.org/resource/
  Berlin"/>
</rdf:Description>

Note the URLs
Browsing Linked Data

●   View RDF as a web page:
    http://dbpedia.org/page/Berlin

●   Navigate from one data source to another

●   Specialist Linked Data browsers/plugins:
    –   DISCO
    –   Marbles
    –   Openlink Data Explorer
    –   Tabulator
dbPedia page for Berlin
OpenLink Data Explorer: What
OpenLink Data Explorer: Where
Querying Linked Data

●   SPARQL query language:
    http://www.w3.org/TR/2008/REC-rdf-sparql-query-
      20080115/

●   And SPARQL XML results format:
    http://www.w3.org/TR/rdf-sparql-XMLres/

●   “SPARQL end-points”:
    http://dbpedia.org/sparql
    http://dbtune.org/bbc/peel/sparql
    http://data.linkedmdb.org/sparql
dbPedia SPARQL endpoint page
Asking interesting questions

●   German musicians born in Berlin:
●
So what do we have here?

●   An initiative to generate lots of Linked Data

●   A Linked Data Cloud, containing a growing
    number of RDF datasets

●   A hard-to-use query language capable of very
    precise and powerful querying

Where do museums come into this picture?
The Wordsworth Trust

●   Typical museum collection: about 60,000 objects

●   Major collection of manuscripts (notebooks,
    letters, etc.)

●   Objects published to the Web from a ModesXML
    database

●   Unwise enough to allow me Remote Desktop
    access ...
Typical collections object

GRMDC.C104.2
Same object represented as RDF
Same object represented as XTM
One identifier; three “views”
●   This object has a single persistent identifier:
    http://collections.wordsworth.org.uk/object/GRMDC.C104.2

●   This maps to different views depending on the
    “Accept” header in the HTTP request:

    –   application/rdf+xml >> RDF
    –   application/xtm+xml >> XTM Topic Map
    –   Otherwise >> HTML (human-readable)

●   Achieved through a custom 404 “page not found”
    handler
“Page not found” handler (1)

●   All URLs are fictitious, so they generate a 404

●   Modified a generic smart 404 handler from:
    http://evolvedcode.net/content/code_smart404/

●   Added support for “303 See other” redirects

●   added wild card matching to re-format URLs
“Page not found” handler (2)

●   Generic URL, plus requested Accept format,
    determine initial “303 See other” mapping, e.g.:
    http://collections.wordsworth.org.uk/object/GRMDC.C104.2
    +
    Accept: application/rdf+xml
    =
    http://collections.wordsworth.org.uk/object/rdf/GRMDC.C104.2

●   When this is passed back in, the 404 handler has to
    generate the required RDF directly

●   Can't just keep redirecting requests!
“Page not found” handler (3)

●   Redirect rules declare mappings:
“Page not found” handler (4)

●   Generic URL plus a supported Accept type
    generates a “303 See other” redirect

●   If it comes back as a page request, it is further
    redirected with a “301 Moved permanently” to the
    object's web page

●   If it comes back as an RDF or XTM request, the
    record is fetched as XML and subjected to an
    XSLT transform by the handler
Does it work? - yes, sort of
Data Explorer place view
Implementation details

●   HTML needed a “back link” to RDF to keep
    OpenLink Explorer happy:
    <link rel="alternate" type="application/rdf+xml"
      href="http://collections.wordsworth.org.uk/object/data/GRMDC
      .C104.2" title="RDF" />

●   Result is totally unfindable: need a search or
    harvesting mechanism:
     – OAI support (possible)
     – SPARQL end-point (harder)
What has been learnt? (1)
●   The Linked Data paradigm encourages simple
    RDF triples: no “blank nodes”

●   For an object, this becomes a simple metadata set,
    very analogous to the PNDS DCAP format

●   The properties involved need to encapsulate the
    whole relation between object and data, e.g.
    <p:title>Ulswater from Pooley Bridge</p:title>
    <p:technique>drawn</p:technique>
    <p:maker>Farington, Joseph (1747-1821)</p:maker>
    <p:technique>engraved</p:technique>
    <p:maker>Middiman, Samuel (1750-1831)</p:maker>
What has been learnt? (2)

●   Data in linked resources can “add value” to your
    own recording efforts (e.g. place data)
Properties: which framework?

●   I have used dbPedia properties (for compatibility
    with other Linked Data resources … ?):
    http://dbpedia.org/property/title
    http://dbpedia.org/property/maker

●   A viable alternative would be PNDS DCAP:
    http://purl.org/dc/elements/1.1/title
    http://purl.org/dc/elements/1.1/creator

●   One framework which doesn't fit is the CIDOC
    CRM:
    E21 Physical Thing – E12 Production – E39 Actor = “creator”
Do we need “museum” properties?

●   DbPedia properties are not coherent

●   Need something richer than simple metadata

●   Could use CIDOC CRM as basis

●   Existing interchange formats such as LIDO could
    be re-expressed in RDF

●   Could broaden scope: “history” property set?
The problem of URIs

●   Good Linked Data requires URIs everywhere

●   Most of my museum RDF resolves to strings

●   One exception is Geonames lookup:
        Ullswater
    becomes
        http://www.geonames.org/2635191/

●   In the absence of a central “people” registry,
    should be minting URIs myself for people, etc.
Conclusions
●   Implementing an RDF Linked Data front-end to a
    museum database is feasible if:
    –   You can generate multiple outputs from your database
        (XML is sufficient)
    –   You can implement a suitable URL rewriter or 404
        handler

●   It's easy (and a good idea) to mint and publish
    URIs for your collection objects

●   It's less clear where all the other URIs we'll need
    will come from
Challenges for museum linked data

●   Agreeing an ontology to enable cross-collection
    [SPARQL] queries

●   Shared URLs for in-common concepts: people,
    places, events

●   Mechanisms for getting URLs into museum data

●   Getting existing authorities, e.g. AAT, to be
    available as Linked Data
Ask Multimap where Lancaster is
Thank you!




Richard Light
richard@light.demon.co.uk

Más contenido relacionado

La actualidad más candente

Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
Thomas Kurz
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
giurca
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
National Institute of Informatics
 

La actualidad más candente (20)

Enabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MMEnabling access to Linked Media with SPARQL-MM
Enabling access to Linked Media with SPARQL-MM
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertier
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
JSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki contentJSONpedia - Facilitating consumption of MediaWiki content
JSONpedia - Facilitating consumption of MediaWiki content
 
Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis Platform
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
 
Open Data and CKAN Data Catalogues
Open Data and CKAN Data CataloguesOpen Data and CKAN Data Catalogues
Open Data and CKAN Data Catalogues
 
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
BDT204 Awesome Applications of Open Data - AWS re: Invent 2012
 
Drupal and Apache Stanbol
Drupal and Apache StanbolDrupal and Apache Stanbol
Drupal and Apache Stanbol
 
RDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an updateRDTF Metadata Guidelines: an update
RDTF Metadata Guidelines: an update
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Linked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache StanbolLinked data based semantic annotation using Drupal and Apache Stanbol
Linked data based semantic annotation using Drupal and Apache Stanbol
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
CKAN - the open source data portal platform
CKAN - the open source data portal platformCKAN - the open source data portal platform
CKAN - the open source data portal platform
 

Destacado (11)

Reducing Unauthorized Deductions
Reducing Unauthorized DeductionsReducing Unauthorized Deductions
Reducing Unauthorized Deductions
 
Jugandoalhuevopodrido
JugandoalhuevopodridoJugandoalhuevopodrido
Jugandoalhuevopodrido
 
Opening The Ivory Tower
Opening The Ivory TowerOpening The Ivory Tower
Opening The Ivory Tower
 
Nuevas tecnologías en las escuels
Nuevas tecnologías en las escuelsNuevas tecnologías en las escuels
Nuevas tecnologías en las escuels
 
es fascinante
es fascinantees fascinante
es fascinante
 
DistanceEducation
DistanceEducationDistanceEducation
DistanceEducation
 
9th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 59th Grade Chapter 3 Lesson 5
9th Grade Chapter 3 Lesson 5
 
9th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 49th Grade Chapter 1 Lesson 4
9th Grade Chapter 1 Lesson 4
 
Non-invasive Glucose Monitor
Non-invasive Glucose MonitorNon-invasive Glucose Monitor
Non-invasive Glucose Monitor
 
Moodle.2009
Moodle.2009Moodle.2009
Moodle.2009
 
Laser applications to medicine and biology
Laser applications to medicine and biologyLaser applications to medicine and biology
Laser applications to medicine and biology
 

Similar a Adventures in Linked Data Land (presentation by Richard Light)

SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
Rinke Hoekstra
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
webhostingguy
 

Similar a Adventures in Linked Data Land (presentation by Richard Light) (20)

Webofdata
WebofdataWebofdata
Webofdata
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
Linked Data
Linked DataLinked Data
Linked Data
 
Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)Engineering a Semantic Web (Spring 2018)
Engineering a Semantic Web (Spring 2018)
 
Danbri Drupalcon Export
Danbri Drupalcon ExportDanbri Drupalcon Export
Danbri Drupalcon Export
 
RDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFaRDFa Introductory Course Session 2/4 How RDFa
RDFa Introductory Course Session 2/4 How RDFa
 
How RDFa works
How RDFa worksHow RDFa works
How RDFa works
 
Madrid SPARQL handson
Madrid SPARQL handsonMadrid SPARQL handson
Madrid SPARQL handson
 
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
Engineering a Semantic Web: ITWS Capstone Lecture (Spring 2014)
 
SemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n BoltsSemanticWeb Nuts 'n Bolts
SemanticWeb Nuts 'n Bolts
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
 
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked DataIFLA LIDASIG Open Session 2017: Introduction to Linked Data
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
 
Lifting the Lid on Linked Data
Lifting the Lid on Linked DataLifting the Lid on Linked Data
Lifting the Lid on Linked Data
 
Deploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application ServerDeploying PHP applications using Virtuoso as Application Server
Deploying PHP applications using Virtuoso as Application Server
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Madrid Building blocks of Linked Data
Madrid Building blocks of Linked DataMadrid Building blocks of Linked Data
Madrid Building blocks of Linked Data
 
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
ITWS 4310: Building and Consuming the Web of Data (Fall 2013)
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Linked data HHS 2015
Linked data HHS 2015Linked data HHS 2015
Linked data HHS 2015
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Adventures in Linked Data Land (presentation by Richard Light)

  • 1. Adventures in Linked Data Land: bringing RDF to the Wordsworth Trust Richard Light CT Linked Data Meeting, 22 February 2010
  • 2. Discovering Linked Data Four principles of Linked Data (Tim B-L): ● Use URIs to identify resources ● Use HTTP URIs so that people can look them up ● Provide useful information about the resource ● Include links to other URIs in your data
  • 3. Discovering dbPedia ● Extraction of Linked Data from Wikipedia ● Statements in info boxes (mainly) become RDF triples: <rdf:Description rdf:about="http://dbpedia.org/resource/Ber lin_Marathon"> <dbpprop:location rdf:resource="http://dbpedia.org/resource/ Berlin"/> </rdf:Description> Note the URLs
  • 4. Browsing Linked Data ● View RDF as a web page: http://dbpedia.org/page/Berlin ● Navigate from one data source to another ● Specialist Linked Data browsers/plugins: – DISCO – Marbles – Openlink Data Explorer – Tabulator
  • 8. Querying Linked Data ● SPARQL query language: http://www.w3.org/TR/2008/REC-rdf-sparql-query- 20080115/ ● And SPARQL XML results format: http://www.w3.org/TR/rdf-sparql-XMLres/ ● “SPARQL end-points”: http://dbpedia.org/sparql http://dbtune.org/bbc/peel/sparql http://data.linkedmdb.org/sparql
  • 10. Asking interesting questions ● German musicians born in Berlin: ●
  • 11. So what do we have here? ● An initiative to generate lots of Linked Data ● A Linked Data Cloud, containing a growing number of RDF datasets ● A hard-to-use query language capable of very precise and powerful querying Where do museums come into this picture?
  • 12. The Wordsworth Trust ● Typical museum collection: about 60,000 objects ● Major collection of manuscripts (notebooks, letters, etc.) ● Objects published to the Web from a ModesXML database ● Unwise enough to allow me Remote Desktop access ...
  • 16. One identifier; three “views” ● This object has a single persistent identifier: http://collections.wordsworth.org.uk/object/GRMDC.C104.2 ● This maps to different views depending on the “Accept” header in the HTTP request: – application/rdf+xml >> RDF – application/xtm+xml >> XTM Topic Map – Otherwise >> HTML (human-readable) ● Achieved through a custom 404 “page not found” handler
  • 17. “Page not found” handler (1) ● All URLs are fictitious, so they generate a 404 ● Modified a generic smart 404 handler from: http://evolvedcode.net/content/code_smart404/ ● Added support for “303 See other” redirects ● added wild card matching to re-format URLs
  • 18. “Page not found” handler (2) ● Generic URL, plus requested Accept format, determine initial “303 See other” mapping, e.g.: http://collections.wordsworth.org.uk/object/GRMDC.C104.2 + Accept: application/rdf+xml = http://collections.wordsworth.org.uk/object/rdf/GRMDC.C104.2 ● When this is passed back in, the 404 handler has to generate the required RDF directly ● Can't just keep redirecting requests!
  • 19. “Page not found” handler (3) ● Redirect rules declare mappings:
  • 20. “Page not found” handler (4) ● Generic URL plus a supported Accept type generates a “303 See other” redirect ● If it comes back as a page request, it is further redirected with a “301 Moved permanently” to the object's web page ● If it comes back as an RDF or XTM request, the record is fetched as XML and subjected to an XSLT transform by the handler
  • 21. Does it work? - yes, sort of
  • 23. Implementation details ● HTML needed a “back link” to RDF to keep OpenLink Explorer happy: <link rel="alternate" type="application/rdf+xml" href="http://collections.wordsworth.org.uk/object/data/GRMDC .C104.2" title="RDF" /> ● Result is totally unfindable: need a search or harvesting mechanism: – OAI support (possible) – SPARQL end-point (harder)
  • 24. What has been learnt? (1) ● The Linked Data paradigm encourages simple RDF triples: no “blank nodes” ● For an object, this becomes a simple metadata set, very analogous to the PNDS DCAP format ● The properties involved need to encapsulate the whole relation between object and data, e.g. <p:title>Ulswater from Pooley Bridge</p:title> <p:technique>drawn</p:technique> <p:maker>Farington, Joseph (1747-1821)</p:maker> <p:technique>engraved</p:technique> <p:maker>Middiman, Samuel (1750-1831)</p:maker>
  • 25. What has been learnt? (2) ● Data in linked resources can “add value” to your own recording efforts (e.g. place data)
  • 26. Properties: which framework? ● I have used dbPedia properties (for compatibility with other Linked Data resources … ?): http://dbpedia.org/property/title http://dbpedia.org/property/maker ● A viable alternative would be PNDS DCAP: http://purl.org/dc/elements/1.1/title http://purl.org/dc/elements/1.1/creator ● One framework which doesn't fit is the CIDOC CRM: E21 Physical Thing – E12 Production – E39 Actor = “creator”
  • 27. Do we need “museum” properties? ● DbPedia properties are not coherent ● Need something richer than simple metadata ● Could use CIDOC CRM as basis ● Existing interchange formats such as LIDO could be re-expressed in RDF ● Could broaden scope: “history” property set?
  • 28. The problem of URIs ● Good Linked Data requires URIs everywhere ● Most of my museum RDF resolves to strings ● One exception is Geonames lookup: Ullswater becomes http://www.geonames.org/2635191/ ● In the absence of a central “people” registry, should be minting URIs myself for people, etc.
  • 29. Conclusions ● Implementing an RDF Linked Data front-end to a museum database is feasible if: – You can generate multiple outputs from your database (XML is sufficient) – You can implement a suitable URL rewriter or 404 handler ● It's easy (and a good idea) to mint and publish URIs for your collection objects ● It's less clear where all the other URIs we'll need will come from
  • 30. Challenges for museum linked data ● Agreeing an ontology to enable cross-collection [SPARQL] queries ● Shared URLs for in-common concepts: people, places, events ● Mechanisms for getting URLs into museum data ● Getting existing authorities, e.g. AAT, to be available as Linked Data
  • 31. Ask Multimap where Lancaster is