dans.knaw.nl
DANS is an institute of KNAW and NWO
Linked Open Data and DANS
Reinier de Valk
reinier.de.valk@dans.knaw.nl
Vyacheslav Tykhonov
vyacheslav.tykhonov@dans.knaw.nl
NOTaS meeting, The Hague, 15.12.2017
LOD | Linked (Open) Data?
• Linked Data (LD) is “a method of publishing structured data so that it
can be interlinked and become more useful through semantic queries” [1]
• Linked Open Data (LOD) is LD that is open, i.e., freely available to use
and republish
• Builds upon standard web technologies, but extends them so that they
can be read by machines
• Semantic web: a web of data that can be processed by machines
[1] https://en.wikipedia.org/wiki/Linked_data
LOD | Four principles of LD [2]
• Use uniform resource identifiers (URIs) as names for things
• Use HTTP URIs so that people can look up those names
• When someone looks up a URI, provide useful information, using the
standards (RDF*, SPARQL)
• Include links to other URIs, so that they can discover more things
[2] Berners-Lee, T. (2006) Linked data. https://www.w3.org/DesignIssues/LinkedData.html
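The second and third principles can be sketched as an HTTP request that asks for an RDF serialisation of a URI. This is a minimal illustration using only the Python standard library; the helper name `rdf_request` is an assumption made here, and the request is built but deliberately not sent.

```python
from urllib.request import Request


def rdf_request(uri, media_type="text/turtle"):
    """Build (but do not send) an HTTP GET that asks for RDF describing `uri`.

    The Accept header is how a client signals that it wants
    machine-readable data rather than an HTML page.
    """
    return Request(uri, headers={"Accept": media_type}, method="GET")


# Example URI taken from the triple slide. Note that a fragment (#Bob) is
# not sent to the server; the client resolves it in the returned document.
req = rdf_request("http://example.name#Bob")
print(req.get_method(), req.get_header("Accept"))
```

Whether a server answers such a request with Turtle depends on its own content negotiation set-up.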
LOD | Building block: the triple
• The basic building block of LD is the semantic triple (or simply triple)
• a triple is a statement in the form subject-predicate-object
http://example.name#Bob http://purl.org/vocab/relationship/childOf http://example.name#Alice
http://example.name#Carl http://purl.org/vocab/relationship/childOf http://example.name#Alice
• Triples are stored in triplestores (purpose-built databases) or graph
databases (databases with a more generalised structure)
• These databases can be queried with query languages such as
SPARQL; this is done using a (SPARQL) endpoint
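The triple model and the idea of querying it can be sketched in a few lines of Python. This is only an in-memory illustration, not a triplestore: the functions `match` and `siblings` are hypothetical names introduced here, and `None` plays the role that a variable plays in a SPARQL pattern.

```python
CHILD_OF = "http://purl.org/vocab/relationship/childOf"

# Each triple is a (subject, predicate, object) tuple of URIs, as on the slide.
TRIPLES = {
    ("http://example.name#Bob", CHILD_OF, "http://example.name#Alice"),
    ("http://example.name#Carl", CHILD_OF, "http://example.name#Alice"),
}


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


def siblings(triples):
    """Derive pairs of distinct subjects that share a parent via childOf."""
    kids = match(triples, p=CHILD_OF)
    return sorted(
        {(a[0], b[0]) for a in kids for b in kids if a[2] == b[2] and a[0] < b[0]}
    )


children_of_alice = [
    t[0] for t in match(TRIPLES, p=CHILD_OF, o="http://example.name#Alice")
]
print(children_of_alice)
print(siblings(TRIPLES))
```

The `siblings` function shows the kind of simple reasoning that becomes possible once statements share identifiers: Bob and Carl are never declared related, yet the shared object links them.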
LOD | The LOD cloud (22.08.2017) [3]
[3] http://lod-cloud.net/
LOD at DANS | Static LOD
• A LOD graph is living – it keeps evolving
• We archive static snapshots of the graph
• LD is in plain ASCII – no complicated formats needed
• The archived static snapshot can be revived – the README file
accompanying the data describes the procedure
• Examples at EASY, DANS's online long-term archiving system [4]
• use search term “linked data”
• interesting examples: LOD Laundromat; CEDAR RDF database
[4] http://www.easy.dans.knaw.nl/
LOD at DANS | Static LOD
DANS LOD infrastructure
• LOD conversion tool harvesting public metadata from DANS systems
using OAI-PMH protocol and converting to Turtle RDF format
• Virtuoso with SPARQL endpoint to store and query archived triples
(static)
• grlc to build Web APIs using shared SPARQL queries
• Timbuctoo Linked Data storage to keep different versions of
metadata harvested from DANS systems (turn into schema)
• GraphQL endpoint integrated in Timbuctoo to query repository and
evaluate new links
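The first step of this infrastructure (harvest over OAI-PMH, convert to Turtle) can be sketched as follows. This is a simplified illustration and not the DANS conversion tool: the base URL, the record fields, and both function names are assumptions, and only the standard OAI-PMH `ListRecords` verb and the `oai_dc` metadata prefix are taken from the protocol itself.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL (the request is not sent)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


def to_turtle(subject_uri, fields):
    """Serialise one harvested record as Turtle with Dublin Core elements.

    `fields` maps a Dublin Core element name to a list of literal values.
    """
    pairs = [(key, value) for key, values in fields.items() for value in values]
    if not pairs:
        raise ValueError("a record needs at least one field")
    lines = [
        "@prefix dc: <http://purl.org/dc/elements/1.1/> .",
        "",
        f"<{subject_uri}>",
    ]
    for i, (key, value) in enumerate(pairs):
        end = " ." if i == len(pairs) - 1 else " ;"
        escaped = (
            value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
        )
        lines.append(f'    dc:{key} "{escaped}"{end}')
    return "\n".join(lines)


# Hypothetical endpoint and record, for illustration only.
url = list_records_url("https://repository.example.org/oai")
turtle = to_turtle("https://example.org/dataset/1", {"title": ["Example dataset"]})
print(url)
print(turtle)
```

A real harvester would also have to parse the XML response and follow OAI-PMH resumption tokens, which this sketch leaves out.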
What is Timbuctoo?
• Timbuctoo is an open source Linked Data repository system developed by Huygens ING
and specialized in handling interpretative and heterogeneous content. Timbuctoo is
specifically designed for academic research in the arts & humanities and is ideally suited
for research institutions, libraries and archives supporting scholars who follow a
hermeneutic methodology.
• Data upload options:
• Excel upload
• CSV upload
• Dataperfect upload
• remote repository upload with ResourceSync
Description of pipeline to archive
• Users deposit new datasets, and metadata is updated over time
• Snapshots are taken regularly
• ResourceSync is the only option to get an updated snapshot into the LOD
cloud without manual interaction
Valid resources
• Filetypes that can be imported:
• text/turtle (.ttl)
• application/rdf+xml (.rdf)
• application/n-triples (.nt)
• application/ld+json (.jsonld)
• text/trig (.trig)
• application/n-quads (.nq)
• text/n3 (.n3)
• application/vnd.timbuctoo-rdf.nquads_unified_diff (.nqud)
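The list above can be captured as a small lookup table, for instance to decide which media type to declare when uploading a file. The mapping is copied from the slide; the function name `media_type_for` is a hypothetical helper introduced for this sketch.

```python
import os

# File extension -> media type, as listed on the slide.
RDF_MEDIA_TYPES = {
    ".ttl": "text/turtle",
    ".rdf": "application/rdf+xml",
    ".nt": "application/n-triples",
    ".jsonld": "application/ld+json",
    ".trig": "text/trig",
    ".nq": "application/n-quads",
    ".n3": "text/n3",
    ".nqud": "application/vnd.timbuctoo-rdf.nquads_unified_diff",
}


def media_type_for(filename):
    """Return the media type for an importable file, or None if unsupported."""
    extension = os.path.splitext(filename)[1].lower()
    return RDF_MEDIA_TYPES.get(extension)


print(media_type_for("easy.nq"))
print(media_type_for("easy.csv"))
```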
EASY metadata triples
hdl:10411/UQZGXY
DataverseNL public metadata triples
hdl:10411/AD7VGI
What is GraphQL?
• “GraphQL is a data query language developed internally by Facebook in 2012 before
being publicly released in 2015. It provides an alternative to REST and ad-hoc webservice
architectures.”
• Wikipedia
• “GraphQL is a query language for your API, and a server-side runtime for executing
queries by using a type system you define for your data. GraphQL isn't tied to any specific
database or storage engine and is instead backed by your existing code and data.”
• GraphQL endpoint provided by Timbuctoo RDF storage allows visual Linked Data
exploration.
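As a rough illustration of how a client talks to a GraphQL endpoint, the sketch below builds a POST request carrying the standard introspection query, which any GraphQL server can answer with the names of its types. The endpoint URL is a placeholder and the helper `graphql_request` is an assumption of this sketch; nothing here reflects the actual Timbuctoo schema, and the request is built but not sent.

```python
import json
from urllib.request import Request

# Standard GraphQL introspection: ask the server which types it defines.
INTROSPECTION_QUERY = "{ __schema { types { name } } }"


def graphql_request(endpoint, query, variables=None):
    """Build (but do not send) a GraphQL-over-HTTP POST request."""
    payload = {"query": query}
    if variables:
        payload["variables"] = variables
    return Request(
        endpoint,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


# Placeholder endpoint, for illustration only.
req = graphql_request("https://timbuctoo.example.org/graphql", INTROSPECTION_QUERY)
print(req.get_method(), req.data.decode("utf-8"))
```

Starting from introspection is a convenient way to explore an unfamiliar endpoint, because the response lists the types that later queries can select.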
Overview of EASY datasets
Demo
EASY dataset in Timbuctoo GraphQL endpoint
N-Quads Unified Diff (U.D.)
• RDF data set notations are like snapshots.
• We enrich them…
• What if we need to track changes in the resulting new RDF file?
• How do we know which of these predicates has had a previous value?
• What if we want to add new triples?
• N-Quads is itself an extension of N-Triples; Timbuctoo supports both:
--- easy.nq 2017-12-14 11:18:16.057104790 +0200
+++ empty.nq 2017-12-14 12:08:18.772264550 +0200
@@ -1,35652 +0,0 @@
+<easy:15960> <dc:location> "http://www.gemeentegeschiedenis.nl/gemeentenaam/Slochteren" .
+<easy:15960> <dc:location> "http://www.gemeentegeschiedenis.nl/gemeentenaam/Sloten_NH" .
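The idea of tracking changes as added and removed statements can be sketched as a function that applies a diff to a set of statements. This is a simplified reading of the format inferred from the slide, not Timbuctoo's implementation: the function name, the file names, and the removed statement in the example are all hypothetical.

```python
def apply_nquads_diff(statements, diff_text):
    """Apply the '+' and '-' lines of a unified-diff-style patch to a set.

    Assumes every statement starts with '<' or '_:', so a line beginning
    with '---' or '+++' can only be a file header.
    """
    result = set(statements)
    for line in diff_text.splitlines():
        if not line.strip() or line.startswith(("+++", "---", "@@")):
            continue  # blank lines, file headers, hunk headers
        marker, statement = line[0], line[1:].strip()
        if marker == "+":
            result.add(statement)
        elif marker == "-":
            result.discard(statement)
        # lines starting with a space are unchanged context and are ignored
    return result


# Hypothetical enrichment: a plain-text place name is replaced by a URI.
BEFORE = {'<easy:15960> <dc:location> "Slochteren" .'}
DIFF = """--- before.nq
+++ after.nq
@@ -1 +1 @@
-<easy:15960> <dc:location> "Slochteren" .
+<easy:15960> <dc:location> "http://www.gemeentegeschiedenis.nl/gemeentenaam/Slochteren" .
"""

after = apply_nquads_diff(BEFORE, DIFF)
print(sorted(after))
```

Because each change is stored as an explicit addition or removal, earlier values of a predicate remain recoverable by replaying the diffs in order.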
LOD Archiving
Archived SPARQL endpoint
Memento protocol for Linked Data Fragments
Credits: Ruben Verborgh
Dataverse as Memento Timegate
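In the Memento protocol (RFC 7089), a client asks a TimeGate for the version of a resource closest to a given moment by sending an `Accept-Datetime` header. The sketch below only builds such a request; the TimeGate URL is a placeholder, the function names are assumptions, and how Dataverse would act as a TimeGate is not specified on the slides.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from urllib.request import Request


def accept_datetime(when):
    """Format a timezone-aware datetime as an HTTP date in GMT."""
    return format_datetime(when.astimezone(timezone.utc), usegmt=True)


def timegate_request(timegate_uri, when):
    """Build (but do not send) a Memento datetime-negotiation request."""
    return Request(timegate_uri, headers={"Accept-Datetime": accept_datetime(when)})


# Placeholder TimeGate; the date is that of the NOTaS meeting.
moment = datetime(2017, 12, 15, 12, 0, tzinfo=timezone.utc)
req = timegate_request("https://timegate.example.org/dataset/1", moment)
print(accept_datetime(moment))
```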

Editor's notes

  1. Term often heard in this context: semantic web
  2. Four principles of LD, as formulated by Tim Berners-Lee in 2006. URI: uniform resource identifier
  3. You can also reason over the triples (example: Bob and Carl are (half-)brothers)
  4. An example of such a graph database is the LOD cloud. The picture shows all datasets that have been published in LD format (up until 22.08.2017)