SlideShare una empresa de Scribd logo
1 de 37
Descargar para leer sin conexión
Linked Open Data:
A simple how-to
Nicola Vitucci
nicola.vitucci@gmail.com
Intro
● LOD: Linked Open Data
● Linked Data + Open Data = “5-star” data
● The WWW is on the way to become an
immense database (Web of Data)
● What does this mean? What is it made
of?What is it for? Whom is it for?
Intro
Going from this...
Tartu is the second largest city of Estonia,
following Estonia's political and financial
capital Tallinn.
Tartu is often considered the intellectual
centre of the country, especially since it is
home to the nation's oldest and most
renowned university, the University of
Tartu.
In German, Swedish and Polish the town
has been known and is sometimes still
referred to as Dorpat, a variant of
Tarbatu.
The University of Tartu (Estonian: Tartu
Ülikool, Latin: Universitas Tartuensis) is a
classical university in the city of Tartu,
Estonia.
Intro
… to this!
Tartu
Estonia
Tallinn
University of Tartu
city
college town
Dorpat
Tarbatu
Tartu Ülikool Universitas Tartuensis
hasCapital
is a
name
name
name (official) name (Latin)
located in
in country
is a
LOD cloud
http://lod-cloud.net/versions/2014-08-30/lod-cloud_colored.png
LOD cloud
http://lod-cloud.net/versions/2014-08-30/lod-cloud_colored.png
5-star data
http://5stardata.info
5-star data: how
★ make your stuff available on the Web
(whatever format) under an open license
★★ make it available as structured data (e.g.,
Excel instead of image scan of a table)
★★★ make it available in a non-proprietary open
format (e.g., CSV rather than Excel)
★★★★ use URIs to denote things, so that people can
point at your stuff
★★★★★ link your data to other data to provide context
LD principles
● Original design rules
– Use URIs as unique identifiers for resources (not the
same as URL)
– Use the HTTP URI scheme (rather than other
schemes such as URN), so that URL = URI
– When an ID is dereferenced (= looked up), give
useful information using the standards (e.g. RDF)
– Provide links to other resources
● LOD = LD + open license
RDF model
● RDF (Resource Description Framework) is
a fundamental brick to build LD
● It is built on the concept of triple: a subject
linked to an object by means of a predicate
ns2:Ingredient 1
ns2:Ingredient 2
ns2:Product1
ns:product
ns:product
10
20
ns:weight
ns:weight
ns = http://www.example.com/ ns2 = http://www.anotherexample.com/
RDF: serialization
● It is possible to use content negotiation to get the
same file in different serialization formats
● Linux: use the curl command
– $ curl -L -H "Accept: application/rdf+xml"
http://dbpedia.org/resource/Tartu
– $ curl -L -H "Accept: text/turtle”
http://dbpedia.org/resource/Tartu
● There are also REST clients for Firefox and Chrome
RDF: N-Triples
<http://dbpedia.org/resource/Tartu>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://dbpedia.org/ontology/City>
<http://dbpedia.org/resource/Tartu>
<http://www.w3.org/2000/01/rdf-schema#label> "Tartu"@en .
<http://dbpedia.org/resource/Tartu>
<http://dbpedia.org/ontology/populationTotal>
"97332"^^<http://www.w3.org/2001/XMLSchema#nonNegativeI
nteger> .
<http://dbpedia.org/resource/Tartu>
<http://dbpedia.org/ontology/maximumElevation>
"78.9432"^^<http://www.w3.org/2001/XMLSchema#double> .
RDF: JSON-LD
{
"http://dbpedia.org/resource/Tartu": {
"http://www.w3.org/1999/02/22-rdf-syntax-ns#type": [{
"type": "uri", "value": "http://dbpedia.org/ontology/PopulatedPlace"}],
"http://www.w3.org/2000/01/rdf-schema#label": [{
"type": "literal", "value": "Tartu", "lang": "de"}],
"http://dbpedia.org/ontology/populationTotal": [{
"type": "literal", "value": "97332", "datatype":
"http://www.w3.org/2001/XMLSchema#nonNegativeInteger" }]
}
RDF: Turtle
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix dbr: <http://dbpedia.org/resource/> .
@prefix dbo: <http://dbpedia.org/ontology/> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
dbr:Tartu rdf:type dbo:City
dbr:Tartu rdfs:label "Tartu"@en .
dbr:Tartu dbo:populationTotal "97332"^^xsd:nonNegativeInteger .
dbr:Tartu dbo:maximumElevation "78.9432"^^xsd:double .
RDF: XML
<?xml version="1.0" encoding="utf-8" ?>
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:dbo="http://dbpedia.org/ontology/">
<rdf:Description rdf:about="http://dbpedia.org/resource/Tartu">
<rdf:type rdf:resource="http://dbpedia.org/ontology/City" />
<rdfs:label xml:lang="en">Tartu</rdfs:label>
<dbo:populationTotal
rdf:datatype="http://www.w3.org/2001/XMLSchema#nonNegativeInteger">97332</
dbo:populationTotal>
</rdf:RDF>
RDF: data schema
● In a relational database, we have to look
for definitions in the data schema
● Using RDF, instead, we can fully describe
data and their schema!
● In order to do this, we need vocabularies
– Every term in a vocabulary has a
common base URI called namespace
Common vocabularies
● rdf, rdfs, owl – RDF “core” vocabularies
● dcterms – general properties for resources
● foaf – Friend of a Friend
● geo – geolocalization
● skos – description of schemas and taxonomies
● void, dcat – description of datasets
● doap – description of projects
● rdf and rdfs are used basically everywhere,
since they are used to define the data
schema
● Using rdf we can say that an entity
belongs to a class of entities
● Using rdfs we can define super- and
subclass relations
Common vocabularies
Examples
● “Tartu is a city”
– dbr:Tartu rdf:type dbo:City
– dbr:Tartu a dbo:City
● “Cities are settlements”
– dbo:City rdfs:subClassOf
dbo:Settlement
Ontologies
● An ontology is a model used to describe a
domain
● Ontologies can be used to describe
complex, interesting concepts
● The may be hard to develop, because
logical and modelling decisions are not
always straightforward
Using LD
● Should we know all the details about RDF
to be able to use LD?
● “Follow your nose” approach thanks to links
– https://www.wikidata.org
– http://sameas.org
– https://datahub.io
Using LD
Search Tartu on Wikidata
more links to visit!
Using LD
● SPARQL is to RDF what SQL is to databases:
a query language
● A SPARQL endpoint is a resource where
SPARQL queries can be sent to and data can
be retrieved from
● Some SPARQL endpoints:
– https://query.wikidata.org/
– http://dbpedia.org/sparql
SPARQL queries
● I want to find some interesting facts about
Tartu
● Let’s go to https://www.wikidata.org and
search for Tartu again
● Let’s take a note of the “Q number”
SPARQL queries
● Now let’s go on https://query.wikidata.org
● Let’s insert this query
SELECT DISTINCT *
WHERE {
?person wdt:P19 wd:Q13972; rdfs:label ?personName .
FILTER (LANG(?personName) = "en")
}
ORDER BY ?personName
SPARQL queries
● What do we get?
SPARQL queries
● Anyone born in Tartu whose name looks Italian?
SELECT DISTINCT *
WHERE {
?person wdt:P19 wd:Q13972; rdfs:label ?personName;
wdt:P735/wdt:P407 wd:Q652 .
FILTER (LANG(?personName) = "en")
}
ORDER BY ?personName
SPARQL queries
● Anyone born in Tartu who died somewhere in Italy?
SELECT DISTINCT *
WHERE {
?person wdt:P19 wd:Q13972; rdfs:label ?personName;
wdt:P20 ?place .
?place wdt:P17 wd:Q38; rdfs:label ?placeName .
FILTER (LANG(?personName) = "en" && LANG(?placeName) = "en")
}
ORDER BY ?personName
Software
● There is plenty of software to play with
LOD
– Python: rdflib
(http://rdflib.readthedocs.io)
– Java: the Apache Jena project
(https://jena.apache.org/)
Advantages
● Easier interlinking of heterogeneous data
● Easier creation and maintenance of data
schemas
● Distributed “by default”
● Controlled definition of shared knowledge
Challenges
● Rather new topic
– Needs skill and experience
● As data size increases, performance may worsen
– However, this depends on the use case
● Extra care is necessary when using distributed data
sources
– Accessibility & availability issues
– Data quality
Questions?
Thank you!
Aitäh!
Grazie!
Wikidata
● Wikidata is a free and open knowledge base that can
be read and edited by both humans and machines.
● Wikidata acts as central storage for the structured
data of its Wikimedia sister projects including
Wikipedia, Wikivoyage, Wikisource, and others.
● Wikidata also provides support to many other sites and
services beyond just Wikimedia projects! The content
of Wikidata is available under a free license,
exported using standard formats, and can be
interlinked to other open data sets on the linked data
web.
Wikidata
● Centralized access: only
one resource to link data
belonging to (or created
from) several projects
● Management of
structured data: not just
text pages, but also data
designed according to a
schema and usable by
external software
more Tartu identifiers
Wikidata
Wikidata
● There is a playground to try these things out:
the sandbox element
● Go to
https://www.wikidata.org/wiki/Q4115189
● Start editing!
– Note: it is possible to edit without being
logged in, but (as for Wikipedia) it would
be nicer to have an account

Más contenido relacionado

La actualidad más candente

XML: A New Standard for Data
XML: A New Standard for DataXML: A New Standard for Data
XML: A New Standard for DataDaniel Stout
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminologyTenforce
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachhorvadam
 
Semantic Technologies in ST&DL
Semantic Technologies in ST&DLSemantic Technologies in ST&DL
Semantic Technologies in ST&DLAndrea Nuzzolese
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemNIT Durgapur
 
The SPARQL Anything project
The SPARQL Anything projectThe SPARQL Anything project
The SPARQL Anything projectEnrico Daga
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectEnrico Daga
 
Trying SPARQL Anything with MEI
Trying SPARQL Anything with MEITrying SPARQL Anything with MEI
Trying SPARQL Anything with MEIEnrico Daga
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introductionGraphity
 
Web ontology language (owl)
Web ontology language (owl)Web ontology language (owl)
Web ontology language (owl)Ameer Sameer
 
Owl web ontology language
Owl  web ontology languageOwl  web ontology language
Owl web ontology languagehassco2011
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedJakob .
 

La actualidad más candente (17)

XML: A New Standard for Data
XML: A New Standard for DataXML: A New Standard for Data
XML: A New Standard for Data
 
Services semantic technology_terminology
Services semantic technology_terminologyServices semantic technology_terminology
Services semantic technology_terminology
 
Demystifying RDF
Demystifying RDFDemystifying RDF
Demystifying RDF
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approach
 
Oke
OkeOke
Oke
 
Introduction to RDF
Introduction to RDFIntroduction to RDF
Introduction to RDF
 
Semantic Technologies in ST&DL
Semantic Technologies in ST&DLSemantic Technologies in ST&DL
Semantic Technologies in ST&DL
 
Development of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management SystemDevelopment of Semantic Web based Disaster Management System
Development of Semantic Web based Disaster Management System
 
Rdf
RdfRdf
Rdf
 
The SPARQL Anything project
The SPARQL Anything projectThe SPARQL Anything project
The SPARQL Anything project
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything Project
 
Trying SPARQL Anything with MEI
Trying SPARQL Anything with MEITrying SPARQL Anything with MEI
Trying SPARQL Anything with MEI
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
 
Web ontology language (owl)
Web ontology language (owl)Web ontology language (owl)
Web ontology language (owl)
 
Owl web ontology language
Owl  web ontology languageOwl  web ontology language
Owl web ontology language
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 
semantic web & natural language
semantic web & natural languagesemantic web & natural language
semantic web & natural language
 

Similar a Linked Open Data: A simple how-to

Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012scorlosquet
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013scorlosquet
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLJane Frazier
 
Hacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsHacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsArangoDB Database
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFDimitris Kontokostas
 
An Introduction to Linked Data and Microdata
An Introduction to Linked Data and MicrodataAn Introduction to Linked Data and Microdata
An Introduction to Linked Data and MicrodataDLFCLIR
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRichard.Sapon-White
 
Linked (Open) Data: A quick introduction
Linked (Open) Data: A quick introductionLinked (Open) Data: A quick introduction
Linked (Open) Data: A quick introductionnvitucci
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesChristophe Guéret
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinthsDaniel Camarda
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
 
Eclipse RDF4J - Working with RDF in Java
Eclipse RDF4J - Working with RDF in JavaEclipse RDF4J - Working with RDF in Java
Eclipse RDF4J - Working with RDF in JavaJeen Broekstra
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)jottevanger
 
Release webinar: Sansa and Ontario
Release webinar: Sansa and OntarioRelease webinar: Sansa and Ontario
Release webinar: Sansa and OntarioBigData_Europe
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationseswcsummerschool
 

Similar a Linked Open Data: A simple how-to (20)

Semantic Web Technology
Semantic Web TechnologySemantic Web Technology
Semantic Web Technology
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
 
The Danish National Bibliography as LOD
The Danish National Bibliography as LODThe Danish National Bibliography as LOD
The Danish National Bibliography as LOD
 
Hacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge GraphsHacktoberfest 2020 - Intro to Knowledge Graphs
Hacktoberfest 2020 - Intro to Knowledge Graphs
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDF
 
An Introduction to Linked Data and Microdata
An Introduction to Linked Data and MicrodataAn Introduction to Linked Data and Microdata
An Introduction to Linked Data and Microdata
 
Timbuctoo 2 EASY
Timbuctoo 2 EASYTimbuctoo 2 EASY
Timbuctoo 2 EASY
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna Pszenicyn
 
Linked (Open) Data: A quick introduction
Linked (Open) Data: A quick introductionLinked (Open) Data: A quick introduction
Linked (Open) Data: A quick introduction
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
 
Eclipse RDF4J - Working with RDF in Java
Eclipse RDF4J - Working with RDF in JavaEclipse RDF4J - Working with RDF in Java
Eclipse RDF4J - Working with RDF in Java
 
Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)Adventures in Linked Data Land (presentation by Richard Light)
Adventures in Linked Data Land (presentation by Richard Light)
 
Best of Marketing
Best of MarketingBest of Marketing
Best of Marketing
 
Release webinar: Sansa and Ontario
Release webinar: Sansa and OntarioRelease webinar: Sansa and Ontario
Release webinar: Sansa and Ontario
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
 

Último

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 

Último (20)

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 

Linked Open Data: A simple how-to

  • 1. Linked Open Data: A simple how-to Nicola Vitucci nicola.vitucci@gmail.com
  • 2. Intro ● LOD: Linked Open Data ● Linked Data + Open Data = “5-star” data ● The WWW is on the way to become an immense database (Web of Data) ● What does this mean? What is it made of?What is it for? Whom is it for?
  • 3. Intro Going from this... Tartu is the second largest city of Estonia, following Estonia's political and financial capital Tallinn. Tartu is often considered the intellectual centre of the country, especially since it is home to the nation's oldest and most renowned university, the University of Tartu. In German, Swedish and Polish the town has been known and is sometimes still referred to as Dorpat, a variant of Tarbatu. The University of Tartu (Estonian: Tartu Ülikool, Latin: Universitas Tartuensis) is a classical university in the city of Tartu, Estonia.
  • 4. Intro … to this! Tartu Estonia Tallinn University of Tartu city college town Dorpat Tarbatu Tartu Ülikool Universitas Tartuensis hasCapital is a name name name (official) name (Latin) located in in country is a
  • 8. 5-star data: how ★ make your stuff available on the Web (whatever format) under an open license ★★ make it available as structured data (e.g., Excel instead of image scan of a table) ★★★ make it available in a non-proprietary open format (e.g., CSV rather than Excel) ★★★★ use URIs to denote things, so that people can point at your stuff ★★★★★ link your data to other data to provide context
  • 9. LD principles ● Original design rules – Use URIs as unique identifiers for resources (not the same as URL) – Use the HTTP URI scheme (rather than other schemes such as URN), so that URL = URI – When an ID is dereferenced (= looked up), give useful information using the standards (e.g. RDF) – Provide links to other resources ● LOD = LD + open license
  • 10. RDF model ● RDF (Resource Description Framework) is a fundamental brick to build LD ● It is built on the concept of triple: a subject linked to an object by means of a predicate ns2:Ingredient 1 ns2:Ingredient 2 ns2:Product1 ns:product ns:product 10 20 ns:weight ns:weight ns = http://www.example.com/ ns2 = http://www.anotherexample.com/
  • 11. RDF: serialization ● It is possible to use content negotiation to get the same file in different serialization formats ● Linux: use the curl command – $ curl -L -H "Accept: application/rdf+xml" http://dbpedia.org/resource/Tartu – $ curl -L -H "Accept: text/turtle” http://dbpedia.org/resource/Tartu ● There are also REST clients for Firefox and Chrome
  • 12. RDF: N-Triples <http://dbpedia.org/resource/Tartu> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://dbpedia.org/ontology/City> <http://dbpedia.org/resource/Tartu> <http://www.w3.org/2000/01/rdf-schema#label> "Tartu"@en . <http://dbpedia.org/resource/Tartu> <http://dbpedia.org/ontology/populationTotal> "97332"^^<http://www.w3.org/2001/XMLSchema#nonNegativeI nteger> . <http://dbpedia.org/resource/Tartu> <http://dbpedia.org/ontology/maximumElevation> "78.9432"^^<http://www.w3.org/2001/XMLSchema#double> .
  • 13. RDF: JSON-LD { "http://dbpedia.org/resource/Tartu": { "http://www.w3.org/1999/02/22-rdf-syntax-ns#type": [{ "type": "uri", "value": "http://dbpedia.org/ontology/PopulatedPlace"}], "http://www.w3.org/2000/01/rdf-schema#label": [{ "type": "literal", "value": "Tartu", "lang": "de"}], "http://dbpedia.org/ontology/populationTotal": [{ "type": "literal", "value": "97332", "datatype": "http://www.w3.org/2001/XMLSchema#nonNegativeInteger" }] }
  • 14. RDF: Turtle @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix dbr: <http://dbpedia.org/resource/> . @prefix dbo: <http://dbpedia.org/ontology/> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . dbr:Tartu rdf:type dbo:City dbr:Tartu rdfs:label "Tartu"@en . dbr:Tartu dbo:populationTotal "97332"^^xsd:nonNegativeInteger . dbr:Tartu dbo:maximumElevation "78.9432"^^xsd:double .
  • 15. RDF: XML <?xml version="1.0" encoding="utf-8" ?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:dbo="http://dbpedia.org/ontology/"> <rdf:Description rdf:about="http://dbpedia.org/resource/Tartu"> <rdf:type rdf:resource="http://dbpedia.org/ontology/City" /> <rdfs:label xml:lang="en">Tartu</rdfs:label> <dbo:populationTotal rdf:datatype="http://www.w3.org/2001/XMLSchema#nonNegativeInteger">97332</ dbo:populationTotal> </rdf:RDF>
  • 16. RDF: data schema ● In a relational database, we have to look for definitions in the data schema ● Using RDF, instead, we can fully describe data and their schema! ● In order to do this, we need vocabularies – Every term in a vocabulary has a common base URI called namespace
  • 17. Common vocabularies ● rdf, rdfs, owl – RDF “core” vocabularies ● dcterms – general properties for resources ● foaf – Friend of a Friend ● geo – geolocalization ● skos – description of schemas and taxonomies ● void, dcat – description of datasets ● doap – description of projects
  • 18. ● rdf and rdfs are used basically everywhere, since they are used to define the data schema ● Using rdf we can say that an entity belongs to a class of entities ● Using rdfs we can define super- and subclass relations Common vocabularies
  • 19. Examples ● “Tartu is a city” – dbr:Tartu rdf:type dbo:City – dbr:Tartu a dbo:City ● “Cities are settlements” – dbo:City rdfs:subClassOf dbo:Settlement
  • 20. Ontologies ● An ontology is a model used to describe a domain ● Ontologies can be used to describe complex, interesting concepts ● The may be hard to develop, because logical and modelling decisions are not always straightforward
  • 21. Using LD ● Should we know all the details about RDF to be able to use LD? ● “Follow your nose” approach thanks to links – https://www.wikidata.org – http://sameas.org – https://datahub.io
  • 22. Using LD Search Tartu on Wikidata more links to visit!
  • 23. Using LD ● SPARQL is to RDF what SQL is to databases: a query language ● A SPARQL endpoint is a resource where SPARQL queries can be sent to and data can be retrieved from ● Some SPARQL endpoints: – https://query.wikidata.org/ – http://dbpedia.org/sparql
  • 24. SPARQL queries ● I want to find some interesting facts about Tartu ● Let’s go to https://www.wikidata.org and search for Tartu again ● Let’s take a note of the “Q number”
  • 25. SPARQL queries ● Now let’s go on https://query.wikidata.org ● Let’s insert this query SELECT DISTINCT * WHERE { ?person wdt:P19 wd:Q13972; rdfs:label ?personName . FILTER (LANG(?personName) = "en") } ORDER BY ?personName
  • 27. SPARQL queries ● Anyone born in Tartu whose name looks Italian? SELECT DISTINCT * WHERE { ?person wdt:P19 wd:Q13972; rdfs:label ?personName; wdt:P735/wdt:P407 wd:Q652 . FILTER (LANG(?personName) = "en") } ORDER BY ?personName
  • 28. SPARQL queries ● Anyone born in Tartu who died somewhere in Italy? SELECT DISTINCT * WHERE { ?person wdt:P19 wd:Q13972; rdfs:label ?personName; wdt:P20 ?place . ?place wdt:P17 wd:Q38; rdfs:label ?placeName . FILTER (LANG(?personName) = "en" && LANG(?placeName) = "en") } ORDER BY ?personName
  • 29. Software ● There is plenty of software to play with LOD – Python: rdflib (http://rdflib.readthedocs.io) – Java: the Apache Jena project (https://jena.apache.org/)
  • 30. Advantages ● Easier interlinking of heterogeneous data ● Easier creation and maintenance of data schemas ● Distributed “by default” ● Controlled definition of shared knowledge
  • 31. Challenges ● Rather new topic – Needs skill and experience ● As data size increases, performance may worsen – However, this depends on the use case ● Extra care is necessary when using distributed data sources – Accessibility & availability issues – Data quality
  • 34. Wikidata ● Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. ● Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wikisource, and others. ● Wikidata also provides support to many other sites and services beyond just Wikimedia projects! The content of Wikidata is available under a free license, exported using standard formats, and can be interlinked to other open data sets on the linked data web.
  • 35. Wikidata ● Centralized access: only one resource to link data belonging to (or created from) several projects ● Management of structured data: not just text pages, but also data designed according to a schema and usable by external software more Tartu identifiers
  • 37. Wikidata ● There is a playground to try these things out: the sandbox element ● Go to https://www.wikidata.org/wiki/Q4115189 ● Start editing! – Note: it is possible to edit without being logged in, but (as for Wikipedia) it would be nicer to have an account