SlideShare una empresa de Scribd logo
1 de 64
Data-mining the Semantic Web
and spatially visualising the results
DAH workshop
Trinity College Dublin 27 May 2015
1 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Workshop overview
• Morning session : Data-mining
– Open Data
– Linked Data
– Linked Open Data implementation
– Semantic Web and ontologies
– Hands-on practical exercises
2 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Workshop overview
• Afternoon session : Data visualisation
– Data visualisation concepts introduction
– Web maps and geo-tagging
– Hands-on practical
– Interpretations
– Hermeneutic circle
3 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
But first, a very quick survey
• Your occupation
– UG student
– PG student
– Professional academic
– Non-academic
4 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• Your age group
– Under 16
– 16-24
– 25-34
– 35-44
– 45-54
– 55 and over
5 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• How familiar are you with Open Access?
– 1 - Not familiar at all
– 2
– 3
– 4
– 5 – Very familiar
6 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• How familiar are you with Open Data?
– 1 – Not familiar at all
– 2
– 3
– 4
– 5 – Very familiar
7 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• How familiar are you with Linked Data?
– 1 – Not familiar at all
– 2
– 3
– 4
– 5 – Very familiar
8 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• How familiar are you with the Semantic Web?
– 1 – Not familiar at all
– 2
– 3
– 4
– 5 – Very familiar
9 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• Have you ever published Open Data?
– Yes
– No
10 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• Have you ever consumed Linked Open Data
services?
– Yes
– No
11 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A quick survey
• Please fill in your…
– Name
– Email address
Don’t worry – I’m not going to pass them on to anyone 
12 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
From the horse’s mouth
(source: www.ted.com/talks/tim_berners_lee_on_the_next_web)
13 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
14 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Open Access
Terminology
Open Data
Big Data
The web of data
The Semantic Web
Linked Data
data mining
15 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Asking questions of digital datasets
Terminology
16 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Open Access
Terminology
17 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Design by Julie Beck
for the Harvard University Neuroinformatics dept
(source: www.juliebcreative.com/portfolio/open-data-logo/)
18 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Linked Data
Terminology
The linkages between the major Linked Data datasets (source: lod-cloud.net)
19 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Big Data
Terminology
Wordle of terms associated with Big Data activity (source: sfdata.startupweekend.org)
20 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
5 Stars of Open Data
put your data online under an open license
make it structured (e.g. as an Excel file)
use non-proprietary formats (e.g. XML and not Excel)
use URIs to identify resources
link your data to external datasets
21 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The RDF Triple
22 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
A Triple Example
‘…the boy’s name is Tom…’
subject
predicate
object
23 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Triple Linking
‘…Tom is short for Thomas…’
subject
predicate
object
24 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Graph data
25 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Serialising RDF
• Turtle
• JSON
• RDF/XML
• N-Triples
26 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
RDF Turtle
@base <http://example.org/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix rel: <http://www.perceive.net/schemas/relationship/> .
<green-goblin>
rel:enemyOf <spiderman> ;
a foaf:Person ; # in the context of the Marvel universe
foaf:name "Green Goblin" .
<spiderman>
rel:enemyOf <green-goblin> ;
a foaf:Person ;
foaf:name "Spiderman", "Человек-паук"@ru .
1
2
3
27 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
As N-Triples
<http://example.org/green-goblin> <http://www.perceive.net/schemas/relationship/enemyOf>
<http://example.org/spiderman> .
<http://example.org/green-goblin> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://xmlns.com/foaf/0.1/Person> .
<http://example.org/green-goblin> <http://xmlns.com/foaf/0.1/name> "Green Goblin" .
<http://example.org/spiderman> <http://www.perceive.net/schemas/relationship/enemyOf>
<http://example.org/green-goblin> .
<http://example.org/spiderman> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://xmlns.com/foaf/0.1/Person> .
<http://example.org/spiderman> <http://xmlns.com/foaf/0.1/name> "Spiderman" .
<http://example.org/spiderman> <http://xmlns.com/foaf/0.1/name>
"u00D0u00A7u00D0u00B5u00D0u00BBu00D0u00BEu00D0u00B2u00D0u00B5u00D0u0
0BA-u00D0u00BFu00D0u00B0u00D1u0083u00D0u00BA"@ru .
28 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
As JSON
{"http://example.org/green-
goblin":{"http://www.perceive.net/schemas/relationship/enemyOf":[{"ty
pe":"uri","value":"http://example.org/spiderman"}],"http://www.w3.org
/1999/02/22-rdf-syntax-
ns#type":[{"type":"uri","value":"http://xmlns.com/foaf/0.1/Person"}],"ht
tp://xmlns.com/foaf/0.1/name":[{"type":"literal","value":"Green
Goblin"}]},"http://example.org/spiderman":{"http://www.perceive.net/s
chemas/relationship/enemyOf":[{"type":"uri","value":"http://example.org
/green-goblin"}],"http://www.w3.org/1999/02/22-rdf-syntax-
ns#type":[{"type":"uri","value":"http://xmlns.com/foaf/0.1/Person"}],"ht
tp://xmlns.com/foaf/0.1/name":[{"type":"literal","value":"Spiderman"},{
"type":"literal","value":"u0427u0435u043bu043eu0432u0435u043a-
u043fu0430u0443u043a","lang":"ru"}]}}
29 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
As RDF/XML
<?xml version="1.0" encoding="utf-8" ?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:ns0="http://www.perceive.net/schemas/relationship/">
<foaf:Person rdf:about="http://example.org/green-goblin">
<ns0:enemyOf>
<foaf:Person rdf:about="http://example.org/spiderman">
<ns0:enemyOf rdf:resource="http://example.org/green-goblin"/>
<foaf:name>Spiderman</foaf:name>
<foaf:name xml:lang="ru">Человек-паук</foaf:name>
</foaf:Person>
</ns0:enemyOf>
<foaf:name>Green Goblin</foaf:name>
</foaf:Person>
</rdf:RDF>
30 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Visualised as a Graph
31 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Triplestores
and
Infrastructure
A server farm (source: www.cirrusinsight.com)
32 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Making RDF
http://www.franklynam.com/blog.aspx?id=85
Q: Create RDF representations of yourself and
your relationships
33 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The Semantic Web and Ontologies
The stages of the Web (source: urenio.org)
34 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Ontological Classes and Properties
35 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The British Museum data mapping onto the CIDOC CRM
(source: confluence.ontotext.com/display/ResearchSpace/BM+Mapping)
36 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The CIDOC CRM basic entity types and their relationships
(source: www.cidoc-crm.org/)
37 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Vocabularies
38 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Graph data
39 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Minna Sundberg (source: www.sssscomic.com/comic.php?page=196)
40 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Querying using SPARQL
SELECT *
WHERE {
?s ?p ?o
} LIMIT 10
41 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
More complex SPARQL
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX letters1916: <http://letters1916.linkedarc.net/ontology/>
PREFIX letters1916data: <http://letters1916.linkedarc.net/data/>
PREFIX schema: <http://schema.org/>
SELECT DISTINCT ?letter ?letterName ?recipientPostalAddressName ?recipientLongitude ?recipientLatitude
WHERE {
?letter rdf:type letters1916:Letter ;
schema:name ?letterName ;
letters1916:recipientLocation ?recipientPostalAddress .
?recipientPostalAddress schema:addressRegion ?recipientPostalAddressRegion ;
FILTER regex(?recipientPostalAddressRegion, 'Galway', 'i')
?recipientPostalAddress schema:name ?recipientPostalAddressName .
?recipientPlace schema:address ?recipientPostalAddress ;
schema:geo ?recipientGeoCoordinates .
?recipientGeoCoordinates schema:longitude ?recipientLongitude ;
schema:latitude ?recipientLatitude
}
1
2
3
42 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Universities on DBpedia
http://www.franklynam.com/blog.aspx?id=86
Q: Get a list of all of the universities that DBpedia
knows about
43 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
SKOS
@prefix dct: <http://purl.org/dc/terms/> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix cc: <http://creativecommons.org/ns#> .
<http://linkedarc.net/vocabs/vessel-jar> a skos:Concept ;
cc:license <http://creativecommons.org/licenses/by/3.0> ;
cc:attributionURL <http://linkedarc.net> ;
cc:attributionName "linkedarc.net" ;
skos:inScheme <http://linkedarc.net/vocabs> ;
skos:prefLabel “Jar" ;
skos:scopeNote ”A jar concept. Pottery. This isn’t a great scope note." ;
dct:publisher <http://linkedarc.net> ;
dct:identifier <http://linkedarc.net/vocabs/vessel-jar> ;
dct:issued "2015-02-23"^^xsd:date ;
skos:exactMatch <http://purl.org/heritagedata/schemes/mda_obj/concepts/97609> .
44 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
SPARQL + FILTER
SELECT * WHERE {
?s rdfs:label ?label .
FILTER langMatches(lang(?label), "en”)
}
45 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
SPARQL + FILTER
SELECT * WHERE {
?s rdfs:label ?label .
FILTER langMatches(lang(?label), "en") .
FILTER regex(?label, ”bell", "i”)
}
46 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
SPARQL + FILTER
SELECT * WHERE {
?s dct:dateCreated ?dateCreated .
FILTER (?dateCreated > '1900-01-01'
}
47 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: British Museum Sarcophagi
Q: Get the find spots of all of the sarcophagi in
the British Museum collection
SPARQL endpoint: http://collection.britishmuseum.org/sparql
48 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Archaeological stratigraphy
Q: Get the stratigraphic relationships between
the contexts excavated at Priniatikos Pyrgos
SPARQL endpoint: http://linkedarc.net/sparql
49 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Stratigraphy explained (very briefly…)
Sample stratigraphic sequence (source: www.lparchaeology.com)
50 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The Priniatikos Pyrgos ontology
51 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Archaeological stratigraphy
Q: Get the stratigraphic relationships between
the contexts excavated at Priniatikos Pyrgos
SPARQL endpoint: http://linkedarc.net/sparql
Hint: you will need to traverse 2 levels of the ontology’s
hierarchy to get at the stratigraphy data
52 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Nomisma and Ancient Coins
Q: Get the geo-coordinates of all of the coin
hoards stored in the Nomisma triplestore
SPARQL endpoint: http://nomisma.org/sparql
53 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Geo-coding the Find Spots
with Google Refine
54 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
The Google Maps API
Address String
Geo-coordinates as JSON
55 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Export as CSV
56 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Practical: Getty Concepts
Q: Get all of the Getty URIs that represent
concepts related to amphorae
SPARQL endpoint: http://vocab.getty.edu/sparql
57 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Additional Linked Data Resources
http://www.franklynam.com/blog.aspx?id=89
58 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
One final quick survey
• Please arrange the practicals in terms of how
easy they were to complete (1 for hardest and
5 for easiest)?
– Making your FOAF profile
– DBpedia universities
– British Museum sarcophagi hunting
– Getty vocabularies
– Nomisma coin hoards
59 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
One final quick survey
• Would you consider publishing Linked Open
Data in the future?
– 1 – Absolutely not
– 2
– 3
– 4
– 5 – Definitely
60 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
One final quick survey
• Would you consider using Linked Open Data
resources (using SPARQL or otherwise) in the
future?
– 1 – Absolutely not
– 2
– 3
– 4
– 5 – Definitely
61 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
One final quick survey
• Is Linked Open Data a feasible platform on
which to undertake humanities research?
– 1 – Absolutely not
– 2
– 3
– 4
– 5 – Definitely
62 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
One final quick survey
• Any final comments?
63 of 63@flynam @bilusaurus
Data-mining the Semantic Web and spatially visualising the results
DAH workshop
Thank you!
Martin Lemay (source: twitter.com/martinlemay)

Más contenido relacionado

La actualidad más candente

Mapping Tweets to Conference Talks: A Goldmine for Semantics
Mapping Tweets to Conference Talks: A Goldmine for SemanticsMapping Tweets to Conference Talks: A Goldmine for Semantics
Mapping Tweets to Conference Talks: A Goldmine for Semantics
Milan Stankovic
 
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
Kalman Graffi
 

La actualidad más candente (17)

Linked Data and Tools
Linked Data and ToolsLinked Data and Tools
Linked Data and Tools
 
Mapping Tweets to Conference Talks: A Goldmine for Semantics
Mapping Tweets to Conference Talks: A Goldmine for SemanticsMapping Tweets to Conference Talks: A Goldmine for Semantics
Mapping Tweets to Conference Talks: A Goldmine for Semantics
 
Computer Scientists Retrieval - PDF Report
Computer Scientists Retrieval - PDF ReportComputer Scientists Retrieval - PDF Report
Computer Scientists Retrieval - PDF Report
 
The Largest Data Science Program in the World: The Johns Hopkins Data Science...
The Largest Data Science Program in the World: The Johns Hopkins Data Science...The Largest Data Science Program in the World: The Johns Hopkins Data Science...
The Largest Data Science Program in the World: The Johns Hopkins Data Science...
 
Linked Data
Linked DataLinked Data
Linked Data
 
10 things statistics taught us about big data
10 things statistics taught us about big data10 things statistics taught us about big data
10 things statistics taught us about big data
 
Reminiscing about interoperability
Reminiscing about interoperabilityReminiscing about interoperability
Reminiscing about interoperability
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
 
Data Science Education at JHSPH
Data Science Education at JHSPHData Science Education at JHSPH
Data Science Education at JHSPH
 
Data management for researchers
Data management for researchersData management for researchers
Data management for researchers
 
Methods for Intrinsic Evaluation of Links in the Web of Data
Methods for Intrinsic Evaluation of Links in the Web of DataMethods for Intrinsic Evaluation of Links in the Web of Data
Methods for Intrinsic Evaluation of Links in the Web of Data
 
Data Provenance and its role in Data Science
Data Provenance and its role in Data ScienceData Provenance and its role in Data Science
Data Provenance and its role in Data Science
 
(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)Analyzing social media with Python and other tools (1/4)
Analyzing social media with Python and other tools (1/4)
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ Deloitte
 
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
IEEE CRS 2014 - Secure Distributed Data Structures for Peer-to-Peer-based Soc...
 

Destacado

Destacado (7)

Data-mining the Semantic Web
Data-mining the Semantic WebData-mining the Semantic Web
Data-mining the Semantic Web
 
Semantic Web Mining
Semantic Web MiningSemantic Web Mining
Semantic Web Mining
 
Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic Web
 
Semantic Web Application Development
Semantic Web Application DevelopmentSemantic Web Application Development
Semantic Web Application Development
 
Developing A Semantic Web Application - ISWC 2008 tutorial
Developing A Semantic Web Application -  ISWC 2008 tutorialDeveloping A Semantic Web Application -  ISWC 2008 tutorial
Developing A Semantic Web Application - ISWC 2008 tutorial
 
Building a semantic website
Building a semantic websiteBuilding a semantic website
Building a semantic website
 
Semantic Web For Dummies
Semantic Web For DummiesSemantic Web For Dummies
Semantic Web For Dummies
 

Similar a Data-mining the Semantic Web @TCD

How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open data
Marie Gustafsson Friberger
 

Similar a Data-mining the Semantic Web @TCD (20)

Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
 
How the Web can change social science research (including yours)
How the Web can change social science research (including yours)How the Web can change social science research (including yours)
How the Web can change social science research (including yours)
 
Presentation of Profiling Similarity Links in LOD @ DesWEB, ICDE 2016
Presentation of Profiling Similarity Links in LOD @ DesWEB, ICDE 2016Presentation of Profiling Similarity Links in LOD @ DesWEB, ICDE 2016
Presentation of Profiling Similarity Links in LOD @ DesWEB, ICDE 2016
 
EgoSystem: Presentation to LITA, American Library Association, Nov 8 2014
EgoSystem: Presentation to LITA, American Library Association, Nov 8 2014EgoSystem: Presentation to LITA, American Library Association, Nov 8 2014
EgoSystem: Presentation to LITA, American Library Association, Nov 8 2014
 
Understanding the Standards Gap
Understanding the Standards GapUnderstanding the Standards Gap
Understanding the Standards Gap
 
SemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in PracticeSemWeb Fundamentals - Info Linking & Layering in Practice
SemWeb Fundamentals - Info Linking & Layering in Practice
 
Knowledge Graph Construction and the Role of DBPedia
Knowledge Graph Construction and the Role of DBPediaKnowledge Graph Construction and the Role of DBPedia
Knowledge Graph Construction and the Role of DBPedia
 
Discovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory searchDiscovery Hub: on-the-fly linked data exploratory search
Discovery Hub: on-the-fly linked data exploratory search
 
Linked Data: opening Scotland’s library content to the world
Linked Data: opening Scotland’s library content to the world Linked Data: opening Scotland’s library content to the world
Linked Data: opening Scotland’s library content to the world
 
Linking Open Government Data at Scale
Linking Open Government Data at Scale Linking Open Government Data at Scale
Linking Open Government Data at Scale
 
Wimmics Overview 2021
Wimmics Overview 2021Wimmics Overview 2021
Wimmics Overview 2021
 
Howison rutgers-open superposition
Howison rutgers-open superpositionHowison rutgers-open superposition
Howison rutgers-open superposition
 
Camp 4-data workshop presentation
Camp 4-data workshop presentationCamp 4-data workshop presentation
Camp 4-data workshop presentation
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
How we can understand the world through open data
How we can understand the world through open dataHow we can understand the world through open data
How we can understand the world through open data
 
(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages(Re-) Discovering Lost Web Pages
(Re-) Discovering Lost Web Pages
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
The Web We Want
The Web We WantThe Web We Want
The Web We Want
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Data-mining the Semantic Web @TCD

  • 1. Data-mining the Semantic Web and spatially visualising the results DAH workshop Trinity College Dublin 27 May 2015
  • 2. 1 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Workshop overview • Morning session : Data-mining – Open Data – Linked Data – Linked Open Data implementation – Semantic Web and ontologies – Hands-on practical exercises
  • 3. 2 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Workshop overview • Afternoon session : Data visualisation – Data visualisation concepts introduction – Web maps and geo-tagging – Hands-on practical – Interpretations – Hermeneutic circle
  • 4. 3 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop But first, a very quick survey • Your occupation – UG student – PG student – Professional academic – Non-academic
  • 5. 4 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • Your age group – Under 16 – 16-24 – 25-34 – 35-44 – 45-54 – 55 and over
  • 6. 5 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • How familiar are you with Open Access? – 1 - Not familiar at all – 2 – 3 – 4 – 5 – Very familiar
  • 7. 6 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • How familiar are you with Open Data? – 1 – Not familiar at all – 2 – 3 – 4 – 5 – Very familiar
  • 8. 7 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • How familiar are you with Linked Data? – 1 – Not familiar at all – 2 – 3 – 4 – 5 – Very familiar
  • 9. 8 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • How familiar are you with the Semantic Web? – 1 – Not familiar at all – 2 – 3 – 4 – 5 – Very familiar
  • 10. 9 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • Have you ever published Open Data? – Yes – No
  • 11. 10 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • Have you ever consumed Linked Open Data services? – Yes – No
  • 12. 11 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A quick survey • Please fill in your… – Name – Email address Don’t worry – I’m not going to pass them on to anyone 
  • 13. 12 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop From the horse’s mouth (source: www.ted.com/talks/tim_berners_lee_on_the_next_web)
  • 14. 13 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop
  • 15. 14 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Open Access Terminology Open Data Big Data The web of data The Semantic Web Linked Data data mining
  • 16. 15 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Asking questions of digital datasets Terminology
  • 17. 16 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Open Access Terminology
  • 18. 17 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Design by Julie Beck for the Harvard University Neuroinformatics dept (source: www.juliebcreative.com/portfolio/open-data-logo/)
  • 19. 18 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Linked Data Terminology The linkages between the major Linked Data datasets (source: lod-cloud.net)
  • 20. 19 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Big Data Terminology Wordle of terms associated with Big Data activity (source: sfdata.startupweekend.org)
  • 21. 20 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop 5 Stars of Open Data put your data online under an open license make it structured (e.g. as an Excel file) use non-proprietary formats (e.g. XML and not Excel) use URIs to identify resources link your data to external datasets
  • 22. 21 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The RDF Triple
  • 23. 22 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop A Triple Example ‘…the boy’s name is Tom…’ subject predicate object
  • 24. 23 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Triple Linking ‘…Tom is short for Thomas…’ subject predicate object
  • 25. 24 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Graph data
  • 26. 25 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Serialising RDF • Turtle • JSON • RDF/XML • N-Triples
  • 27. 26 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop RDF Turtle @base <http://example.org/> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix rel: <http://www.perceive.net/schemas/relationship/> . <green-goblin> rel:enemyOf <spiderman> ; a foaf:Person ; # in the context of the Marvel universe foaf:name "Green Goblin" . <spiderman> rel:enemyOf <green-goblin> ; a foaf:Person ; foaf:name "Spiderman", "Человек-паук"@ru . 1 2 3
  • 28. 27 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop As N-Triples <http://example.org/green-goblin> <http://www.perceive.net/schemas/relationship/enemyOf> <http://example.org/spiderman> . <http://example.org/green-goblin> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> . <http://example.org/green-goblin> <http://xmlns.com/foaf/0.1/name> "Green Goblin" . <http://example.org/spiderman> <http://www.perceive.net/schemas/relationship/enemyOf> <http://example.org/green-goblin> . <http://example.org/spiderman> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> . <http://example.org/spiderman> <http://xmlns.com/foaf/0.1/name> "Spiderman" . <http://example.org/spiderman> <http://xmlns.com/foaf/0.1/name> "u00D0u00A7u00D0u00B5u00D0u00BBu00D0u00BEu00D0u00B2u00D0u00B5u00D0u0 0BA-u00D0u00BFu00D0u00B0u00D1u0083u00D0u00BA"@ru .
  • 29. 28 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop As JSON {"http://example.org/green- goblin":{"http://www.perceive.net/schemas/relationship/enemyOf":[{"ty pe":"uri","value":"http://example.org/spiderman"}],"http://www.w3.org /1999/02/22-rdf-syntax- ns#type":[{"type":"uri","value":"http://xmlns.com/foaf/0.1/Person"}],"ht tp://xmlns.com/foaf/0.1/name":[{"type":"literal","value":"Green Goblin"}]},"http://example.org/spiderman":{"http://www.perceive.net/s chemas/relationship/enemyOf":[{"type":"uri","value":"http://example.org /green-goblin"}],"http://www.w3.org/1999/02/22-rdf-syntax- ns#type":[{"type":"uri","value":"http://xmlns.com/foaf/0.1/Person"}],"ht tp://xmlns.com/foaf/0.1/name":[{"type":"literal","value":"Spiderman"},{ "type":"literal","value":"u0427u0435u043bu043eu0432u0435u043a- u043fu0430u0443u043a","lang":"ru"}]}}
  • 30. 29 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop As RDF/XML <?xml version="1.0" encoding="utf-8" ?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:ns0="http://www.perceive.net/schemas/relationship/"> <foaf:Person rdf:about="http://example.org/green-goblin"> <ns0:enemyOf> <foaf:Person rdf:about="http://example.org/spiderman"> <ns0:enemyOf rdf:resource="http://example.org/green-goblin"/> <foaf:name>Spiderman</foaf:name> <foaf:name xml:lang="ru">Человек-паук</foaf:name> </foaf:Person> </ns0:enemyOf> <foaf:name>Green Goblin</foaf:name> </foaf:Person> </rdf:RDF>
  • 31. 30 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Visualised as a Graph
  • 32. 31 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Triplestores and Infrastructure A server farm (source: www.cirrusinsight.com)
  • 33. 32 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Making RDF http://www.franklynam.com/blog.aspx?id=85 Q: Create RDF representations of yourself and your relationships
  • 34. 33 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The Semantic Web and Ontologies The stages of the Web (source: urenio.org)
  • 35. 34 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Ontological Classes and Properties
  • 36. 35 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The British Museum data mapping onto the CIDOC CRM (source: confluence.ontotext.com/display/ResearchSpace/BM+Mapping)
  • 37. 36 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The CIDOC CRM basic entity types and their relationships (source: www.cidoc-crm.org/)
  • 38. 37 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Vocabularies
  • 39. 38 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Graph data
  • 40. 39 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Minna Sundberg (source: www.sssscomic.com/comic.php?page=196)
  • 41. 40 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Querying using SPARQL SELECT * WHERE { ?s ?p ?o } LIMIT 10
  • 42. 41 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop More complex SPARQL PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX letters1916: <http://letters1916.linkedarc.net/ontology/> PREFIX letters1916data: <http://letters1916.linkedarc.net/data/> PREFIX schema: <http://schema.org/> SELECT DISTINCT ?letter ?letterName ?recipientPostalAddressName ?recipientLongitude ?recipientLatitude WHERE { ?letter rdf:type letters1916:Letter ; schema:name ?letterName ; letters1916:recipientLocation ?recipientPostalAddress . ?recipientPostalAddress schema:addressRegion ?recipientPostalAddressRegion ; FILTER regex(?recipientPostalAddressRegion, 'Galway', 'i') ?recipientPostalAddress schema:name ?recipientPostalAddressName . ?recipientPlace schema:address ?recipientPostalAddress ; schema:geo ?recipientGeoCoordinates . ?recipientGeoCoordinates schema:longitude ?recipientLongitude ; schema:latitude ?recipientLatitude } 1 2 3
  • 43. 42 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Universities on DBpedia http://www.franklynam.com/blog.aspx?id=86 Q: Get a list of all of the universities that DBpedia knows about
  • 44. 43 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop SKOS @prefix dct: <http://purl.org/dc/terms/> . @prefix skos: <http://www.w3.org/2004/02/skos/core#> . @prefix cc: <http://creativecommons.org/ns#> . <http://linkedarc.net/vocabs/vessel-jar> a skos:Concept ; cc:license <http://creativecommons.org/licenses/by/3.0> ; cc:attributionURL <http://linkedarc.net> ; cc:attributionName "linkedarc.net" ; skos:inScheme <http://linkedarc.net/vocabs> ; skos:prefLabel “Jar" ; skos:scopeNote ”A jar concept. Pottery. This isn’t a great scope note." ; dct:publisher <http://linkedarc.net> ; dct:identifier <http://linkedarc.net/vocabs/vessel-jar> ; dct:issued "2015-02-23"^^xsd:date ; skos:exactMatch <http://purl.org/heritagedata/schemes/mda_obj/concepts/97609> .
  • 45. 44 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop SPARQL + FILTER SELECT * WHERE { ?s rdfs:label ?label . FILTER langMatches(lang(?label), "en”) }
  • 46. 45 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop SPARQL + FILTER SELECT * WHERE { ?s rdfs:label ?label . FILTER langMatches(lang(?label), "en") . FILTER regex(?label, ”bell", "i”) }
  • 47. 46 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop SPARQL + FILTER SELECT * WHERE { ?s dct:dateCreated ?dateCreated . FILTER (?dateCreated > '1900-01-01' }
  • 48. 47 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: British Museum Sarcophagi Q: Get the find spots of all of the sarcophagi in the British Museum collection SPARQL endpoint: http://collection.britishmuseum.org/sparql
  • 49. 48 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Archaeological stratigraphy Q: Get the stratigraphic relationships between the contexts excavated at Priniatikos Pyrgos SPARQL endpoint: http://linkedarc.net/sparql
  • 50. 49 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Stratigraphy explained (very briefly…) Sample stratigraphic sequence (source: www.lparchaeology.com)
  • 51. 50 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The Priniatikos Pyrgos ontology
  • 52. 51 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Archaeological stratigraphy Q: Get the stratigraphic relationships between the contexts excavated at Priniatikos Pyrgos SPARQL endpoint: http://linkedarc.net/sparql Hint: you will need to traverse 2 levels of the ontology’s hierarchy to get at the stratigraphy data
  • 53. 52 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Nomisma and Ancient Coins Q: Get the geo-coordinates of all of the coin hoards stored in the Nomisma triplestore SPARQL endpoint: http://nomisma.org/sparql
  • 54. 53 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Geo-coding the Find Spots with Google Refine
  • 55. 54 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop The Google Maps API Address String Geo-coordinates as JSON
  • 56. 55 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Export as CSV
  • 57. 56 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Practical: Getty Concepts Q: Get all of the Getty URIs that represent concepts related to amphorae SPARQL endpoint: http://vocab.getty.edu/sparql
  • 58. 57 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Additional Linked Data Resources http://www.franklynam.com/blog.aspx?id=89
  • 59. 58 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop One final quick survey • Please arrange the practicals in terms of how easy they were to complete (1 for hardest and 5 for easiest)? – Making your FOAF profile – DBpedia universities – British Museum sarcophagi hunting – Getty vocabularies – Nomisma coin hoards
  • 60. 59 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop One final quick survey • Would you consider publishing Linked Open Data in the future? – 1 – Absolutely not – 2 – 3 – 4 – 5 – Definitely
  • 61. 60 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop One final quick survey • Would you consider using Linked Open Data resources (using SPARQL or otherwise) in the future? – 1 – Absolutely not – 2 – 3 – 4 – 5 – Definitely
  • 62. 61 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop One final quick survey • Is Linked Open Data a feasible platform on which to undertake humanities research? – 1 – Absolutely not – 2 – 3 – 4 – 5 – Definitely
  • 63. 62 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop One final quick survey • Any final comments?
  • 64. 63 of 63@flynam @bilusaurus Data-mining the Semantic Web and spatially visualising the results DAH workshop Thank you! Martin Lemay (source: twitter.com/martinlemay)

Notas del editor

  1. Me DAH PhD Archaeology as UG and MPhil IT industry background
  2. Bit of overlap Themes of the day Using LD and OD as a tool of the CH researcher As a way of dealing with Big Data And as a way of combining data from different datasets From perspective of data consumer and provider. Mainly the former. Practically focused Have your laptops ready Introduction to RDF and most popular LD technologies Introduce yourselves
  3. Tomorrow and Bilu Data viz Taking the data mined today and visualising it spatially Theory briefly Mapping on the web Geo-tagging content Very practical Visualisations aren’t the end point. They lead to more questions. Back to data-mining.
  4. For my PhD Only if you want
  5. Hold on to your page until the end of the morning’s session.
  6. Easy start Tim Berners-Lee at Ted http://www.ted.com/talks/tim_berners_lee_on_the_next_web#t-327012
  7. This will necessarily include a potted history of the field Open Access and Open Data Open Access What is it? Sharing Web 2.0 Democracy Open government Sectors affected Academia Business Journalism Typically human-readable content HTML Images Video Legality of sharing
  8. This will necessarily include a potted history of the field Open Access Open Access What is it? Sharing Web 2.0 Democracy Open government Sectors affected Academia Business Journalism Typically human-readable content HTML Images Video Legality of sharing
  9. Open Data As we saw in TBL Ted Model is the Document Web But for data What is data? Is it publications? Raw data Text Binary data 3D data Images Video Metadata Paradata Clement: live data sources for data viz
  10. Linked Data or Linked Open Data Expands the Open Data idea But more Make datasets transparent Make them inter-dependant The document web model
  11. First used by John Mashey in the mid-1990s Handling and analysis of massive datasets (Kitchin 2014, 67) By 2013 it had move from: The ‘peak of inflated expectation’ to the ‘trough of disillusionment’ Cf. Dr. Clément Levallois: plateau of productivity According to Gartner It still retains a lot of popularity in government, biz and academic sectors Data size? EAA 2014 Gabriele Gattiglia, Uni of Pisa paper Focus on approaches to data Not data size Having said that global data sizes are growing exponentially thanks to sensor data, more digital bureaucracy, commerce mainly Stat: data size growth
  12. Berners-Lee in 2006 He calls it Open Data but really should be Linked Data or LOD In fact back to earliest proposal for WWW “Evolution of objects from being principally human-readable documents to contain more machine-oriented semantic information” (Berners-Lee et al., 1994) Use the existing architecture of the WWW Publish data Link data Data-mine For one star… OK. Pause. Review Lots of terms. Lots of overlap. In a word. Open Data espouses the free movement of nodes of information within and across knowledge domains Linked Data is a superset of OD. And is often called LOD. It is everything that OD is and these data nodes are linkable. See later. Big Data: is the environment in which LOD lives. It is modus operandi. A way of approaching questions. It doesn’t have to be about massive datasets but it often is.
  13. We have done the WHAT in a very general sense. Now on to how to the HOW. Linked Open Data is a knowledge philosophy It is abstract It needs implementation Resource Description Framework Based around simple concept of the triple Very simple but when combined, it can encode great complexity Based on linguistic theory URI at core See previous 5 stars
  14. The boy’s name is Tom
  15. Tom is short for Thomas This is KEY Links create graphs of data. Graphs are not hierarchical in the sense that any one node can only have one parent. They are poly-hierarchical. Multiple parents and children.
  16. RDF needs to encoded or serialised in some way Many serialisations out there Formats N-Triples Turtle RDF XML JSON There are others We will look at Turtle
  17. Header Resource 1: Green goblin Resource 1: Spiderman Link between the two
  18. Different serialisations Same data
  19. From data provider point of view Need to think about: Storage Native triplestores Apache Jena Quad stores Named graphs Virtuoso Quad stores Interfaces Static RDF files Web API SPARQL Key. Come back to this.
  20. You have been introduced to LD and RDF Now write some Encode some meaning Using a popular ontology Read the instructions on my blog Create RDF representations of yourselves and your relationships. What better example subjects to use to understand networks than people?
  21. Back to terminology SW Web of Data Needs semantics Plus ability to find out about the structure of remote datasets What we have just been talking about Structure What do we mean? Ontologies Philosophical sense Relationship of humans to world around us CS sense Way of ordering data
  22. Car example Structured Good for data-mining Bad for determinism, essentialism General ontologies Schema.org FOAF Dublin Core CH ontologies CIDOC CRM Extensions EH ARIADNE linkedARC.net ARCHAEO-ML CHARM Or build your own
  23. British Museum’s data ontology
  24. CIDOC CRM
  25. Aka thesauri, taxonomies Literals Weak for indexing Controlled lists Balance needed Control Flexibility Seneschal project Getty AAT See practical
  26. Compare networks to trees. Poly-hierarchical to hierarchical
  27. Marc Alexander this morning
  28. The data is RDF but how do we get at the semantics? How do we query the data? We don’t want it all – we just want specific parts of it. Similar to MySQL querying Can be difficult to get head around Try it out
  29. Explain. Spend a good bit of time here. This is key to the practicals. Ask students
  30. Get an overview of the predicates associated with the dbpedia-owl:University type. Might have to use http://live.dbpedia.org/sparql instead of http://dbpedia.org/sparql
  31. Back to vocabularies SKOS Simple Knowledge Organization System Key to how CH institutions work. Since the library of Alexandria
  32. http://www.franklynam.com/blog.aspx?id=87
  33. Combine our understanding of SKOS concepts and filters. Get me all the Getty URIs that represent concepts related to amphorae. No one correct answer.
  34. Combine our understanding of SKOS concepts and filters. Get me all the Getty URIs that represent concepts related to amphorae. No one correct answer.
  35. Combine our understanding of SKOS concepts and filters. Get me all the Getty URIs that represent concepts related to amphorae. No one correct answer.
  36. Combine our understanding of SKOS concepts and filters. Get me all the Getty URIs that represent concepts related to amphorae. No one correct answer.
  37. Introduction to geo
  38. What good is a place string?
  39. Get URL for GMaps reverse geo-coding Need a GMaps API key. Signup.
  40. Combine our understanding of SKOS concepts and filters. Get me all the Getty URIs that represent concepts related to amphorae. No one correct answer.
  41. For my PhD Only if you want
  42. For my PhD Only if you want
  43. For my PhD Only if you want
  44. For my PhD Only if you want
  45. For my PhD Only if you want