2. Aims for curriculum
1.Show realistic solutions
2.Use real data
3.Use real tools
4.Show scalable solutions
5.Eat the dog’s food
EUCLID – a Curriculum for Big Linked Data 2
3. Realistic Solution
Application
Analysis & Visualization
RDFa Module
Mining Module
Access
SPARQL
Endpoint
Publishing
LD Dataset
Vocabulary Integrated
Interlinking Cleansing
Mapping Dataset
Physical Wrapper LD Wrapper R2R Transf. LD Wrapper
Data acquisition
RDF/
XML
Streaming providers Downloads
Musical Content Metadata Other content
EUCLID – a Curriculum for Big Linked Data 3
8. Real Tools
• Admitted simple start • Industry-strength
by Module 2
EUCLID – a Curriculum for Big Linked Data 8
9. Real Tools
• All tools explained by screencast
EUCLID – a Curriculum for Big Linked Data 9
10. Real Tools
• All tools explained by screencast
Explains
how
Exercise 1
was
created
EUCLID – a Curriculum for Big Linked Data 10
11. Real Tools
• The data of interest may be stored in a wide range or
formats:
Spreadsheets
Databases Text
or tabular data
• Several tools support the process of mining data
from different repositories, for example:
R2RML
EUCLID – a Curriculum for Big Linked Data 11
12. Scalable Solutions
• MusicBrainz RDF derived via R2RML:
300M
Triples
lb:artist_member a rr:TriplesMap ;
rr:logicalTable [rr:sqlQuery
"""SELECT a1.gid, a2.gid AS band
FROM artist a1
INNER JOIN l_artist_artist ON a1.id =
l_artist_artist.entity0
INNER JOIN link ON l_artist_artist.link = link.id
INNER JOIN link_type ON link_type = link_type.id
INNER JOIN artist a2 on l_artist_artist.entity1 = a2.id
WHERE link_type.gid='5be4c609-9afa-4ea0-910b-12ffb71e3821'"""]
;
rr:subjectMap [rr:template "http://musicbrainz.org/artist/{gid}#_"]
;
rr:predicateObjectMap
[rr:predicate mo:member_of ;
rr:objectMap [rr:template
"http://musicbrainz.org/artist/{band}#_" ;
rr:termType rr:IRI]] .
EUCLID – a Curriculum for Big Linked Data 12
13. Dog Food
• EUCLID output, topics and engagement monitored
public-lod
public-vocabs
semantic-web
EUCLID – a Curriculum for Big Linked Data 13
14. Dog Food
• EUCLID output, topics and engagement monitored
public-lod • Offered as public SPARQL endpoint
public-vocabs
semantic-web
EUCLID – a Curriculum for Big Linked Data 14
15. Dog Food
• EUCLID output, topics and engagement monitored
public-lod • Offered as public SPARQL endpoint
public-vocabs
semantic-web • Will be used as basis
of analysis examples
EUCLID – a Curriculum for Big Linked Data 15
16. Results
• Achieving ~100
live viewers:
• Set to exceed
1000 post hoc
views /channel
(Webinar
platform, Vime
o, Slideshare):
EUCLID – a Curriculum for Big Linked Data 16
17. For exercises, quiz and further material visit our website:
http://www.euclid-project.eu
eBook Course
Other channels:
@euclid_project EUCLID project EUCLIDproject
EUCLID – a Curriculum for Big Linked Data 17