SlideShare una empresa de Scribd logo
1 de 24
1 Knowledge Representation & Reasoning, Computer Science Department
List.MID: A MIDI-Based
Benchmark for RDF Lists
Albert Meroño-Peñuela, Vrije Universiteit Amsterdam
@albertmeronyo
Enrico Daga, The Open University
ISWC 2019, 28 October, Auckland
2
 Modelling choices are often left to subjective choice
 These practices and their reuse are key in query
performance
 Lists are everywhere! Co-authors, timelines, media,
recipes, etc.
Knowledge Representation & Reasoning, Computer Science Department
MOTIVATION
3
 And in MIDI applications
Knowledge Representation & Reasoning, Computer Science Department
MOTIVATION
 So what do we know about performance of RDF List solutions?
…
[ 144, 60, 100]
[ 128, 60, 64 ]
…
[Pic of music editing software]
4
 Modelling of RDF lists
> RDF(S) container classes (rdf:Bag, rdf:Alt, rdf:Seq)
> Closed collections (rdf:List : rdf:first, rdf:rest, rdf:nil)
> JSON-LD/Turtle syntaxes: "@list": [ "joe", "bob", "jaybee" ],
:a :b ( "bob" "alice" "carol")
> Ontology Design Patterns: Sequence OP, Collections Ontology
 Benchmark datasets and queries
> BSBM, LUBM, SP2Bench, DBPedia SPARQL, WatDiv
> LSQ
> IGUANA, LDBC
Knowledge Representation & Reasoning, Computer Science Department
RELATED WORK
5
What RDF list models are common in LOD? How can we generate data
and queries to test their impact at retrieval in a systematic and
principled manner?
C1: Survey of common list modelling practices in RDF and LOD
C2: Benchmarking data and queries enabling their comparison
C3: Evidence of present and future use
Knowledge Representation & Reasoning, Computer Science Department
RESEARCH QUESTIONS & CONTRIBUTIONS
6
Surveyed from:
 W3C standards
 The Ontology Design Patterns portal
 List choices in RDF datasets from ISWC resource track papers
 Linked Open Vocabularies (LOV)
 LOD Laundromat/LOD-a-lot [Fernández et al. ISWC 2017]
Findings: various practices that generalize to 6 list modelling patterns
Knowledge Representation & Reasoning, Computer Science Department
LIST PATTERNS
7 Knowledge Representation & Reasoning, Computer Science Department
RDF SEQUENCE AND RDF LIST
[SEQ]
[LIST]
8 Knowledge Representation & Reasoning, Computer Science Department
URI, NUMBER, TIMESTAMP IMPLICIT ORDERING
[URI]
[NUM] [TIME]
9 Knowledge Representation & Reasoning, Computer Science Department
SEQUENCE ONTOLOGY PATTERN
[SOP]
10
List.MID is an RDF list benchmark based on:
 Collaborative GitHub awesome MIDI list
 More than 300K MIDI files from the Web
 MIDI Linked Data Cloud [Meroño et al. ISWC 2017]
Consists of:
 RDF List data generator on top of real-world list data
 Collection of extensible SPARQL list-retrieval queries
Knowledge Representation & Reasoning, Computer Science Department
LIST.MID: A BENCHMARK FOR RDF LISTS
11
Custom version of midi2rdf algorithm [Meroño et al. ESWC 2016]
midi2rdf [-h]
[--format [{xml,n3,turtle,nt,pretty-xml,trix,trig,nquads,json-ld}]]
[--gz] [--order [{uri,prop_number,prop_time,seq,list,sop}]]
[--version]
filename [outfile]
Knowledge Representation & Reasoning, Computer Science Department
LIST.MID BENCHMARK DATA GENERATOR
Input
MIDI file
List pattern
to use
RDF
serialization
Compress?
12
CQ1. Full list lookup: What is the ordered content of the list?
CQ2. N-th Lookup: Which is the n-th item in the list?
CQ3. Ordered Range: What are the n…m items in the list?
Aimed at supporting use-case LOD publishing
Focus on minimal and atomic operations related to list ordered access
Do not deal (yet) with list management (edit, merge, split, etc.)
Knowledge Representation & Reasoning, Computer Science Department
LIST.MID QUERIES
13
[SEQ] WHERE {:list a midi:Track ; midi:hasEvents [ ?seq ?event ] .
BIND (xsd:integer(SUBSTR(str(?seq), 45)) AS ?index)
} ORDER BY ?index  OFFSET <N> LIMIT <M-N+1>
[LIST] SELECT ?event (COUNT(?step) as ?index) WHERE {
:list a midi:Track ; midi:hasEvents ?events . ?events rdf:rest∗ ?step .
?step rdf:rest∗ ?elt . ?elt rdf:first ?event .
} GROUP BY ?event ORDER BY ?index  rdf:rest{N}, /…{N}…/
[URI] WHERE { [] a midi:Track ; midi:hasEvent ?event .
BIND (xsd:integer(SUBSTR(str(?event), 77)) AS ?id) } ORDER BY ?id  OFFSET…
[NUM/TIME] WHERE { [] a midi:Track ; midi:hasEvent ?event .
?event midi:absoluteTick ?tick . } ORDER BY ?tick
[SOP] WHERE { [] a midi:Track ; midi:hasEvent ?event . ?event sequence:precedes?
?next_event . ?next_event sequence:follows? ?event .
BIND (xsd:integer(SUBSTR(str(?event), 77)) AS ?id)
} ORDER BY ?id
Knowledge Representation & Reasoning, Computer Science Department
LIST.MID QUERIES
14
[Daga et al. QuWeDa 2019]
Knowledge Representation & Reasoning, Computer Science Department
EVALUATION: PERFORMANCE EXPERIMENTS
Coherent results among triplestores
Critical impact of different models
rdf:List has poor performance
Property-based
better than link-based
rdf:Seq trade-off
15
N=24 in W3C semantic-web, public-lod, VU, OU
Knowledge Representation & Reasoning, Computer Science Department
EVALUATION: PERFORMANCE EXPERIMENTS
16
Lists are important! But how to assess the impact of their models?
 6 common list patterns in RDF
 List.MID: Real-world, MIDI-based RDF list benchmark data generator
 List.MID: SPARQL queries for minimal and atomic list retrieval operations
 First experiments and community interest
Limitations/future work:
 Limited set of list operations (e.g. rdf:List could win in e.g. addition)
 Extensions to other list models
 Recommendations for triplestore optimization implementations
Knowledge Representation & Reasoning, Computer Science Department
CONCLUSIONS
17
Questions, comments, suggestions
most welcome
@albertmeronyo
@enridaga
https://github.com/MIDI-LD/List.MID
Knowledge Representation & Reasoning, Computer Science
Department
THANK YOU
18
• Motivation
• Related Work
• List Pattern Survey
• List.MID Benchmark Data Generator
• List.MID SPARQL Queries
• Evaluation: Performance Experiments
• Evaluation: Survey of Interest
• Conclusions
Knowledge Representation & Reasoning, Computer Science Department
OUTLINE
19
Knowledge Representation & Reasoning, Computer Science
Department
20
Use this slide to place an image to
the left and text to the right.
 With bullet
> Secundary list
To replace the image, right-click the
image (click on it with your other
mouse button), select Change
picture… and choose the new
image.
Knowledge Representation & Reasoning, Computer Science
Department
21 Knowledge Representation & Reasoning, Computer Science Department
22 Knowledge Representation & Reasoning, Computer Science Department
23
24

Más contenido relacionado

La actualidad más candente

Das Semantische Daten Web für Unternehmen
Das Semantische Daten Web für UnternehmenDas Semantische Daten Web für Unternehmen
Das Semantische Daten Web für Unternehmen
Sören Auer
 

La actualidad más candente (20)

R programming groundup-basic-section-i
R programming groundup-basic-section-iR programming groundup-basic-section-i
R programming groundup-basic-section-i
 
Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Intro to R statistic programming
Intro to R statistic programming Intro to R statistic programming
Intro to R statistic programming
 
Web Data Management in RDF Age
Web Data Management in RDF AgeWeb Data Management in RDF Age
Web Data Management in RDF Age
 
Search engine
Search engineSearch engine
Search engine
 
Timbuctoo 2 EASY
Timbuctoo 2 EASYTimbuctoo 2 EASY
Timbuctoo 2 EASY
 
R tutorial
R tutorialR tutorial
R tutorial
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Data quality in Real Estate
Data quality in Real EstateData quality in Real Estate
Data quality in Real Estate
 
Table Retrieval and Generation
Table Retrieval and GenerationTable Retrieval and Generation
Table Retrieval and Generation
 
Congressional PageRank: Graph Analytics of US Congress With Neo4j
Congressional PageRank: Graph Analytics of US Congress With Neo4jCongressional PageRank: Graph Analytics of US Congress With Neo4j
Congressional PageRank: Graph Analytics of US Congress With Neo4j
 
Managing RDF data with graph databases
Managing RDF data with graph databasesManaging RDF data with graph databases
Managing RDF data with graph databases
 
Deriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF DataDeriving an Emergent Relational Schema from RDF Data
Deriving an Emergent Relational Schema from RDF Data
 
Personal Knowledge Mapping with Semantic Web Technologies
Personal Knowledge Mapping with Semantic Web TechnologiesPersonal Knowledge Mapping with Semantic Web Technologies
Personal Knowledge Mapping with Semantic Web Technologies
 
Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Das Semantische Daten Web für Unternehmen
Das Semantische Daten Web für UnternehmenDas Semantische Daten Web für Unternehmen
Das Semantische Daten Web für Unternehmen
 
A Map of the PyData Stack
A Map of the PyData StackA Map of the PyData Stack
A Map of the PyData Stack
 
SWT Lecture Session 8 - Rules
SWT Lecture Session 8 - RulesSWT Lecture Session 8 - Rules
SWT Lecture Session 8 - Rules
 
Introduction into R for historians (part 1: introduction)
Introduction into R for historians (part 1: introduction)Introduction into R for historians (part 1: introduction)
Introduction into R for historians (part 1: introduction)
 

Similar a List.MID: A MIDI-Based Benchmark for RDF Lists

DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
Gezim Sejdiu
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federation
Muhammad Saleem
 

Similar a List.MID: A MIDI-Based Benchmark for RDF Lists (20)

Towards efficient processing of RDF data streams
Towards efficient processing of RDF data streamsTowards efficient processing of RDF data streams
Towards efficient processing of RDF data streams
 
Optimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the webOptimized index structures for querying rdf from the web
Optimized index structures for querying rdf from the web
 
Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...Transient and persistent RDF views over relational databases in the context o...
Transient and persistent RDF views over relational databases in the context o...
 
ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.ParlBench: a SPARQL-benchmark for electronic publishing applications.
ParlBench: a SPARQL-benchmark for electronic publishing applications.
 
Validating and Describing Linked Data Portals using RDF Shape Expressions
Validating and Describing Linked Data Portals using RDF Shape ExpressionsValidating and Describing Linked Data Portals using RDF Shape Expressions
Validating and Describing Linked Data Portals using RDF Shape Expressions
 
bridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the webbridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the web
 
Stanford CS347 Guest Lecture: Apache Spark
Stanford CS347 Guest Lecture: Apache SparkStanford CS347 Guest Lecture: Apache Spark
Stanford CS347 Guest Lecture: Apache Spark
 
How Apache Spark fits into the Big Data landscape
How Apache Spark fits into the Big Data landscapeHow Apache Spark fits into the Big Data landscape
How Apache Spark fits into the Big Data landscape
 
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talkDistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
DistLODStats: Distributed Computation of RDF Dataset Statistics - ISWC 2018 talk
 
Democratizing Big Semantic Data management
Democratizing Big Semantic Data managementDemocratizing Big Semantic Data management
Democratizing Big Semantic Data management
 
Efficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federationEfficient source selection for sparql endpoint federation
Efficient source selection for sparql endpoint federation
 
Semantic Web in Action
Semantic Web in ActionSemantic Web in Action
Semantic Web in Action
 
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataMapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Streaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream ReasoningStreaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream Reasoning
 
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
The Best of Both Worlds: Unlocking the Power of (big) Knowledge Graphs with S...
 
State of the Semantic Web
State of the Semantic WebState of the Semantic Web
State of the Semantic Web
 
AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101AnzoGraph DB - SPARQL 101
AnzoGraph DB - SPARQL 101
 
RDF Stream Processing: Let's React
RDF Stream Processing: Let's ReactRDF Stream Processing: Let's React
RDF Stream Processing: Let's React
 
RDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon DunsireRDA data, linked data, and benefits for users / Gordon Dunsire
RDA data, linked data, and benefits for users / Gordon Dunsire
 

Más de Albert Meroño-Peñuela

Automatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked DataAutomatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked Data
Albert Meroño-Peñuela
 
Non-Temporal Orderings for Extensional Concept Drift
Non-Temporal Orderings for Extensional Concept DriftNon-Temporal Orderings for Extensional Concept Drift
Non-Temporal Orderings for Extensional Concept Drift
Albert Meroño-Peñuela
 
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
Detecting and Reporting Extensional Concept Drift in Statistical Linked DataDetecting and Reporting Extensional Concept Drift in Statistical Linked Data
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
Albert Meroño-Peñuela
 

Más de Albert Meroño-Peñuela (19)

Making social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked dataMaking social science more reproducible by encapsulating access to linked data
Making social science more reproducible by encapsulating access to linked data
 
What can I expect from an academic career? Valuable skills
What can I expect from an academic career? Valuable skillsWhat can I expect from an academic career? Valuable skills
What can I expect from an academic career? Valuable skills
 
The MIDI Linked Data Cloud
The MIDI Linked Data CloudThe MIDI Linked Data Cloud
The MIDI Linked Data Cloud
 
Automatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked DataAutomatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked Data
 
One Score To Rule Them All: Semantics in Music Notation
One Score To Rule Them All: Semantics in Music NotationOne Score To Rule Them All: Semantics in Music Notation
One Score To Rule Them All: Semantics in Music Notation
 
Repeatable Semantic Queries for the Linked Data Agnostic
Repeatable Semantic Queries for the Linked Data AgnosticRepeatable Semantic Queries for the Linked Data Agnostic
Repeatable Semantic Queries for the Linked Data Agnostic
 
The Statistics of Stairway to Heaven: A Semantic Story About Digital Humanities
The Statistics of Stairway to Heaven: A Semantic Story About Digital HumanitiesThe Statistics of Stairway to Heaven: A Semantic Story About Digital Humanities
The Statistics of Stairway to Heaven: A Semantic Story About Digital Humanities
 
grlc: Bridging the Gap Between RESTful APIs and Linked Data
grlc: Bridging the Gap Between RESTful APIs and Linked Datagrlc: Bridging the Gap Between RESTful APIs and Linked Data
grlc: Bridging the Gap Between RESTful APIs and Linked Data
 
grlc Makes GitHub Taste Like Linked Data APIs
grlc Makes GitHub Taste Like Linked Data APIsgrlc Makes GitHub Taste Like Linked Data APIs
grlc Makes GitHub Taste Like Linked Data APIs
 
Historical Reasoning on the Web
Historical Reasoning on the WebHistorical Reasoning on the Web
Historical Reasoning on the Web
 
How does a knowledge graph sound like? (or: music is a graph)
How does a knowledge graph sound like? (or: music is a graph)How does a knowledge graph sound like? (or: music is a graph)
How does a knowledge graph sound like? (or: music is a graph)
 
What Is Linked Historical Data?
What Is Linked Historical Data?What Is Linked Historical Data?
What Is Linked Historical Data?
 
CBS CEDAR Presentation
CBS CEDAR PresentationCBS CEDAR Presentation
CBS CEDAR Presentation
 
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data Cube
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data CubeLSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data Cube
LSD Dimensions: Use and Reuse of Linked Statistical Data as RDF Data Cube
 
Non-Temporal Orderings for Extensional Concept Drift
Non-Temporal Orderings for Extensional Concept DriftNon-Temporal Orderings for Extensional Concept Drift
Non-Temporal Orderings for Extensional Concept Drift
 
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
Detecting and Reporting Extensional Concept Drift in Statistical Linked DataDetecting and Reporting Extensional Concept Drift in Statistical Linked Data
Detecting and Reporting Extensional Concept Drift in Statistical Linked Data
 
Semantic Web for the Humanities
Semantic Web for the HumanitiesSemantic Web for the Humanities
Semantic Web for the Humanities
 
Linked Census Data
Linked Census DataLinked Census Data
Linked Census Data
 
Linked Humanities data
Linked Humanities dataLinked Humanities data
Linked Humanities data
 

Último

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Christo Ananth
 

Último (20)

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 

List.MID: A MIDI-Based Benchmark for RDF Lists

  • 1. 1 Knowledge Representation & Reasoning, Computer Science Department List.MID: A MIDI-Based Benchmark for RDF Lists Albert Meroño-Peñuela, Vrije Universiteit Amsterdam @albertmeronyo Enrico Daga, The Open University ISWC 2019, 28 October, Auckland
  • 2. 2  Modelling choices are often left to subjective choice  These practices and their reuse are key in query performance  Lists are everywhere! Co-authors, timelines, media, recipes, etc. Knowledge Representation & Reasoning, Computer Science Department MOTIVATION
  • 3. 3  And in MIDI applications Knowledge Representation & Reasoning, Computer Science Department MOTIVATION  So what do we know about performance of RDF List solutions? … [ 144, 60, 100] [ 128, 60, 64 ] … [Pic of music editing software]
  • 4. 4  Modelling of RDF lists > RDF(S) container classes (rdf:Bag, rdf:Alt, rdf:Seq) > Closed collections (rdf:List : rdf:first, rdf:rest, rdf:nil) > JSON-LD/Turtle syntaxes: "@list": [ "joe", "bob", "jaybee" ], :a :b ( "bob" "alice" "carol") > Ontology Design Patterns: Sequence OP, Collections Ontology  Benchmark datasets and queries > BSBM, LUBM, SP2Bench, DBPedia SPARQL, WatDiv > LSQ > IGUANA, LDBC Knowledge Representation & Reasoning, Computer Science Department RELATED WORK
  • 5. 5 What RDF list models are common in LOD? How can we generate data and queries to test their impact at retrieval in a systematic and principled manner? C1: Survey of common list modelling practices in RDF and LOD C2: Benchmarking data and queries enabling their comparison C3: Evidence of present and future use Knowledge Representation & Reasoning, Computer Science Department RESEARCH QUESTIONS & CONTRIBUTIONS
  • 6. 6 Surveyed from:  W3C standards  The Ontology Design Patterns portal  List choices in RDF datasets from ISWC resource track papers  Linked Open Vocabularies (LOV)  LOD Laundromat/LOD-a-lot [Fernández et al. ISWC 2017] Findings: various practices that generalize to 6 list modelling patterns Knowledge Representation & Reasoning, Computer Science Department LIST PATTERNS
  • 7. 7 Knowledge Representation & Reasoning, Computer Science Department RDF SEQUENCE AND RDF LIST [SEQ] [LIST]
  • 8. 8 Knowledge Representation & Reasoning, Computer Science Department URI, NUMBER, TIMESTAMP IMPLICIT ORDERING [URI] [NUM] [TIME]
  • 9. 9 Knowledge Representation & Reasoning, Computer Science Department SEQUENCE ONTOLOGY PATTERN [SOP]
  • 10. 10 List.MID is an RDF list benchmark based on:  Collaborative GitHub awesome MIDI list  More than 300K MIDI files from the Web  MIDI Linked Data Cloud [Meroño et al. ISWC 2017] Consists of:  RDF List data generator on top of real-world list data  Collection of extensible SPARQL list-retrieval queries Knowledge Representation & Reasoning, Computer Science Department LIST.MID: A BENCHMARK FOR RDF LISTS
  • 11. 11 Custom version of midi2rdf algorithm [Meroño et al. ESWC 2016] midi2rdf [-h] [--format [{xml,n3,turtle,nt,pretty-xml,trix,trig,nquads,json-ld}]] [--gz] [--order [{uri,prop_number,prop_time,seq,list,sop}]] [--version] filename [outfile] Knowledge Representation & Reasoning, Computer Science Department LIST.MID BENCHMARK DATA GENERATOR Input MIDI file List pattern to use RDF serialization Compress?
  • 12. 12 CQ1. Full list lookup: What is the ordered content of the list? CQ2. N-th Lookup: Which is the n-th item in the list? CQ3. Ordered Range: What are the n…m items in the list? Aimed at supporting use-case LOD publishing Focus on minimal and atomic operations related to list ordered access Do not deal (yet) with list management (edit, merge, split, etc.) Knowledge Representation & Reasoning, Computer Science Department LIST.MID QUERIES
  • 13. 13 [SEQ] WHERE {:list a midi:Track ; midi:hasEvents [ ?seq ?event ] . BIND (xsd:integer(SUBSTR(str(?seq), 45)) AS ?index) } ORDER BY ?index  OFFSET <N> LIMIT <M-N+1> [LIST] SELECT ?event (COUNT(?step) as ?index) WHERE { :list a midi:Track ; midi:hasEvents ?events . ?events rdf:rest∗ ?step . ?step rdf:rest∗ ?elt . ?elt rdf:first ?event . } GROUP BY ?event ORDER BY ?index  rdf:rest{N}, /…{N}…/ [URI] WHERE { [] a midi:Track ; midi:hasEvent ?event . BIND (xsd:integer(SUBSTR(str(?event), 77)) AS ?id) } ORDER BY ?id  OFFSET… [NUM/TIME] WHERE { [] a midi:Track ; midi:hasEvent ?event . ?event midi:absoluteTick ?tick . } ORDER BY ?tick [SOP] WHERE { [] a midi:Track ; midi:hasEvent ?event . ?event sequence:precedes? ?next_event . ?next_event sequence:follows? ?event . BIND (xsd:integer(SUBSTR(str(?event), 77)) AS ?id) } ORDER BY ?id Knowledge Representation & Reasoning, Computer Science Department LIST.MID QUERIES
  • 14. 14 [Daga et al. QuWeDa 2019] Knowledge Representation & Reasoning, Computer Science Department EVALUATION: PERFORMANCE EXPERIMENTS Coherent results among triplestores Critical impact of different models rdf:List has poor performance Property-based better than link-based rdf:Seq trade-off
  • 15. 15 N=24 in W3C semantic-web, public-lod, VU, OU Knowledge Representation & Reasoning, Computer Science Department EVALUATION: PERFORMANCE EXPERIMENTS
  • 16. 16 Lists are important! But how to assess the impact of their models?  6 common list patterns in RDF  List.MID: Real-world, MIDI-based RDF list benchmark data generator  List.MID: SPARQL queries for minimal and atomic list retrieval operations  First experiments and community interest Limitations/future work:  Limited set of list operations (e.g. rdf:List could win in e.g. addition)  Extensions to other list models  Recommendations for triplestore optimization implementations Knowledge Representation & Reasoning, Computer Science Department CONCLUSIONS
  • 17. 17 Questions, comments, suggestions most welcome @albertmeronyo @enridaga https://github.com/MIDI-LD/List.MID Knowledge Representation & Reasoning, Computer Science Department THANK YOU
  • 18. 18 • Motivation • Related Work • List Pattern Survey • List.MID Benchmark Data Generator • List.MID SPARQL Queries • Evaluation: Performance Experiments • Evaluation: Survey of Interest • Conclusions Knowledge Representation & Reasoning, Computer Science Department OUTLINE
  • 19. 19 Knowledge Representation & Reasoning, Computer Science Department
  • 20. 20 Use this slide to place an image to the left and text to the right.  With bullet > Secundary list To replace the image, right-click the image (click on it with your other mouse button), select Change picture… and choose the new image. Knowledge Representation & Reasoning, Computer Science Department
  • 21. 21 Knowledge Representation & Reasoning, Computer Science Department
  • 22. 22 Knowledge Representation & Reasoning, Computer Science Department
  • 23. 23
  • 24. 24