Fine-grained Evaluation of SPARQL Endpoint Federation Systems
1. Muhammad Saleem, Yasar Khan, Ali Hasnain, Ivan
Ermilov, Axel-Cyrille Ngonga Ngomo
ISWC 2016, Kobe, Japan, 20/10/2016
10/20/2016
1
Agile Knowledge Engineering and Semantic Web (AKSW), University of
Leipzig,
Germany
2. SPARQL query federation
Survey of federated SPARQL query processing systems
Performance variables
Evaluation of SPARQL endpoint federation systems
10/20/2016 2
3. SPARQL Endpoint Federation (SEF)
Linked Data Federation (LDF)
Hybrid of SEF+LDF
10/20/2016 3
5. Index-free
Using SPARQL ASK queries
Potentially ensures result set completeness
SPARQL ASK queries can be expensive
SPARQL ASK cache is beneficial
E.g., FedX
Index-only
Only make use of Index/data summaries
Less efficient but fast source selection
Result set completeness is not ensured
E.g., DARQ, LHD
10/20/2016 5
6. Hybrid
Make use of index+SPARQL ASK
Most efficient
Result set completeness is not ensured
SPARQL ASK cache is beneficial
E.g., HiBISCuS, ANAPSID, SPLENDID
10/20/2016 6
7. System Information
Title and/or URL of federation engine
Code available?
Implementation and license
Type of source selection
Type of join(s) used for data integration
Use of cache?
Support for catalog/index update
10/20/2016 7
9. 10/20/2016 9
Yes
59%
No
12%
Not yet
29%
Code available?
Index-free
6%
Index-
only
29%
Hybrid
65%
Type of source selection
Yes
47%No
53%
Use of cache?
Yes
29%
No
59%
NA
12%
Index update support?
10. Requirements
Result completeness
Policy-based query planning
Support for partial results retrieval
Support for no-blocking operator/ adaptive query processing
Support for provenance information
Query runtime estimation
Duplicate Detection
Top-K query processing
Supported SPARQL types/ clauses
10/20/2016 10