SlideShare una empresa de Scribd logo
1 de 24
Muhammad Saleem, Yasar Khan, Ali Hasnain, Ivan
Ermilov, Axel-Cyrille Ngonga Ngomo
ISWC 2016, Kobe, Japan, 20/10/2016
10/20/2016
1
Agile Knowledge Engineering and Semantic Web (AKSW), University of
Leipzig,
Germany
 SPARQL query federation
 Survey of federated SPARQL query processing systems
 Performance variables
 Evaluation of SPARQL endpoint federation systems
10/20/2016 2
 SPARQL Endpoint Federation (SEF)
 Linked Data Federation (LDF)
 Hybrid of SEF+LDF
10/20/2016 3
10/20/2016 4
SPARQL Endpoint Federation
S1 S2 S3 S4
RDF RDF RDF RDF
Parsing/Rewriting
Source Selection
Federator Optimzer
Integrator
Rewrite query
and get Individual
Triple Patterns
Identify capable
source against
Individual Triple
Patterns
Generate
optimized sub-
query Exe. Plan
Integrate sub-
queries results
Execute sub-
queries
 Index-free
 Using SPARQL ASK queries
 Potentially ensures result set completeness
 SPARQL ASK queries can be expensive
 SPARQL ASK cache is beneficial
 E.g., FedX
 Index-only
 Only make use of Index/data summaries
 Less efficient but fast source selection
 Result set completeness is not ensured
 E.g., DARQ, LHD
10/20/2016 5
 Hybrid
 Make use of index+SPARQL ASK
 Most efficient
 Result set completeness is not ensured
 SPARQL ASK cache is beneficial
 E.g., HiBISCuS, ANAPSID, SPLENDID
10/20/2016 6
 System Information
 Title and/or URL of federation engine
 Code available?
 Implementation and license
 Type of source selection
 Type of join(s) used for data integration
 Use of cache?
 Support for catalog/index update
10/20/2016 7
10/20/2016 8
10/20/2016 9
Yes
59%
No
12%
Not yet
29%
Code available?
Index-free
6%
Index-
only
29%
Hybrid
65%
Type of source selection
Yes
47%No
53%
Use of cache?
Yes
29%
No
59%
NA
12%
Index update support?
 Requirements
 Result completeness
 Policy-based query planning
 Support for partial results retrieval
 Support for no-blocking operator/ adaptive query processing
 Support for provenance information
 Query runtime estimation
 Duplicate Detection
 Top-K query processing
 Supported SPARQL types/ clauses
10/20/2016 10
10/20/2016 11
10/20/2016 12
Yes
18%
No
82%
Result completeness?
Yes
6%
No
88%
Partial
6%
Policy-based planning?
Yes
24%
No
76%
Partial results retrieval?
Yes
59%
No
41%
Adaptive processing?
10/20/2016 13
Yes
0%
No
94%
Partial
6%
Provenance info?
Yes
0%
No
100%
Runtime estimation?
Yes
12%
No
76%
Partial
12%
Duplicate detection?
No
100%
Top-k querying?
10/20/2016 14
10/20/2016 15
 Benchmarks
 FedBench
 Sliced FedBench
 SP2Bench
 SPARQL endpoint federation engines
 FedX
 SPLENDID
 LHD
 DAW
 ADERIS
 ANAPSID
10/20/2016 16
10/20/2016 17
FedBench
Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS
CD 252 44 0 0 43 0
LS 297 33 0 0 63 0
LD 369 19 0 0 37 0
Net 918 96 0 0 143 0
Query Category Sliced FedBench
CD 280 110 0 0 228 0
LS 330 100 0 0 143 0
LD 410 140 0 0 270 0
SP2Bench 660 180 0 0 265 0
Net 1680 530 0 0 906 0
10/20/2016 18
FedBench
Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS
CD 78 78 112 84 36 84
LS 56 56 105 77 44 70
LD 108 108 119 112 54 119
Net 242 242 336 283 134 273
Query Category Sliced FedBench
CD 182 182 225 195 163 195
LS 125 125 163 133 102 117
LD 243 243 324 324 183 324
SP2Bench 521 521 593 420 244 173
Net 1071 1071 1305 1072 692 809
10/20/2016 19
FedBench
Query Category FedX(cold) FedX(warm) SPLENDID LHD DARQ ANAPSID ADERIS
CD 151 7 256 6 7 186 6
LS 147 8 236 6 12 477 5
LD 139 8 221 6 7 804 5
Average 146 8 237 6 9 489 5
Query Category Sliced FedBench
CD 284 9 470 7 8 3183 7
LS 207 7 308 7 8 2897 7
LD 323 9 557 8 7 2908 8
SP2Bench 212 9 738 8 9RE 6
Average 256 8 518 7 8 2996 7
10/20/2016 20
21
FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ
25/25 17/22 13/22 15/24 16/22
Sliced FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ
29/36 17/24 17/24 17/26 12/20
22
0
50
100
150
200
250
300
350
400
450
500
CD LS LD Average
Averageexecutiontime(msec)
FedX (first run) FedBench
FedX (first run) SlicedBench
0
50
100
150
200
250
300
CD LS LD Average
Averageexecutiontime(msec)
FedX (cached) FedBench
FedX (cached) SlicedBench
0
200
400
600
800
1000
1200
1400
1600
CD LS LD Average
Averageexecutiontime(msec)
SPLENDID FedBench
SPLENDID SlicedBench
0
10000
20000
30000
40000
50000
60000
70000
80000
CD LS LD Average
Averageexecutiontime(msec)
ANAPSID FedBench
ANAPSID SlicedBench
0
50000
100000
150000
200000
250000
300000
350000
CD LS LD Average
Averageexecutiontime(msec)
DARQ FedBench
DARQ SlicedBench
0
2000
4000
6000
8000
10000
12000
14000
CD LS LD Average
Averageexecutiontime(msec)
LHD FedBench
LHD SlicedBench
23
0
50
100
150
200
250
300
350
400
450
500
CD LS LD Average
Averageexecutiontime(msec)
FedX (first run) FedBench
FedX (first run) SlicedBench
Decreased
by 214%
0
50
100
150
200
250
300
CD LS LD Average
Averageexecutiontime(msec)
FedX (cached) FedBench
FedX (cached) SlicedBench
Decreased
by 199%
0
200
400
600
800
1000
1200
1400
1600
CD LS LD Average
Averageexecutiontime(msec)
SPLENDID FedBench
SPLENDID SlicedBench
Decreased
by 227%
0
10000
20000
30000
40000
50000
60000
70000
80000
CD LS LD Average
Averageexecutiontime(msec)
ANAPSID FedBench
ANAPSID SlicedBench
Decreased
by 392%
0
50000
100000
150000
200000
250000
300000
350000
CD LS LD Average
Averageexecutiontime(msec)
DARQ FedBench
DARQ SlicedBench
Increased
by 36%
0
2000
4000
6000
8000
10000
12000
14000
CD LS LD Average
Averageexecutiontime(msec)
LHD FedBench
LHD SlicedBench
Decreased
by 293%
10/20/2016 24
{lastname}@informatik.uni-leipzig.de
AKSW, University of Leipzig, Germany

Más contenido relacionado

La actualidad más candente

SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)
andyseaborne
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
eswcsummerschool
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
Rinke Hoekstra
 

La actualidad más candente (20)

Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
 
Querying Linked Data with SPARQL
Querying Linked Data with SPARQLQuerying Linked Data with SPARQL
Querying Linked Data with SPARQL
 
SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)SPARQL 1.1 Update (2013-03-05)
SPARQL 1.1 Update (2013-03-05)
 
SPARQL-DL - Theory & Practice
SPARQL-DL - Theory & PracticeSPARQL-DL - Theory & Practice
SPARQL-DL - Theory & Practice
 
Sparql
SparqlSparql
Sparql
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
 
Linking the world with Python and Semantics
Linking the world with Python and SemanticsLinking the world with Python and Semantics
Linking the world with Python and Semantics
 
NoSQL and Triple Stores
NoSQL and Triple StoresNoSQL and Triple Stores
NoSQL and Triple Stores
 
An Introduction to SPARQL
An Introduction to SPARQLAn Introduction to SPARQL
An Introduction to SPARQL
 
SPARQL Cheat Sheet
SPARQL Cheat SheetSPARQL Cheat Sheet
SPARQL Cheat Sheet
 
Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02Mon norton tut_queryinglinkeddata02
Mon norton tut_queryinglinkeddata02
 
Semantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorialSemantic web meetup – sparql tutorial
Semantic web meetup – sparql tutorial
 
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
 
Data Integration And Visualization
Data Integration And VisualizationData Integration And Visualization
Data Integration And Visualization
 
Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)
 
Mapping Relational Databases to Linked Data
Mapping Relational Databases to Linked DataMapping Relational Databases to Linked Data
Mapping Relational Databases to Linked Data
 
Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
 
Triple Stores
Triple StoresTriple Stores
Triple Stores
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and Inference
 
HadoopFileFormats_2016
HadoopFileFormats_2016HadoopFileFormats_2016
HadoopFileFormats_2016
 

Similar a Fine-grained Evaluation of SPARQL Endpoint Federation Systems

Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
Leigh Dodds
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
Leigh Dodds
 
Review of some successes
Review of some successesReview of some successes
Review of some successes
Andrea Zaliani
 

Similar a Fine-grained Evaluation of SPARQL Endpoint Federation Systems (20)

Is IPv6 Really Faster?
Is IPv6 Really Faster?Is IPv6 Really Faster?
Is IPv6 Really Faster?
 
Net app
Net appNet app
Net app
 
More Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF SourcesMore Complete Resultset Retrieval from Large Heterogeneous RDF Sources
More Complete Resultset Retrieval from Large Heterogeneous RDF Sources
 
IPv6 Performance
IPv6 PerformanceIPv6 Performance
IPv6 Performance
 
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
Heiner Oberkampf: Semantics for Integrated Analytical Laboratory Processes – ...
 
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
Semantics for Integrated Analytical Laboratory Processes – the Allotrope Pers...
 
Eswc2018 wimu slides
Eswc2018 wimu slidesEswc2018 wimu slides
Eswc2018 wimu slides
 
Open source programming
Open source programmingOpen source programming
Open source programming
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
 
Creating APIs over RDF
Creating APIs over RDFCreating APIs over RDF
Creating APIs over RDF
 
Compile ahead of time. It's fine?
Compile ahead of time. It's fine?Compile ahead of time. It's fine?
Compile ahead of time. It's fine?
 
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Informationballoon Fusion: SPARQL Rewriting Based on  Unified Co-Reference Information
balloon Fusion: SPARQL Rewriting Based on Unified Co-Reference Information
 
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Tr...
 
Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2Side by Side with Elasticsearch & Solr, Part 2
Side by Side with Elasticsearch & Solr, Part 2
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Retour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenantRetour d'expérience d'un environnement base de données multitenant
Retour d'expérience d'un environnement base de données multitenant
 
Review of some successes
Review of some successesReview of some successes
Review of some successes
 
Introduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queriesIntroduction to metadata cleansing using SPARQL update queries
Introduction to metadata cleansing using SPARQL update queries
 
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
How Representative Is a SPARQL Benchmark? An Analysis of RDF Triplestore Benc...
 
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
RDFUnit - Test-Driven Linked Data quality Assessment (WWW2014)
 

Más de Muhammad Saleem

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
Muhammad Saleem
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework
Muhammad Saleem
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Muhammad Saleem
 

Más de Muhammad Saleem (11)

QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
QaldGen: Towards Microbenchmarking of Question Answering Systems Over Knowled...
 
LargeRDFBench
LargeRDFBenchLargeRDFBench
LargeRDFBench
 
Extended LargeRDFBench
Extended LargeRDFBenchExtended LargeRDFBench
Extended LargeRDFBench
 
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint FederationCostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
 
SQCFramework: SPARQL Query containment Benchmark Generation Framework
SQCFramework: SPARQL Query containment  Benchmark Generation Framework SQCFramework: SPARQL Query containment  Benchmark Generation Framework
SQCFramework: SPARQL Query containment Benchmark Generation Framework
 
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
Question Answering Over Linked Data: What is Difficult to Answer? What Affect...
 
LSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries DatasetLSQ: The Linked SPARQL Queries Dataset
LSQ: The Linked SPARQL Queries Dataset
 
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data CubesSAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
SAFE: Policy Aware SPARQL Query Federation Over RDF Data Cubes
 
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of DataDAW: Duplicate-AWare Federated Query Processing over the Web of Data
DAW: Duplicate-AWare Federated Query Processing over the Web of Data
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
 
Linked Cancer Genome Atlas Database
Linked Cancer Genome Atlas DatabaseLinked Cancer Genome Atlas Database
Linked Cancer Genome Atlas Database
 

Último

Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
HenryBriggs2
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
Health
 

Último (20)

Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
Hazard Identification (HAZID) vs. Hazard and Operability (HAZOP): A Comparati...
 
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
scipt v1.pptxcxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Learn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic MarksLearn the concepts of Thermodynamics on Magic Marks
Learn the concepts of Thermodynamics on Magic Marks
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
+97470301568>> buy weed in qatar,buy thc oil qatar,buy weed and vape oil in d...
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 

Fine-grained Evaluation of SPARQL Endpoint Federation Systems

  • 1. Muhammad Saleem, Yasar Khan, Ali Hasnain, Ivan Ermilov, Axel-Cyrille Ngonga Ngomo ISWC 2016, Kobe, Japan, 20/10/2016 10/20/2016 1 Agile Knowledge Engineering and Semantic Web (AKSW), University of Leipzig, Germany
  • 2.  SPARQL query federation  Survey of federated SPARQL query processing systems  Performance variables  Evaluation of SPARQL endpoint federation systems 10/20/2016 2
  • 3.  SPARQL Endpoint Federation (SEF)  Linked Data Federation (LDF)  Hybrid of SEF+LDF 10/20/2016 3
  • 4. 10/20/2016 4 SPARQL Endpoint Federation S1 S2 S3 S4 RDF RDF RDF RDF Parsing/Rewriting Source Selection Federator Optimzer Integrator Rewrite query and get Individual Triple Patterns Identify capable source against Individual Triple Patterns Generate optimized sub- query Exe. Plan Integrate sub- queries results Execute sub- queries
  • 5.  Index-free  Using SPARQL ASK queries  Potentially ensures result set completeness  SPARQL ASK queries can be expensive  SPARQL ASK cache is beneficial  E.g., FedX  Index-only  Only make use of Index/data summaries  Less efficient but fast source selection  Result set completeness is not ensured  E.g., DARQ, LHD 10/20/2016 5
  • 6.  Hybrid  Make use of index+SPARQL ASK  Most efficient  Result set completeness is not ensured  SPARQL ASK cache is beneficial  E.g., HiBISCuS, ANAPSID, SPLENDID 10/20/2016 6
  • 7.  System Information  Title and/or URL of federation engine  Code available?  Implementation and license  Type of source selection  Type of join(s) used for data integration  Use of cache?  Support for catalog/index update 10/20/2016 7
  • 9. 10/20/2016 9 Yes 59% No 12% Not yet 29% Code available? Index-free 6% Index- only 29% Hybrid 65% Type of source selection Yes 47%No 53% Use of cache? Yes 29% No 59% NA 12% Index update support?
  • 10.  Requirements  Result completeness  Policy-based query planning  Support for partial results retrieval  Support for no-blocking operator/ adaptive query processing  Support for provenance information  Query runtime estimation  Duplicate Detection  Top-K query processing  Supported SPARQL types/ clauses 10/20/2016 10
  • 12. 10/20/2016 12 Yes 18% No 82% Result completeness? Yes 6% No 88% Partial 6% Policy-based planning? Yes 24% No 76% Partial results retrieval? Yes 59% No 41% Adaptive processing?
  • 13. 10/20/2016 13 Yes 0% No 94% Partial 6% Provenance info? Yes 0% No 100% Runtime estimation? Yes 12% No 76% Partial 12% Duplicate detection? No 100% Top-k querying?
  • 16.  Benchmarks  FedBench  Sliced FedBench  SP2Bench  SPARQL endpoint federation engines  FedX  SPLENDID  LHD  DAW  ADERIS  ANAPSID 10/20/2016 16
  • 17. 10/20/2016 17 FedBench Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS CD 252 44 0 0 43 0 LS 297 33 0 0 63 0 LD 369 19 0 0 37 0 Net 918 96 0 0 143 0 Query Category Sliced FedBench CD 280 110 0 0 228 0 LS 330 100 0 0 143 0 LD 410 140 0 0 270 0 SP2Bench 660 180 0 0 265 0 Net 1680 530 0 0 906 0
  • 18. 10/20/2016 18 FedBench Query Category FedX SPLENDID LHD DARQ ANAPSID ADERIS CD 78 78 112 84 36 84 LS 56 56 105 77 44 70 LD 108 108 119 112 54 119 Net 242 242 336 283 134 273 Query Category Sliced FedBench CD 182 182 225 195 163 195 LS 125 125 163 133 102 117 LD 243 243 324 324 183 324 SP2Bench 521 521 593 420 244 173 Net 1071 1071 1305 1072 692 809
  • 19. 10/20/2016 19 FedBench Query Category FedX(cold) FedX(warm) SPLENDID LHD DARQ ANAPSID ADERIS CD 151 7 256 6 7 186 6 LS 147 8 236 6 12 477 5 LD 139 8 221 6 7 804 5 Average 146 8 237 6 9 489 5 Query Category Sliced FedBench CD 284 9 470 7 8 3183 7 LS 207 7 308 7 8 2897 7 LD 323 9 557 8 7 2908 8 SP2Bench 212 9 738 8 9RE 6 Average 256 8 518 7 8 2996 7
  • 21. 21 FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ 25/25 17/22 13/22 15/24 16/22 Sliced FedBench: FedX(warm)  FedX(cold)  LHD  SPLENDID  ANAPSID  DARQ 29/36 17/24 17/24 17/26 12/20
  • 22. 22 0 50 100 150 200 250 300 350 400 450 500 CD LS LD Average Averageexecutiontime(msec) FedX (first run) FedBench FedX (first run) SlicedBench 0 50 100 150 200 250 300 CD LS LD Average Averageexecutiontime(msec) FedX (cached) FedBench FedX (cached) SlicedBench 0 200 400 600 800 1000 1200 1400 1600 CD LS LD Average Averageexecutiontime(msec) SPLENDID FedBench SPLENDID SlicedBench 0 10000 20000 30000 40000 50000 60000 70000 80000 CD LS LD Average Averageexecutiontime(msec) ANAPSID FedBench ANAPSID SlicedBench 0 50000 100000 150000 200000 250000 300000 350000 CD LS LD Average Averageexecutiontime(msec) DARQ FedBench DARQ SlicedBench 0 2000 4000 6000 8000 10000 12000 14000 CD LS LD Average Averageexecutiontime(msec) LHD FedBench LHD SlicedBench
  • 23. 23 0 50 100 150 200 250 300 350 400 450 500 CD LS LD Average Averageexecutiontime(msec) FedX (first run) FedBench FedX (first run) SlicedBench Decreased by 214% 0 50 100 150 200 250 300 CD LS LD Average Averageexecutiontime(msec) FedX (cached) FedBench FedX (cached) SlicedBench Decreased by 199% 0 200 400 600 800 1000 1200 1400 1600 CD LS LD Average Averageexecutiontime(msec) SPLENDID FedBench SPLENDID SlicedBench Decreased by 227% 0 10000 20000 30000 40000 50000 60000 70000 80000 CD LS LD Average Averageexecutiontime(msec) ANAPSID FedBench ANAPSID SlicedBench Decreased by 392% 0 50000 100000 150000 200000 250000 300000 350000 CD LS LD Average Averageexecutiontime(msec) DARQ FedBench DARQ SlicedBench Increased by 36% 0 2000 4000 6000 8000 10000 12000 14000 CD LS LD Average Averageexecutiontime(msec) LHD FedBench LHD SlicedBench Decreased by 293%