SlideShare una empresa de Scribd logo
1 de 10
Descargar para leer sin conexión
David Chaves-Fraga, Ontology Engineering Group
Universidad Politécnica de Madrid, Spain
Claudio Gutierrez, CIWS & Universidad de Chile

Oscar Corcho, OEG - UPM

On the Role of the GRAPH
Clause in the Performance
of Federated SPARQL
Queries
dchaves@email

@dchavesf
22/10/2017

PROFILES17
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Federated Query and SPARQL endpoint
2
Improve federated query evaluation by systematically using the
GRAPH clause to rewrite a query into a semantically equivalent one
G0
G1 G2
G5
G3
G4
<URI>
<URI>
<URI> <URI>
<URI>
SPARQL endpoint
select * where {
service <s1> {GP1}
service <s2> {GP2}
service <s3> {GP3}
}
Federated Query
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Motivation
3
•Many graph patterns bind results from only one of the graphs in the
SPARQL endpoints
•The server can have more information and optimize the retrieval
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Reducing the space of a federated SPARQL query
4
•Hypothesis: This method adds efficiency in the evaluation of the query
•Assumption: The identification of the GP
select * where {
service <s1> {GP1}
service <s2> {GP2}
service <s3> {GP3}
}
select * from named <s1-g2>, <s2-g3>, <s3-g1>{
service <s1> {graph <s1-g2> {GP1}}
service <s2> {graph <s2-g3> {GP2}}
service <s3> {graph <s3-g1> {GP3}}
}
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Queries and experiment
5
•We use the real public instances of SPARQL endpoints of:
• DBpedia
• KB3city
• Linkedmdb
•5 queries - two different approaches (with/without GRAPH clause):
• 5 executions per approach with the disabled cache.
•Queries available at: https://github.com/dachafra/federated-graph-queries
Problems:
•There is not exist any benchmarking for federated SPARQL queries
that includes SERVICE and GRAPH clauses.
•Deal with availability and real performance of the SPARQL endpoints
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Preliminary results
6
Q1 Q2 Q3 Q4 Q5
GRAPH 13,51 3,22 1,98 1,73 8,56
SERVICE 14,63 4,36 2,88 1,69 9,12
•5%-10% improvement
•Q2-Q3-Q4: kb.3city.com SPARQL endpoint
• 73 graphs
• Almost 5 millions of triples
•Q1-Q5: dbpedia.com SPARQL endpoint:
• 19 graphs
• 285 millions of triples
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Preliminary results
6
Q1 Q2 Q3 Q4 Q5
GRAPH 13,51 3,22 1,98 1,73 8,56
SERVICE 14,63 4,36 2,88 1,69 9,12
•5%-10% improvement
•Q2-Q3-Q4: kb.3city.com SPARQL endpoint
• 73 graphs
• Almost 5 millions of triples
•Q1-Q5: dbpedia.com SPARQL endpoint:
• 19 graphs
• 285 millions of triples
Virtuoso puts the triples ina single table
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Conclusions: The role of the GRAPH clause
7
Conclusions about GRAPH clause:
1. is semantically ambiguous
2. offers a view of how the data
is conceptually ordered but
not physically
3. is a SPARQL clause that
seems to not allow any type of
optimization (at the moment)
DBpedia graphs and triples
On the Role of GRAPH Clause in the Performance of Federated SPARQL queries
Future work
8
•Create a Benchmarking for federated queries with SERVICE and
GRAPH
•Apply the approach to other triple stores
•Carry out semantic equivalence between open data portal and a
SPARQL endpoint
•Link unique graph patterns with their pertained graphs
•Analyze why Virtuoso and Apache Jena can execute the approach
David Chaves-Fraga, Ontology Engineering Group
Universidad Politécnica de Madrid, Spain
Claudio Gutierrez, CIWS & Universidad de Chile

Oscar Corcho, OEG - UPM

On the Role of the GRAPH
Clause in the Performance
of Federated SPARQL
Queries
dchaves@email

@dchavesf
22/10/2017

PROFILES17

Más contenido relacionado

La actualidad más candente

La actualidad más candente (13)

IMDb Data Integration
IMDb Data IntegrationIMDb Data Integration
IMDb Data Integration
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
 
Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0Towards a comprehensive call ontology for research 2.0
Towards a comprehensive call ontology for research 2.0
 
The R Ecosystem
The R EcosystemThe R Ecosystem
The R Ecosystem
 
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
Begin at the beginning: Feature selection for Big Data by Amparo Alonso at Bi...
 
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
Everything You Always Wanted to Know About Kafka’s Rebalance Protocol but Wer...
 
Processing genetic data at scale
Processing genetic data at scaleProcessing genetic data at scale
Processing genetic data at scale
 
6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial Code6 Lowpan Rpl Tutorial Code
6 Lowpan Rpl Tutorial Code
 
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQLPGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
PGConf APAC 2018 - Lightening Talk #3: How To Contribute to PostgreSQL
 
MLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsMLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning Sessions
 
qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...qCube: Efficient integration of range query operators over a high dimension d...
qCube: Efficient integration of range query operators over a high dimension d...
 
MLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRIMLSD18. Basic Transformations - QCRI
MLSD18. Basic Transformations - QCRI
 
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkReal-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
 

Similar a On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries

final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
Ankit Rathi
 

Similar a On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries (20)

WBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBenchWBDB 2015 Performance Evaluation of Spark SQL using BigBench
WBDB 2015 Performance Evaluation of Spark SQL using BigBench
 
23003471225PPT.pptx
23003471225PPT.pptx23003471225PPT.pptx
23003471225PPT.pptx
 
Scaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with DgraphScaling your GraphQL applications with Dgraph
Scaling your GraphQL applications with Dgraph
 
GraphQL
GraphQLGraphQL
GraphQL
 
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result VisualizationRethinking Online SPARQL Querying to Support Incremental Result Visualization
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
 
Recent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced AnalyticsRecent Developments In SparkR For Advanced Analytics
Recent Developments In SparkR For Advanced Analytics
 
How easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performanceHow easy (or hard) it is to monitor your graph ql service performance
How easy (or hard) it is to monitor your graph ql service performance
 
final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)final_copy_camera_ready_paper (7)
final_copy_camera_ready_paper (7)
 
GraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & ContentfulGraphQL Schema Stitching with Prisma & Contentful
GraphQL Schema Stitching with Prisma & Contentful
 
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...What could go wrong  with a GraphQL query and can OpenTelemetry help? KubeCon...
What could go wrong with a GraphQL query and can OpenTelemetry help? KubeCon...
 
GPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a ServiceGPORCA: Query Optimization as a Service
GPORCA: Query Optimization as a Service
 
GraphQL + relay
GraphQL + relayGraphQL + relay
GraphQL + relay
 
GraphQL-ify your APIs
GraphQL-ify your APIsGraphQL-ify your APIs
GraphQL-ify your APIs
 
Graphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docxGraphql for Frontend Developers Simplifying Data Fetching.docx
Graphql for Frontend Developers Simplifying Data Fetching.docx
 
GraphQL Fundamentals
GraphQL FundamentalsGraphQL Fundamentals
GraphQL Fundamentals
 
Geographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF StoresGeographica: A Benchmark for Geospatial RDF Stores
Geographica: A Benchmark for Geospatial RDF Stores
 
Sempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on HadoopSempala - Interactive SPARQL Query Processing on Hadoop
Sempala - Interactive SPARQL Query Processing on Hadoop
 
Training Week: GraphQL 2022
Training Week: GraphQL 2022Training Week: GraphQL 2022
Training Week: GraphQL 2022
 
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademyGraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
GraphQL - Piękne API w Twojej Aplikacji - KrakowGraphAcademy
 
CONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQLCONDG April 23 2020 - Baskar Rao - GraphQL
CONDG April 23 2020 - Baskar Rao - GraphQL
 

Más de David Chaves-Fraga

Más de David Chaves-Fraga (6)

Enlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte públicoEnlazando datos abiertos en el transporte público
Enlazando datos abiertos en el transporte público
 
Open Summer of Code Spain 2018
Open Summer of Code Spain 2018Open Summer of Code Spain 2018
Open Summer of Code Spain 2018
 
The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...The tripscore Linked Data client: calculating specific summaries over large t...
The tripscore Linked Data client: calculating specific summaries over large t...
 
Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)Tripscore - DataMad2017 (Madrid)
Tripscore - DataMad2017 (Madrid)
 
TransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab BrusselsTransportDCAT-AP and PhD Thesis at Civic Lab Brussels
TransportDCAT-AP and PhD Thesis at Civic Lab Brussels
 
Semantic interoperability in Transport Domain
Semantic interoperability in Transport DomainSemantic interoperability in Transport Domain
Semantic interoperability in Transport Domain
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries

  • 1. David Chaves-Fraga, Ontology Engineering Group Universidad Politécnica de Madrid, Spain Claudio Gutierrez, CIWS & Universidad de Chile Oscar Corcho, OEG - UPM On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries dchaves@email @dchavesf 22/10/2017 PROFILES17
  • 2. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Federated Query and SPARQL endpoint 2 Improve federated query evaluation by systematically using the GRAPH clause to rewrite a query into a semantically equivalent one G0 G1 G2 G5 G3 G4 <URI> <URI> <URI> <URI> <URI> SPARQL endpoint select * where { service <s1> {GP1} service <s2> {GP2} service <s3> {GP3} } Federated Query
  • 3. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Motivation 3 •Many graph patterns bind results from only one of the graphs in the SPARQL endpoints •The server can have more information and optimize the retrieval
  • 4. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Reducing the space of a federated SPARQL query 4 •Hypothesis: This method adds efficiency in the evaluation of the query •Assumption: The identification of the GP select * where { service <s1> {GP1} service <s2> {GP2} service <s3> {GP3} } select * from named <s1-g2>, <s2-g3>, <s3-g1>{ service <s1> {graph <s1-g2> {GP1}} service <s2> {graph <s2-g3> {GP2}} service <s3> {graph <s3-g1> {GP3}} }
  • 5. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Queries and experiment 5 •We use the real public instances of SPARQL endpoints of: • DBpedia • KB3city • Linkedmdb •5 queries - two different approaches (with/without GRAPH clause): • 5 executions per approach with the disabled cache. •Queries available at: https://github.com/dachafra/federated-graph-queries Problems: •There is not exist any benchmarking for federated SPARQL queries that includes SERVICE and GRAPH clauses. •Deal with availability and real performance of the SPARQL endpoints
  • 6. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Preliminary results 6 Q1 Q2 Q3 Q4 Q5 GRAPH 13,51 3,22 1,98 1,73 8,56 SERVICE 14,63 4,36 2,88 1,69 9,12 •5%-10% improvement •Q2-Q3-Q4: kb.3city.com SPARQL endpoint • 73 graphs • Almost 5 millions of triples •Q1-Q5: dbpedia.com SPARQL endpoint: • 19 graphs • 285 millions of triples
  • 7. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Preliminary results 6 Q1 Q2 Q3 Q4 Q5 GRAPH 13,51 3,22 1,98 1,73 8,56 SERVICE 14,63 4,36 2,88 1,69 9,12 •5%-10% improvement •Q2-Q3-Q4: kb.3city.com SPARQL endpoint • 73 graphs • Almost 5 millions of triples •Q1-Q5: dbpedia.com SPARQL endpoint: • 19 graphs • 285 millions of triples Virtuoso puts the triples ina single table
  • 8. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Conclusions: The role of the GRAPH clause 7 Conclusions about GRAPH clause: 1. is semantically ambiguous 2. offers a view of how the data is conceptually ordered but not physically 3. is a SPARQL clause that seems to not allow any type of optimization (at the moment) DBpedia graphs and triples
  • 9. On the Role of GRAPH Clause in the Performance of Federated SPARQL queries Future work 8 •Create a Benchmarking for federated queries with SERVICE and GRAPH •Apply the approach to other triple stores •Carry out semantic equivalence between open data portal and a SPARQL endpoint •Link unique graph patterns with their pertained graphs •Analyze why Virtuoso and Apache Jena can execute the approach
  • 10. David Chaves-Fraga, Ontology Engineering Group Universidad Politécnica de Madrid, Spain Claudio Gutierrez, CIWS & Universidad de Chile Oscar Corcho, OEG - UPM On the Role of the GRAPH Clause in the Performance of Federated SPARQL Queries dchaves@email @dchavesf 22/10/2017 PROFILES17