SlideShare una empresa de Scribd logo
1 de 27
Descargar para leer sin conexión
Politecnico di Milano, DEIB
Portoroz - 2017 - Riccardo Tommasini (Politecnico di Milano)
Riccardo Tommasini, Marco Balduini, Emanuele Della Valle
{name.surname}@polimi.it
Towards a Benchmark for
Expressive Stream Reasoning
1
ESWCEyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
SR State-of-the-art Timeline (Qualitative)
2
time
(2008) (2010) (2011) (2015)
CityBench
(2016)
YABench
(2012)
LSBench
SRBench
CSRBench
EP-SPARQL
C-SPARQL
SparkWave
MorphStream
CQELS
SKB
INSTANS
(2013)
RSP Engine
Benchmark
RDFox
trOWL
Stream Reasoner
DyamiTE
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 3
Benchmark Ontology Streams Queries Metrics Reasoning
SR Bench IoT
RDF/
Historical
✔ QL Feature X
LS Bench
Social
Media/IoT
Generate
d
✔ Max Throughput SubClassOf
CSRBench IoT x
Parametri
c
Correctness X
CityBench IoT CSV/Real ✔
Query latency,
Memory
consumption,
completeness
X
YABench IoT
Generate
d
✔ Correctness X
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 4
Stream
Reasoner
Entailmen
t
Berlin
SPARQL
LUBM UOBM DBPedia Spire Galen
SKB RDFS ✔
EP-SPARQL RDFS ✔ ✔
trOWL EL+
DynamiTE RDFS ✔
SparkWave RDFS ✔
RDFox OWL 2 RL ✔ ✔ ✔
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
ESR Benchmarking
5
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Design Principles
6
[P.1] TBox of moderate size yet of scalable complexity.
[P.2] Continuous reasoning tasks.
[P.3] Arbitrary scaling of static and streaming data.
[P.4] Usage of continuous queries.
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
SR Experiment
7
Is a TUPLES <R,E,T,D,S,Q,K>, where
R is a stream reasoner;

E is an entailment regime to test;

T a a static TBox

S a streaming ABox;

Q a set of continuous reasoning tasks under E and; 

K is a set of KPIs to measure.
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
LASS 1.0
8
LASS is a first attempt to make an benchmark for ESR. It comprises:
L1O, an OWL 2 RL ontology about the Social Media Influence.
L1C, a set of reasoning tasks to test the engine capabilities.
L1G, a data generation algorithm and its implementation.
Building Blocks
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
L1O 1.0
9
Extends SIOC Core Vocabulary for online Community with:
Actions/Reaction models users’ interaction within a community
Influence models users’ influence roles within a community.
Content models what characterises posts and discussions.
L1O’s modules have different velocities, 

i.e. class instances change at different rates.
The Ontology
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
L1C 1.0
10
The Reasoning Tasks
Reasoning Task
Class
Subsumption
Role
Subsumption
Transitive Inverse Realization
Tag Containment ✔
Post Popularity ✔
User Activity ✔ ✔
User Participation ✔ ✔
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
L1G 1.0
11
L1G exploits these ordering relations to instantiates class individuals.
It generates starting from the “slowest” classes (e.g. Discussion) to the “fastest”
ones (e.g. MicroPost)
L1G does not generate classes from the Influence Module, but it ensures that they
can be deducted.
The data generation algorithm
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Evaluation
12
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
What is the goal of a
benchmark?
13
ESWCEyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
The goal of a domain specific benchmark 

is to foster technological progress 

by guaranteeing a fair assessment.


14
”
“
- Jim Gray, The Benchmark Handbook
for Database and Transaction Systems, 1993
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 15
Benchmark
Approach X
1.5m
Measure
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 16
Benchmark
Approach Y
2m
Measure
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 17
-Body Level One
-Body Level Two
-Body Level Three
-Body Level Four
-Body Level Five
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 18
•  accelerate'progress,'make'technology'viable'
Ying'Zhang,2Peter2Boncz2–'Benchmarking'Linked'Open'Data'Technology'
©'Jim'Gray,'2005'
ESWCEyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Observations
19
More a benchmark challenges an approach more is effective.
The benchmark characterises the problem space.
The benchmark provides a measurable view over the related solution
space.
ESWCEyE
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Role of Baselines
20
A baseline defines the lower bound of the solution space.
They show the feasibility of the problem.
They avoid one-2-one competition by defining a cross reference for the
comparison.
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Benchmarking is a
research problem
21
How to explore the solution space?
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Benchmark Design
22
- What are the guiding principles / requirements?
- What does constitute a benchmark?
- What is the experimental methodology?
- What are the the baselines?
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Benchmark Quality
23
- Is the benchmark compliant to the requirements/principles?
- Is the benchmark used?
- What are the benchmark limitations? (KPIs, TestDriver)
- Does the benchmark distinguish the compared systems (blind)?
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Conclusion & Lass Evaluation
24
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Atomic LASS Evaluation
25
- We evaluate L1O showing it is compliant to Gruber’s ontology design principles.
- We formulate some continuous queries involving the L1C reasoning tasks.
- We implemented L1G by extending LUBM data generator.
ESWC
Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano
Holistic LASS Evaluation
26
We evaluate LASS against the following principles by Jim Gray (G) and
Karl Huppler (H).



A benchmark must be

[G.1] Simple, [G.2] Portable, [G.3] Scalable, [G.4] Relevant and 

[H.1] formally Verifiable.
Portoroz - 2017 - Riccardo Tommasini (Politecnico di Milano)
Questions?
Email: riccardo.tommasini@polimi.it

Twitter: @rictomm
Github: riccardotommasini
Web: streamreasoning.org
27
Joseph Wright of Derby, An Experiment on a Bird in the Air Pump, 1768. 

The National Gallery, London

Más contenido relacionado

Similar a Towards a Benchmark for Expressive Stream Reasoning

Estimating Packet Loss Rate in the Access Through Application-Level Measurements
Estimating Packet Loss Rate in the Access Through Application-Level MeasurementsEstimating Packet Loss Rate in the Access Through Application-Level Measurements
Estimating Packet Loss Rate in the Access Through Application-Level MeasurementsSimone Basso
 
COSMOS: DevOps for Complex Cyber-physical Systems
COSMOS: DevOps for Complex Cyber-physical SystemsCOSMOS: DevOps for Complex Cyber-physical Systems
COSMOS: DevOps for Complex Cyber-physical SystemsSebastiano Panichella
 
IoT Reference Architectures
IoT Reference ArchitecturesIoT Reference Architectures
IoT Reference ArchitecturesBob Marcus
 
Compliance driven process development with DCR graphs
Compliance driven process development with DCR graphsCompliance driven process development with DCR graphs
Compliance driven process development with DCR graphsHugo Andrés López
 
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...OW2
 
Heaven: A Framework for Systematic Comparative Research Approach for RSP Engines
Heaven: A Framework for Systematic Comparative Research Approach for RSP EnginesHeaven: A Framework for Systematic Comparative Research Approach for RSP Engines
Heaven: A Framework for Systematic Comparative Research Approach for RSP EnginesRiccardo Tommasini
 
Weekly update @ 10.05.2016
Weekly update @ 10.05.2016Weekly update @ 10.05.2016
Weekly update @ 10.05.2016HAMSproject
 
Mining Correlations of ATL Transformation and Metamodel Metrics
Mining Correlations of ATL Transformation and Metamodel MetricsMining Correlations of ATL Transformation and Metamodel Metrics
Mining Correlations of ATL Transformation and Metamodel Metrics Davide Ruscio
 
Cyber-physical systems Industrial applications in the CPSwarm Project
Cyber-physical systems Industrial applications in the CPSwarm ProjectCyber-physical systems Industrial applications in the CPSwarm Project
Cyber-physical systems Industrial applications in the CPSwarm ProjectAlessandra Bagnato
 
TCP1P.net Meetup Vision, Objectives and Roadmap
TCP1P.net Meetup Vision, Objectives and RoadmapTCP1P.net Meetup Vision, Objectives and Roadmap
TCP1P.net Meetup Vision, Objectives and RoadmapStefan Ianta
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsHong-Linh Truong
 
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm Project
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm ProjectDesigning Swarms of Cyber-Physical Systems: The H2020 CPSwarm Project
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm ProjectAlessandra Bagnato
 
Behavioural Rules In Multi Agent Systems Max
Behavioural Rules In Multi Agent Systems MaxBehavioural Rules In Multi Agent Systems Max
Behavioural Rules In Multi Agent Systems MaxBeniamino Murgante
 
Enabling combined Software and Data engineering at Web-scale
Enabling combined Software and Data engineering at Web-scaleEnabling combined Software and Data engineering at Web-scale
Enabling combined Software and Data engineering at Web-scaleMonika Solanki
 
Monitoring and Operational Data Analytics from a User Perspective at First Eu...
Monitoring and Operational Data Analytics from a User Perspective at First Eu...Monitoring and Operational Data Analytics from a User Perspective at First Eu...
Monitoring and Operational Data Analytics from a User Perspective at First Eu...University of Maribor
 
Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)University of Maribor
 
Ethics in Technology – Example of RIPE Atlas
 Ethics in Technology – Example of RIPE Atlas Ethics in Technology – Example of RIPE Atlas
Ethics in Technology – Example of RIPE AtlasRIPE NCC
 

Similar a Towards a Benchmark for Expressive Stream Reasoning (20)

Estimating Packet Loss Rate in the Access Through Application-Level Measurements
Estimating Packet Loss Rate in the Access Through Application-Level MeasurementsEstimating Packet Loss Rate in the Access Through Application-Level Measurements
Estimating Packet Loss Rate in the Access Through Application-Level Measurements
 
Estimote Beacon presentation
Estimote Beacon presentationEstimote Beacon presentation
Estimote Beacon presentation
 
COSMOS: DevOps for Complex Cyber-physical Systems
COSMOS: DevOps for Complex Cyber-physical SystemsCOSMOS: DevOps for Complex Cyber-physical Systems
COSMOS: DevOps for Complex Cyber-physical Systems
 
IoT Reference Architectures
IoT Reference ArchitecturesIoT Reference Architectures
IoT Reference Architectures
 
Compliance driven process development with DCR graphs
Compliance driven process development with DCR graphsCompliance driven process development with DCR graphs
Compliance driven process development with DCR graphs
 
Online Tv Music Channel
Online Tv Music ChannelOnline Tv Music Channel
Online Tv Music Channel
 
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...
From TRL to MRL: Assessing Open Source Project Market Readiness, Cédric Thoma...
 
Heaven: A Framework for Systematic Comparative Research Approach for RSP Engines
Heaven: A Framework for Systematic Comparative Research Approach for RSP EnginesHeaven: A Framework for Systematic Comparative Research Approach for RSP Engines
Heaven: A Framework for Systematic Comparative Research Approach for RSP Engines
 
Weekly update @ 10.05.2016
Weekly update @ 10.05.2016Weekly update @ 10.05.2016
Weekly update @ 10.05.2016
 
Mining Correlations of ATL Transformation and Metamodel Metrics
Mining Correlations of ATL Transformation and Metamodel MetricsMining Correlations of ATL Transformation and Metamodel Metrics
Mining Correlations of ATL Transformation and Metamodel Metrics
 
Cyber-physical systems Industrial applications in the CPSwarm Project
Cyber-physical systems Industrial applications in the CPSwarm ProjectCyber-physical systems Industrial applications in the CPSwarm Project
Cyber-physical systems Industrial applications in the CPSwarm Project
 
TCP1P.net Meetup Vision, Objectives and Roadmap
TCP1P.net Meetup Vision, Objectives and RoadmapTCP1P.net Meetup Vision, Objectives and Roadmap
TCP1P.net Meetup Vision, Objectives and Roadmap
 
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and CloudsManaging and Testing Ensembles of IoT, Network functions, and Clouds
Managing and Testing Ensembles of IoT, Network functions, and Clouds
 
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm Project
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm ProjectDesigning Swarms of Cyber-Physical Systems: The H2020 CPSwarm Project
Designing Swarms of Cyber-Physical Systems: The H2020 CPSwarm Project
 
Behavioural Rules In Multi Agent Systems Max
Behavioural Rules In Multi Agent Systems MaxBehavioural Rules In Multi Agent Systems Max
Behavioural Rules In Multi Agent Systems Max
 
Enabling combined Software and Data engineering at Web-scale
Enabling combined Software and Data engineering at Web-scaleEnabling combined Software and Data engineering at Web-scale
Enabling combined Software and Data engineering at Web-scale
 
Monitoring and Operational Data Analytics from a User Perspective at First Eu...
Monitoring and Operational Data Analytics from a User Perspective at First Eu...Monitoring and Operational Data Analytics from a User Perspective at First Eu...
Monitoring and Operational Data Analytics from a User Perspective at First Eu...
 
Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)
 
cv
cvcv
cv
 
Ethics in Technology – Example of RIPE Atlas
 Ethics in Technology – Example of RIPE Atlas Ethics in Technology – Example of RIPE Atlas
Ethics in Technology – Example of RIPE Atlas
 

Último

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 

Último (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Towards a Benchmark for Expressive Stream Reasoning

  • 1. Politecnico di Milano, DEIB Portoroz - 2017 - Riccardo Tommasini (Politecnico di Milano) Riccardo Tommasini, Marco Balduini, Emanuele Della Valle {name.surname}@polimi.it Towards a Benchmark for Expressive Stream Reasoning 1
  • 2. ESWCEyE Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano SR State-of-the-art Timeline (Qualitative) 2 time (2008) (2010) (2011) (2015) CityBench (2016) YABench (2012) LSBench SRBench CSRBench EP-SPARQL C-SPARQL SparkWave MorphStream CQELS SKB INSTANS (2013) RSP Engine Benchmark RDFox trOWL Stream Reasoner DyamiTE
  • 3. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 3 Benchmark Ontology Streams Queries Metrics Reasoning SR Bench IoT RDF/ Historical ✔ QL Feature X LS Bench Social Media/IoT Generate d ✔ Max Throughput SubClassOf CSRBench IoT x Parametri c Correctness X CityBench IoT CSV/Real ✔ Query latency, Memory consumption, completeness X YABench IoT Generate d ✔ Correctness X
  • 4. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 4 Stream Reasoner Entailmen t Berlin SPARQL LUBM UOBM DBPedia Spire Galen SKB RDFS ✔ EP-SPARQL RDFS ✔ ✔ trOWL EL+ DynamiTE RDFS ✔ SparkWave RDFS ✔ RDFox OWL 2 RL ✔ ✔ ✔
  • 5. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano ESR Benchmarking 5
  • 6. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Design Principles 6 [P.1] TBox of moderate size yet of scalable complexity. [P.2] Continuous reasoning tasks. [P.3] Arbitrary scaling of static and streaming data. [P.4] Usage of continuous queries.
  • 7. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano SR Experiment 7 Is a TUPLES <R,E,T,D,S,Q,K>, where R is a stream reasoner;
 E is an entailment regime to test;
 T a a static TBox
 S a streaming ABox;
 Q a set of continuous reasoning tasks under E and; 
 K is a set of KPIs to measure.
  • 8. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano LASS 1.0 8 LASS is a first attempt to make an benchmark for ESR. It comprises: L1O, an OWL 2 RL ontology about the Social Media Influence. L1C, a set of reasoning tasks to test the engine capabilities. L1G, a data generation algorithm and its implementation. Building Blocks
  • 9. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano L1O 1.0 9 Extends SIOC Core Vocabulary for online Community with: Actions/Reaction models users’ interaction within a community Influence models users’ influence roles within a community. Content models what characterises posts and discussions. L1O’s modules have different velocities, 
 i.e. class instances change at different rates. The Ontology
  • 10. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano L1C 1.0 10 The Reasoning Tasks Reasoning Task Class Subsumption Role Subsumption Transitive Inverse Realization Tag Containment ✔ Post Popularity ✔ User Activity ✔ ✔ User Participation ✔ ✔
  • 11. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano L1G 1.0 11 L1G exploits these ordering relations to instantiates class individuals. It generates starting from the “slowest” classes (e.g. Discussion) to the “fastest” ones (e.g. MicroPost) L1G does not generate classes from the Influence Module, but it ensures that they can be deducted. The data generation algorithm
  • 12. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Evaluation 12
  • 13. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano What is the goal of a benchmark? 13
  • 14. ESWCEyE Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano The goal of a domain specific benchmark 
 is to foster technological progress 
 by guaranteeing a fair assessment. 
 14 ” “ - Jim Gray, The Benchmark Handbook for Database and Transaction Systems, 1993
  • 15. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 15 Benchmark Approach X 1.5m Measure
  • 16. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 16 Benchmark Approach Y 2m Measure
  • 17. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 17 -Body Level One -Body Level Two -Body Level Three -Body Level Four -Body Level Five
  • 18. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano 18 •  accelerate'progress,'make'technology'viable' Ying'Zhang,2Peter2Boncz2–'Benchmarking'Linked'Open'Data'Technology' ©'Jim'Gray,'2005'
  • 19. ESWCEyE Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Observations 19 More a benchmark challenges an approach more is effective. The benchmark characterises the problem space. The benchmark provides a measurable view over the related solution space.
  • 20. ESWCEyE Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Role of Baselines 20 A baseline defines the lower bound of the solution space. They show the feasibility of the problem. They avoid one-2-one competition by defining a cross reference for the comparison.
  • 21. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Benchmarking is a research problem 21 How to explore the solution space?
  • 22. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Benchmark Design 22 - What are the guiding principles / requirements? - What does constitute a benchmark? - What is the experimental methodology? - What are the the baselines?
  • 23. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Benchmark Quality 23 - Is the benchmark compliant to the requirements/principles? - Is the benchmark used? - What are the benchmark limitations? (KPIs, TestDriver) - Does the benchmark distinguish the compared systems (blind)?
  • 24. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Conclusion & Lass Evaluation 24
  • 25. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Atomic LASS Evaluation 25 - We evaluate L1O showing it is compliant to Gruber’s ontology design principles. - We formulate some continuous queries involving the L1C reasoning tasks. - We implemented L1G by extending LUBM data generator.
  • 26. ESWC Portoroz - 2017 - Riccardo Tommasini - @rictomm - Politecnico di Milano Holistic LASS Evaluation 26 We evaluate LASS against the following principles by Jim Gray (G) and Karl Huppler (H).
 
 A benchmark must be
 [G.1] Simple, [G.2] Portable, [G.3] Scalable, [G.4] Relevant and 
 [H.1] formally Verifiable.
  • 27. Portoroz - 2017 - Riccardo Tommasini (Politecnico di Milano) Questions? Email: riccardo.tommasini@polimi.it
 Twitter: @rictomm Github: riccardotommasini Web: streamreasoning.org 27 Joseph Wright of Derby, An Experiment on a Bird in the Air Pump, 1768. 
 The National Gallery, London