SlideShare a Scribd company logo
1 of 53
Stream Reasoning:
a summary of ten years of research and
a vision for the next decade
http://content.iospress.com/articles/data-science/ds006
Emanuele Della Valle
DEIB - Politecnico di Milano
@manudellavalle
emanuele.dellavalle@polimi.it
http://emanueledellavalle.org
Linköping University, Sweden - 5.10.2017
Me
• Assistant Professor at DEIB
Politecnico di Milano
• Expert in semantic technologies
and stream computing
• Brander of stream reasoning: an
approach to master the velocity and
variety dimension of Big Data
• https://scholar.google.com/scholar?hl=
en&q="stream+reasoning"
• 16 years of experience in research
and innovation projects
• Startupper:
• http://www.fluxedo.com
2
INTRODUCTION
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 3
It's a streaming world …
• Off-shore oil operations
• Smart Cities
• Power turbine
• Social networks
• Generate data streams!
E. Della Valle, S. Ceri, F. van Harmelen, D. Fensel It's a Streaming World! Reasoning upon
Rapidly Changing Information. IEEE Intelligent Systems 24(6): 83-89 (2009)
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 4
… looking for reactive answers …
• When a sensor on a drill in an oil-rig indicates that it is
about to get stuck, how long can I keep drilling?
• Where am I likely going to run into a
traffic jam during my commute tonight?
• Which electricity-producing turbine has
sensor readings similar to any turbine that
subsequently had a critical failure?
• Who is driving the discussion
about the top 10 emerging topics ?
• Require continuous processing
and reactive answer
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 5
… and many more
conflicting requirements
A system able to answer those queries must be able to
• handle volume
• handle velocity
• handle variety
• cope with incompleteness
• cope with noise
• provide reactive answers
• support fine-grained access
• integrate complex domain models
• offer high-level languages
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 6
Grand challenge
• Volume + Velocity + Variety = hard deal
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org
volume
months days hours min. sec. ms.
velocity
ZB
EB
PB
TB
GB
MB
KB
Variety
7
A good reason to embrace it!
• ++ Variety  ++ value
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org
value
ms. sec. min. hours days months years
velocity
Variety
8
From challenges to opportunities
• Formally data streams are :
– unbounded sequences of time-varying data elements
• Less formally, in many application domains, they are:
– a “continuous” flow of information
– where recent information is more relevant as it describes the
current state of a dynamic system
• Opportunities
– Forget old enough information
– Exploit the implicit ordering (by recency) in the data
time
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 9
State-of-the-art: DSMS and CEP
• A paradigmatic change!
• Continuous queries registered over streams that
are observed trough windows
window
input streams streams of answerRegistered
Continuous
Query
Dynamic
System
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 10
DSMS and CEP vs. requirements
Requirement
DSMS
CEP
volume
velocity
variety
incompleteness
noise
reactive answers
fine-grained information access
complex domain models
high-level languages
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 11
State of the art: Semantic Web
• Given ontology O and query Q, use O to rewrite Q
as Q’ so that, for any set of ground facts A contained in multiple
databases:
– answer(Q,O,A) = answer(Q’,,A)
The answer of the query Q using the ontology O for any set of ground facts A
is equal to answer of a query Q’ without considering the ontology O
• Use mapping M to map Q’ to multiple SQL queries to the various
databases
Rewrite
O
Q
Q’
Map
SQL
M
answer
A
UiO, Norway - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 12
Solutions vs. requirements
Requirement
DSMS
CEP
Sem
Web
volume
velocity
variety
incompleteness
noise
reactive answers
fine-grained information access
complex domain models
high-level languages
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 13
Stream Reasoning
• Research question
– is it possible to make sense in real time of
multiple, heterogeneous, gigantic and inevitably noisy and
incomplete data streams in order to support the decision
processes of extremely large numbers of concurrent users?
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 14
Emanuele Della Valle: On Stream Reasoning. PhD thesis, Vrije Universiteit Amsterdam, 2015.
Available online at http://dare.ubvu.vu.nl/handle/1871/53293 .
Is this feasible?
• Proposed approach: cascading Stream Reasoning
Complexity
Raw Stream Processing
Semantic Streams
DL-Lite
DLAbstraction
Selection
Interpretation
Reasoning
Querying
Re-writing
Change Frequency
PTIME
NEXPTIME
104 Hz
1 Hz
Complexity vs. Dynamics
AC0
H. Stuckenschmidt, S. Ceri, E. Della Valle, F. van Harmelen: Towards Expressive Stream Reasoning. Proceedings of
the Dagstuhl Seminar on Semantic Aspects of Sensor Networks, 2010.
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 15
A model to describe stream reasoning
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 16
D. Dell’Aglio, On Unified Stream Reasoning, PhD thesis, Politecnico di Milano, 2016.
Stream Processing (DSMS)
Event Processing (CEP)Window merge
Window operator
Streams
Graph-level entailment
Window-level entailment
Stream-level entailment
Application
StreamReasoning
GRAPH LEVEL
UNDER SIMPLE ENTAILMENT REGIME
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 17
State-of-the-art: RDF model
• RDF: Resource Description Framework
– It allows to make statements about resources
in the form of subject-predicate-object expressions
• In RDF terminology triples
• E.g.
@TimBernersLee posts "This is for everyone"
– A collection of RDF statements represents a labelled,
directed graph
• In RDF terminology a graph
• E.g., the tweet above by Tim Berners Lee is connected to
– Thousands of twitter user profiles via retweets
– Thousands of twitter user profiles via favorite
– …
subject predicate object
UiO, Norway - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 18
RDF stream Models
• RDF Stream (the C-SPARQL way)
– Unbound sequence of time-varying triples
– each represented by a pair made of an RDF triple and its
timestamp
– Timestamp are non-decreasing (allowing for simultaneity)
…
TimBernersLee posts "This is for everyone", 10:16.55 PM 22 Aug 2012
@Alice posts "RT: This is for everyone", 10:17.03 PM 22 Aug 2012
…
D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with
C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
subject predicate object timestamp
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 19
RDF stream Models
• RDF Stream (the Streaming Linked Data way)
– Unbound sequence of time-varying graphs
– each represented by a pair made of an RDF graph and its
timestamp (all triples in a graph are simultaneous)
– Timestamps (if present) are monotonically increasing
– Graphs act as a form of punctuation
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org
A. Mauri, J-P Calbimonte, D.Dell'Aglio, M.Balduini, M.Brambilla, E.Della Valle, K. Aberer:
TripleWave: Spreading RDF Streams on the Web. ISWC (2) 2016: 140-149
20
D.F. Barbieri, E. Della Valle: A Proposal for Publishing Data Streams as Linked Data –
A Position Paper. LDOW (2010)
Continuous-SPARQL
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 21
Continuous-SPARQL
Who are the opinion makers? i.e., the users who are
likely to influence the behavior their followers
REGISTER STREAM OpinionMakers COMPUTED EVERY 5m AS
CONSTRUCT { ?opinionMaker sd:about ?resource }
FROM STREAM <http://…> [RANGE 30m STEP 5m]
WHERE {
?opinionMaker ?opinion ?res .
?follower sioc:follows ?opinionMaker.
?follower ?opinion ?res.
FILTER (cs:timestamp(?follower ?opinion ?res) >
cs:timestamp(?opinionMaker ?opinion ?res) )
}
HAVING ( COUNT(DISTINCT ?follower) > 3 )
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 22D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with
C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
Continuous-SPARQL
Who are the opinion makers? i.e., the users who are
likely to influence the behavior their followers
REGISTER STREAM OpinionMakers COMPUTED EVERY 5m AS
CONSTRUCT { ?opinionMaker sd:about ?resource }
FROM STREAM <http://…> [RANGE 30m STEP 5m]
WHERE {
?opinionMaker ?opinion ?res .
?follower sioc:follows ?opinionMaker.
?follower ?opinion ?res.
FILTER (cs:timestamp(?follower ?opinion ?res) >
cs:timestamp(?opinionMaker ?opinion ?res) )
}
HAVING ( COUNT(DISTINCT ?follower) > 3 )
Query registration
(for continuous execution)
FROM STREAM clause
WINDOW
RDF Stream added as
new ouput format
Builtin to access
timestamps
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 23D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with
C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
More than modeling,
it's for reactive answers!
• C-SPARQL engine time window-based selection outperforms
SPARQL filter-based selection (Jena-ARQ)
D. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A.Rettinger, H. Wermser:
Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics
IEEE Intelligent Systems, 30 Aug. 2010.
Our In-memory
RDF stream
processing
engine
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 24
Alternatives to C-SPARQL
• SPARQLStream
– What: window in the past, focus on RDF to Stream operators
– Ref: Calbimonte, J.-P., Corcho, O., & Gray, A. J. G. Enabling ontology-based
access to streaming data sources. In ISWC, 2010, pages 96–111.
• CQELS
– What: STREAM clause, focus on new answer
– Ref: Le-Phuoc, D., Dao-Tran, M., Xavier Parreira, J., & Hauswirth, M.
A native and adaptive approach for unified processing of linked streams
and linked data. In ISWC 2011, pages 370–388.
• EP-SPARQL
– What: focus on event specific operators
– Ref: Anicic, D., Fodor, P., Rudolph, S., & Stojanovic, N. EP-SPARQL: a unified
language for event processing and stream reasoning. In WWW 2011, pages
635–644.
• TEF-SPARQL
– What: adds "facts" as first class elements
– Ref: https://www.merlin.uzh.ch/publication/show/8467
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 25
Work in progress
• RSP-QL
– Syntax
• https://github.com/streamreasoning/RSP-QL/blob/master/RSP-
QL%20Sample%20Queries.md
– Proposed semantics
• D.Dell'Aglio, E.Della Valle, J.-P.Calbimonte, Ó. Corcho: RSP-QL
Semantics: A Unifying Query Model to Explain Heterogeneity of
RDF Stream Processing Systems. Int. J. Semantic Web Inf. Syst.
10(4): 17-44 (2014)
– Semantics (work in progress)
• https://github.com/streamreasoning/RSP-
QL/blob/master/Semantics.md
– Quick ref.
• D. Dell'Aglio, J.-P. Calbimonte, E. Della Valle, Ó. Corcho: Towards a
Unified Language for RDF Stream Query Processing. ESWC
(Satellite Events) 2015: 353-363
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 26
GRAPH LEVEL
UNDER OTHER ENTAILMENT REGIME
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 27
continuous deductive reasoning
• DL Ontology Stream ST
– A ontology stream with respect to a static Tbox T is a
sequence of Abox axioms ST(i)
• A Windowed Ontology Stream ST(o,c]
– A windowed ontology stream with respect to a static
Tbox T is the union of the Abox axioms ST(i) where
o<i≤c
• Reasoning on a Windowed Ontology Stream
ST(o,c] is as reasoning on a static DL KB
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 28
Emanuele Della Valle, Stefano Ceri, Davide Francesco Barbieri, Daniele Braga, Alessandro
Campi: A First Step Towards Stream Reasoning. FIS 2008: 72-81
discusses discusses discusses
discusses discusses
discusses
discusses
Example of
continuous deductive reasoning
What impact has been my micropost p1 creating in the last hour?
Let’s count the number of microposts that discuss it …
REGISTER STREAM ImpactMeter AS
SELECT (count(?p) AS ?impact)
FROM STREAM <http://…/fb> [RANGE 60m STEP 10m]
WHERE {
:Alice posts [ sr:discusses ?p ]
}
p1 p3 p5 p8
p2 p4 p7
p6
Transitive
property
Alice posts p1 .
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 29
Not so naïve approach to
stream reasoning
• The problem is that materialization (the result of data-driven
processing) are very difficult to decrement efficiently.
– State-of-the-art: DRed algorithm
• Over delete
• Re-derive
• Insert
Reasoner
Inferred
data
ontology
window
insertions
deletions
Incremental !!!
SPARQL
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 30
Is DRed needed?
• DRed works with random insertions and deletions
• In a streaming setting, when a triple enters the window,
given the size of the window, the reasoner knows already
when it will be deleted!
• E.g.,
– if the window is 40 minutes
long, and,
– it is 10:00, the triple(s)
entering now
– will exit on 10:40.
• Conclusion
– deletions are predictable
Time
Enter
window
Exit
window
Explicitly in
window
Infer
win
10:00 AßB
10:10 BßC
10:20 AßE
10:30 EßC
10:40 AßB
10:50 BßC
11:00 AßE
A B
A B C A
A B C
E
A
A B C
E
A
A C
E
A
A B C
E
A
C
E
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 31
IMaRS algorithm
• Idea:
– add an expiration time to each triple and
– use an hash table to index triples by their expiration time
• The algorithm
1. deletes expired triples
2. Adds the new derivations that are consequences of
insertions annotating each inferred triple with an
expiration time (the min of those of the triple it is
derived from), and
3. when multiple derivations occur, for each multiple
derivation, it keeps the max expiration time.
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 32
IMaRS algorithm
• Incremental Reasoning on RDF streams (IMaRS): new reasoning
algorithm optimized for reactive query answering
D.F. Barbieri, D. Braga, S.Ceri, E. Della Valle, M. Grossniklaus: Incremental Reasoning on
Streams and Rich Background Knowledge. ESWC (1) 2010: 1-15
D. Dell'Aglio, E. Della Valle: Incremental Reasoning on RDF Streams. In A.Harth, K.Hose,
R.Schenkel (Eds.) Linked Data Management, CRC Press 2014, ISBN 9781466582408
§ Re-materialize after each window slide
§ Use DRed
§ IMaRS
% of deletions w.r.t. the content of the window
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 33
IMaRS algorithm
• comparison of the average time needed to answer
a C-SPARQL query, when 2% of the content exits the window each
time it slides, using
– A backward reasoner on the window content
– DRed + standard SPARQL on the materialization
– IMaRS + standard SPARQL on the materialization
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 34
Some alternatives to IMaRS
• TROWL
– How: DRed in the context of approximate reasoning
– Ref: Y. Ren, J. Z. Pan. Optimising ontology stream reasoning
with truth maintenance system. In CIKM (2011)
• Sparkwave
– How: extended RETE algorithm for windows and RDFS
– Ref: Sparkwave: Continuous Schema-Enhanced Pattern
Matching over RDF Data Streams. Komazec S, Cerri D. DEBS
2012
• The Backward/Forward Algorithm
– How: optimizing DRed
– B. Motik, Y. Nenov, R.E.F. Piro, I. Horrocks: Incremental Update
of Datalog Materialisation: the Backward/Forward Algorithm.
AAAI 2015: 1560-1568
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 35
WINDOW LEVEL
ENTAILMENT
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 36
window level entailment
• Graph-level entailment considers data item contents, but it
does not use the temporal annotations
• Window-level entailment applies the inference process on
the non-merged stream items.
• E.g.,
– A door cannot be open and close at the same time
– A window contains: door A is open @1, door A is close @2
– At graph-level the reasoner tells that there is an inconsistency in
the window because it ignore the parts in italics
– At window-level the reasoner does not
• Best approaches I saw so far
– ÖL Özçep, R Möller. Ontology Based Data Access on Temporal and
Streaming Data. Reasoning Web, 2014
– D. Dell'Aglio et al. :A Query Model to Capture Event Pattern Matching
in RDF Stream Processing Query Languages EKAW 2016: 145-162
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 37
STREAM LEVEL
ENTAILMENT
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 38
Stream-level entailment
• Window-level entailment only considers a recent portion of
the stream
• Stream-level entailment aims at considering the entire
stream
• This is not just a theoretical dream, CEP does so
– E.g., rise C for every A that follows a B without a C in the middle
• Best approaches I saw so far
– ETALIS
• Anicic, D., Rudolph, S., Fodor, P., & Stojanovic, N. Stream reasoning and
complex event processing in ETALIS Semantic Web,3(4), 2012, 397–407.
– LARS
• H. Beck, M. Dao-Tran, T. Eiter, M. Fink: LARS: A Logic-Based Framework
for Analyzing Reasoning over Streams. AAAI 2015: 1431-1438H.
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 39
OPEN PROBLEMS AND CHALLENGES
FOR THE NEXT YEARS
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 40
Requirements vs. state-of-the-art
Requirement SotA@2017Q2
volume
velocity
variety
incompleteness
noise
reactive answers
fine-grained information access
complex domain models
high-level languages
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 41
Towards queries that capture user needs
• Capturing a wider set of tasks in queries
– Descriptive analytics
• Which electricity-producing turbine has sensor readings similar
(i.e., Pearson correlated by at least 0.75) to any turbine that
subsequently had a critical failure in the past year?
– Predictive analytics (Machine Learning)
• Where am I likely going to run into a traffic jam during my
commute tonight and how long will it take, given current weather
and traffic conditions?
– Geo-spatial reasoning
– Including preferences
– Exploiting the graph nature of the data
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 42
Towards queries that capture user needs
• Tightening CEP and ontological reasoning
– ETALIS/EP-SPARQL is the corner stone with high-
expressivity but very low efficiency
– Work in progress:
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 43
CEP
DL
RDF STREAM PROCESSING
RAW STREAM PROCESSING
Abstraction
Selection
Annotation
Reasoning
Querying
Rewriting
R. Tommasini, P. Bonte, E. Della Valle, E. Mannens, F. De Turck, F. Ongenae: Towards
Ontology-Based Event Processing. OWLED 2016: 115-127
Towards queries that capture user needs
• Forgetting knowledge
– Is IMaRS semantics what users want?
– Consumption ≠ expiration
• Consumed facts are not useful for processing
• Expired facts are not true anymore
– Should we introduce
• a notion of semantic importance?
• a notion of time-annotated facts?
• …
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 44
Towards sophisticated Stream Reasoning
• Extending the range of logical formalisms
– Pioneers:
• Metric Temporal logic
– F. Heintz and P. Doherty, DyKnow: An approach to middleware for
knowledge processing, Journal of Intelligent and 11 Fuzzy Systems 15(1)
(2004), 3–13.
• Answer Set Programming (ASP)
– A. Mileo, A. Abdelrahman, S. Policarpio and M. Hauswirth, StreamRule:
A nonmonotonic stream reasoning system for the semantic web, in:
RR, LNCS, Vol. 7994, Springer, 2013, pp. 247–252.
– More alternatives:
• action logics
• step logics
• active logics
• event calculus
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 45
Towards sophisticated Stream Reasoning
• Integrating other types of reasoning/processing
– My contribution
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 46
D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A. Rettinger, H. Wermser:
Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics.
IEEE Intelligent Systems 25(6): 32-41 (2010)
Towards sophisticated Stream Reasoning
• Integrating other types of reasoning/processing
– Other options
• More on inductive stream reasoning
• Probabilistic reasoning
• Planning
• Natural Language Processing
• Sentiment analysis
• …
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 47
Towards sophisticated Stream Reasoning
• Towards semantic streams
– Describe data stream to make them discoverable,
composable, etc.
– My work in progress
• Y. A. Sedira, R.Tommasini, E. Della Valle: Towards VoIS: A
Vocabulary of Interlinked Streams. DeSemWeb@ISWC 2017
– There is much more to describe, e.g.
• Stream of state (the door is close) vs.
Stream of actions (the door was closed)
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 48
Towards sophisticated Stream Reasoning
• Towards scalable stream reasoners
– Rewriting
• e.g., MorphStream
– How: rewriting in DSMS languages (one at a time)
– Ref: Calbimonte, J.-P., Corcho, O., & Gray, A. J. G. Enabling ontology-
based access to streaming data sources. In ISWC, 2010, pages 96–111.
– Using parallelization and distribution
• e.g., DynamiTE
– How: Truth maintenance for DF (a fragment of RDFS)
– J. Urbani, A. Margara, C. J. H. Jacobs, F. van Harmelen, H.E. Bal:
DynamiTE: Parallel Materialization of Dynamic RDF Data. ISWC (1)
2013: 657-672
– Using approximation
• e.g., TR-OWL
– How: Truth maintenance for EL++ with syntactic approximations
– Ref: Y. Ren, J. Z. Pan. Optimising ontology stream reasoning with truth
maintenance system. In CIKM (2011)
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 49
Towards robustness to imperfect data
• Overcoming types of heterogeneity that do not
exist in the static reasoning settings
– It is not just extending OBDA to continuous queries
– There are more sources of heterogeneity than in the
static database settings
• Different execution semantics
• No possibility for a priori homogenous
– Synchronisation*
– discretization
• …
* Pioneered in F.Heintz, DyKnow: A stream Based Knowledge Processing Middleware
Framework, PhD thesis,Linköping University, Sweden, 2009.
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 50
Towards robustness to imperfect data
• Copying with noise
– Data as signals instead of data as truths!
– Easy to state, hard to achieve
– Still of paramount importance!
– Notable approaches I run into
• Matthias Nickles, Alessandra Mileo: Web Stream Reasoning Using
Probabilistic Answer Set Programming. RR 2014: 197-205
• Anastasios Skarlatidis, Georgios Paliouras, Alexander Artikis, George A.
Vouros: Probabilistic Event Calculus for Event Recognition. ACM Trans.
Comput. Log. 16(2): 11:1-11:37 (2015)
• Anni-Yasmin Turhan, Erik Zenker: Towards Temporal Fuzzy Query
Answering on Stream-based Data. HiDeSt@KI 2015: 56-69
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 51
Conclusions
SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 52
Stream Processing
Event ProcessingWindow merge
Window operator
Streams
Graph-level entailment
Window-level entailment
Stream-level entailment
Application
StreamReasoning
Very well only some initial studies still very open
New abstractions to capture user needs
Application
Stream Reasoning:
a summary of ten years of research and a
vision for the next decade
http://content.iospress.com/articles/data-science/ds006
Q/A
Emanuele Della Valle
DEIB - Politecnico di Milano
@manudellavalle
emanuele.dellavalle@polimi.it
http://emanueledellavalle.org
Linköping University, Sweden - 5.10.2017

More Related Content

Similar to Stream Reasoning: a summary of ten years of research and a vision for the next decade

Tackling variety in event based systems
Tackling variety in event based systemsTackling variety in event based systems
Tackling variety in event based systemsSouleiman Hasan
 
Think Big - How to Design a Big Data Information Architecture
Think Big - How to Design a Big Data Information ArchitectureThink Big - How to Design a Big Data Information Architecture
Think Big - How to Design a Big Data Information ArchitectureInside Analysis
 
Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageNoreen Whysel
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014Raja Chiky
 
Maximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataMaximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataAsuncion Gomez-Perez
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks Emanuele Della Valle
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
Data Science at Trainline for Smarter Journeys
Data Science at Trainline for Smarter JourneysData Science at Trainline for Smarter Journeys
Data Science at Trainline for Smarter JourneysMarco Rossetti
 
Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondEmanuele Della Valle
 
Listening to the pulse of our cities with Stream Reasoning (and few more tech...
Listening to the pulse of our cities with Stream Reasoning (and few more tech...Listening to the pulse of our cities with Stream Reasoning (and few more tech...
Listening to the pulse of our cities with Stream Reasoning (and few more tech...Emanuele Della Valle
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Stefan Dietze
 
RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)Daniele Dell'Aglio
 
Toward Semantic Data Stream - Technologies and Applications
Toward Semantic Data Stream - Technologies and ApplicationsToward Semantic Data Stream - Technologies and Applications
Toward Semantic Data Stream - Technologies and ApplicationsRaja Chiky
 
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...Emanuele Della Valle
 
Network Traffic Search using Apache HBase
Network Traffic Search using Apache HBaseNetwork Traffic Search using Apache HBase
Network Traffic Search using Apache HBaseEvans Ye
 
Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Oscar Corcho
 
Matapihi 'The National Digital Project'. The University of Auckland Library P...
Matapihi 'The National Digital Project'. The University of Auckland Library P...Matapihi 'The National Digital Project'. The University of Auckland Library P...
Matapihi 'The National Digital Project'. The University of Auckland Library P...Rose Holley
 

Similar to Stream Reasoning: a summary of ten years of research and a vision for the next decade (20)

Tackling variety in event based systems
Tackling variety in event based systemsTackling variety in event based systems
Tackling variety in event based systems
 
Think Big - How to Design a Big Data Information Architecture
Think Big - How to Design a Big Data Information ArchitectureThink Big - How to Design a Big Data Information Architecture
Think Big - How to Design a Big Data Information Architecture
 
Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural Heritage
 
Seminaire bigdata23102014
Seminaire bigdata23102014Seminaire bigdata23102014
Seminaire bigdata23102014
 
Maximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked DataMaximising (Re)Usability of Resources using Linked Data
Maximising (Re)Usability of Resources using Linked Data
 
On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks On the need to include functional testing in RDF stream engine benchmarks
On the need to include functional testing in RDF stream engine benchmarks
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Data Science at Trainline for Smarter Journeys
Data Science at Trainline for Smarter JourneysData Science at Trainline for Smarter Journeys
Data Science at Trainline for Smarter Journeys
 
Observlets
Observlets Observlets
Observlets
 
Stream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and BeyondStream Reasoning: State of the Art and Beyond
Stream Reasoning: State of the Art and Beyond
 
Listening to the pulse of our cities with Stream Reasoning (and few more tech...
Listening to the pulse of our cities with Stream Reasoning (and few more tech...Listening to the pulse of our cities with Stream Reasoning (and few more tech...
Listening to the pulse of our cities with Stream Reasoning (and few more tech...
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)
 
RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)RDF Stream Processing Models (SR4LD2013)
RDF Stream Processing Models (SR4LD2013)
 
Toward Semantic Data Stream - Technologies and Applications
Toward Semantic Data Stream - Technologies and ApplicationsToward Semantic Data Stream - Technologies and Applications
Toward Semantic Data Stream - Technologies and Applications
 
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
It's a Streaming World! Reasoning upon Rapidly Changing Information (Milano, ...
 
Network Traffic Search using Apache HBase
Network Traffic Search using Apache HBaseNetwork Traffic Search using Apache HBase
Network Traffic Search using Apache HBase
 
Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?Linked Statistical Data: does it actually pay off?
Linked Statistical Data: does it actually pay off?
 
On a web of data streams
On a web of data streamsOn a web of data streams
On a web of data streams
 
Matapihi 'The National Digital Project'. The University of Auckland Library P...
Matapihi 'The National Digital Project'. The University of Auckland Library P...Matapihi 'The National Digital Project'. The University of Auckland Library P...
Matapihi 'The National Digital Project'. The University of Auckland Library P...
 
Francesco Serafin
Francesco Serafin Francesco Serafin
Francesco Serafin
 

More from Emanuele Della Valle

Taming velocity - a tale of four streams
Taming velocity - a tale of four streamsTaming velocity - a tale of four streams
Taming velocity - a tale of four streamsEmanuele Della Valle
 
Work in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningWork in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningEmanuele Della Valle
 
La città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoLa città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoEmanuele Della Valle
 
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...Emanuele Della Valle
 
Stream reasoning: an approach to tame the velocity and variety dimensions of ...
Stream reasoning: an approach to tame the velocity and variety dimensions of ...Stream reasoning: an approach to tame the velocity and variety dimensions of ...
Stream reasoning: an approach to tame the velocity and variety dimensions of ...Emanuele Della Valle
 
Big Data: how to use it to create value
Big Data: how to use it to create valueBig Data: how to use it to create value
Big Data: how to use it to create valueEmanuele Della Valle
 
Ist16-03 An Introduction to the Semantic Web
Ist16-03 An Introduction to the Semantic Web Ist16-03 An Introduction to the Semantic Web
Ist16-03 An Introduction to the Semantic Web Emanuele Della Valle
 
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)Ist16-02 HL7 from v2 (syntax) to v3 (semantics)
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)Emanuele Della Valle
 
IST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesIST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesEmanuele Della Valle
 
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...Listening to the pulse of our cities fusing Social Media Streams and Call Dat...
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...Emanuele Della Valle
 
Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Emanuele Della Valle
 
City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)Emanuele Della Valle
 
Semantic technologies and Interoperability
Semantic technologies and InteroperabilitySemantic technologies and Interoperability
Semantic technologies and InteroperabilityEmanuele Della Valle
 
Big data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeBig data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeEmanuele Della Valle
 
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015Emanuele Della Valle
 
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...Emanuele Della Valle
 

More from Emanuele Della Valle (20)

Taming velocity - a tale of four streams
Taming velocity - a tale of four streamsTaming velocity - a tale of four streams
Taming velocity - a tale of four streams
 
Stream reasoning
Stream reasoningStream reasoning
Stream reasoning
 
Work in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream ReasoningWork in progress on Inductive Stream Reasoning
Work in progress on Inductive Stream Reasoning
 
Big Data and Data Science W's
Big Data and Data Science W'sBig Data and Data Science W's
Big Data and Data Science W's
 
La città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - FluxedoLa città dei balocchi 2017 in numeri - Fluxedo
La città dei balocchi 2017 in numeri - Fluxedo
 
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...
ACQUA: Approximate Continuous Query Answering over Streams and Dynamic Linked...
 
Stream reasoning: an approach to tame the velocity and variety dimensions of ...
Stream reasoning: an approach to tame the velocity and variety dimensions of ...Stream reasoning: an approach to tame the velocity and variety dimensions of ...
Stream reasoning: an approach to tame the velocity and variety dimensions of ...
 
Big Data: how to use it to create value
Big Data: how to use it to create valueBig Data: how to use it to create value
Big Data: how to use it to create value
 
Ist16-04 An introduction to RDF
Ist16-04 An introduction to RDF Ist16-04 An introduction to RDF
Ist16-04 An introduction to RDF
 
Ist16-03 An Introduction to the Semantic Web
Ist16-03 An Introduction to the Semantic Web Ist16-03 An Introduction to the Semantic Web
Ist16-03 An Introduction to the Semantic Web
 
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)Ist16-02 HL7 from v2 (syntax) to v3 (semantics)
Ist16-02 HL7 from v2 (syntax) to v3 (semantics)
 
IST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic TechnologiesIST16-01 - Introduction to Interoperability and Semantic Technologies
IST16-01 - Introduction to Interoperability and Semantic Technologies
 
On Stream Reasoning
On Stream ReasoningOn Stream Reasoning
On Stream Reasoning
 
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...Listening to the pulse of our cities fusing Social Media Streams and Call Dat...
Listening to the pulse of our cities fusing Social Media Streams and Call Dat...
 
Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03Social listener-brera-design-district-2015-03
Social listener-brera-design-district-2015-03
 
City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)City Data Fusion for Event Management (in Italiano)
City Data Fusion for Event Management (in Italiano)
 
Semantic technologies and Interoperability
Semantic technologies and InteroperabilitySemantic technologies and Interoperability
Semantic technologies and Interoperability
 
Big data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscapeBig data: why, what, paradigm shifts enabled , tools and market landscape
Big data: why, what, paradigm shifts enabled , tools and market landscape
 
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
City Data Fusion and City Sensing presented at EIT ICT Labs for EXPO 2015
 
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
On the effectiveness of a Mobile Puzzle Game UI to Crowdsource Linked Data Ma...
 

Recently uploaded

Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 

Recently uploaded (20)

E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 

Stream Reasoning: a summary of ten years of research and a vision for the next decade

  • 1. Stream Reasoning: a summary of ten years of research and a vision for the next decade http://content.iospress.com/articles/data-science/ds006 Emanuele Della Valle DEIB - Politecnico di Milano @manudellavalle emanuele.dellavalle@polimi.it http://emanueledellavalle.org Linköping University, Sweden - 5.10.2017
  • 2. Me • Assistant Professor at DEIB Politecnico di Milano • Expert in semantic technologies and stream computing • Brander of stream reasoning: an approach to master the velocity and variety dimension of Big Data • https://scholar.google.com/scholar?hl= en&q="stream+reasoning" • 16 years of experience in research and innovation projects • Startupper: • http://www.fluxedo.com 2
  • 3. INTRODUCTION SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 3
  • 4. It's a streaming world … • Off-shore oil operations • Smart Cities • Power turbine • Social networks • Generate data streams! E. Della Valle, S. Ceri, F. van Harmelen, D. Fensel It's a Streaming World! Reasoning upon Rapidly Changing Information. IEEE Intelligent Systems 24(6): 83-89 (2009) SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 4
  • 5. … looking for reactive answers … • When a sensor on a drill in an oil-rig indicates that it is about to get stuck, how long can I keep drilling? • Where am I likely going to run into a traffic jam during my commute tonight? • Which electricity-producing turbine has sensor readings similar to any turbine that subsequently had a critical failure? • Who is driving the discussion about the top 10 emerging topics ? • Require continuous processing and reactive answer SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 5
  • 6. … and many more conflicting requirements A system able to answer those queries must be able to • handle volume • handle velocity • handle variety • cope with incompleteness • cope with noise • provide reactive answers • support fine-grained access • integrate complex domain models • offer high-level languages SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 6
  • 7. Grand challenge • Volume + Velocity + Variety = hard deal SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org volume months days hours min. sec. ms. velocity ZB EB PB TB GB MB KB Variety 7
  • 8. A good reason to embrace it! • ++ Variety  ++ value SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org value ms. sec. min. hours days months years velocity Variety 8
  • 9. From challenges to opportunities • Formally data streams are : – unbounded sequences of time-varying data elements • Less formally, in many application domains, they are: – a “continuous” flow of information – where recent information is more relevant as it describes the current state of a dynamic system • Opportunities – Forget old enough information – Exploit the implicit ordering (by recency) in the data time SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 9
  • 10. State-of-the-art: DSMS and CEP • A paradigmatic change! • Continuous queries registered over streams that are observed trough windows window input streams streams of answerRegistered Continuous Query Dynamic System SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 10
  • 11. DSMS and CEP vs. requirements Requirement DSMS CEP volume velocity variety incompleteness noise reactive answers fine-grained information access complex domain models high-level languages SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 11
  • 12. State of the art: Semantic Web • Given ontology O and query Q, use O to rewrite Q as Q’ so that, for any set of ground facts A contained in multiple databases: – answer(Q,O,A) = answer(Q’,,A) The answer of the query Q using the ontology O for any set of ground facts A is equal to answer of a query Q’ without considering the ontology O • Use mapping M to map Q’ to multiple SQL queries to the various databases Rewrite O Q Q’ Map SQL M answer A UiO, Norway - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 12
  • 13. Solutions vs. requirements Requirement DSMS CEP Sem Web volume velocity variety incompleteness noise reactive answers fine-grained information access complex domain models high-level languages SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 13
  • 14. Stream Reasoning • Research question – is it possible to make sense in real time of multiple, heterogeneous, gigantic and inevitably noisy and incomplete data streams in order to support the decision processes of extremely large numbers of concurrent users? SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 14 Emanuele Della Valle: On Stream Reasoning. PhD thesis, Vrije Universiteit Amsterdam, 2015. Available online at http://dare.ubvu.vu.nl/handle/1871/53293 .
  • 15. Is this feasible? • Proposed approach: cascading Stream Reasoning Complexity Raw Stream Processing Semantic Streams DL-Lite DLAbstraction Selection Interpretation Reasoning Querying Re-writing Change Frequency PTIME NEXPTIME 104 Hz 1 Hz Complexity vs. Dynamics AC0 H. Stuckenschmidt, S. Ceri, E. Della Valle, F. van Harmelen: Towards Expressive Stream Reasoning. Proceedings of the Dagstuhl Seminar on Semantic Aspects of Sensor Networks, 2010. SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 15
  • 16. A model to describe stream reasoning SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 16 D. Dell’Aglio, On Unified Stream Reasoning, PhD thesis, Politecnico di Milano, 2016. Stream Processing (DSMS) Event Processing (CEP)Window merge Window operator Streams Graph-level entailment Window-level entailment Stream-level entailment Application StreamReasoning
  • 17. GRAPH LEVEL UNDER SIMPLE ENTAILMENT REGIME SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 17
  • 18. State-of-the-art: RDF model • RDF: Resource Description Framework – It allows to make statements about resources in the form of subject-predicate-object expressions • In RDF terminology triples • E.g. @TimBernersLee posts "This is for everyone" – A collection of RDF statements represents a labelled, directed graph • In RDF terminology a graph • E.g., the tweet above by Tim Berners Lee is connected to – Thousands of twitter user profiles via retweets – Thousands of twitter user profiles via favorite – … subject predicate object UiO, Norway - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 18
  • 19. RDF stream Models • RDF Stream (the C-SPARQL way) – Unbound sequence of time-varying triples – each represented by a pair made of an RDF triple and its timestamp – Timestamp are non-decreasing (allowing for simultaneity) … TimBernersLee posts "This is for everyone", 10:16.55 PM 22 Aug 2012 @Alice posts "RT: This is for everyone", 10:17.03 PM 22 Aug 2012 … D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with C-SPARQL. SIGMOD Record 39(1): 20-26 (2010) subject predicate object timestamp SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 19
  • 20. RDF stream Models • RDF Stream (the Streaming Linked Data way) – Unbound sequence of time-varying graphs – each represented by a pair made of an RDF graph and its timestamp (all triples in a graph are simultaneous) – Timestamps (if present) are monotonically increasing – Graphs act as a form of punctuation SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org A. Mauri, J-P Calbimonte, D.Dell'Aglio, M.Balduini, M.Brambilla, E.Della Valle, K. Aberer: TripleWave: Spreading RDF Streams on the Web. ISWC (2) 2016: 140-149 20 D.F. Barbieri, E. Della Valle: A Proposal for Publishing Data Streams as Linked Data – A Position Paper. LDOW (2010)
  • 21. Continuous-SPARQL SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 21
  • 22. Continuous-SPARQL Who are the opinion makers? i.e., the users who are likely to influence the behavior their followers REGISTER STREAM OpinionMakers COMPUTED EVERY 5m AS CONSTRUCT { ?opinionMaker sd:about ?resource } FROM STREAM <http://…> [RANGE 30m STEP 5m] WHERE { ?opinionMaker ?opinion ?res . ?follower sioc:follows ?opinionMaker. ?follower ?opinion ?res. FILTER (cs:timestamp(?follower ?opinion ?res) > cs:timestamp(?opinionMaker ?opinion ?res) ) } HAVING ( COUNT(DISTINCT ?follower) > 3 ) SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 22D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
  • 23. Continuous-SPARQL Who are the opinion makers? i.e., the users who are likely to influence the behavior their followers REGISTER STREAM OpinionMakers COMPUTED EVERY 5m AS CONSTRUCT { ?opinionMaker sd:about ?resource } FROM STREAM <http://…> [RANGE 30m STEP 5m] WHERE { ?opinionMaker ?opinion ?res . ?follower sioc:follows ?opinionMaker. ?follower ?opinion ?res. FILTER (cs:timestamp(?follower ?opinion ?res) > cs:timestamp(?opinionMaker ?opinion ?res) ) } HAVING ( COUNT(DISTINCT ?follower) > 3 ) Query registration (for continuous execution) FROM STREAM clause WINDOW RDF Stream added as new ouput format Builtin to access timestamps SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 23D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, M. Grossniklaus: Querying RDF streams with C-SPARQL. SIGMOD Record 39(1): 20-26 (2010)
  • 24. More than modeling, it's for reactive answers! • C-SPARQL engine time window-based selection outperforms SPARQL filter-based selection (Jena-ARQ) D. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A.Rettinger, H. Wermser: Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics IEEE Intelligent Systems, 30 Aug. 2010. Our In-memory RDF stream processing engine SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 24
  • 25. Alternatives to C-SPARQL • SPARQLStream – What: window in the past, focus on RDF to Stream operators – Ref: Calbimonte, J.-P., Corcho, O., & Gray, A. J. G. Enabling ontology-based access to streaming data sources. In ISWC, 2010, pages 96–111. • CQELS – What: STREAM clause, focus on new answer – Ref: Le-Phuoc, D., Dao-Tran, M., Xavier Parreira, J., & Hauswirth, M. A native and adaptive approach for unified processing of linked streams and linked data. In ISWC 2011, pages 370–388. • EP-SPARQL – What: focus on event specific operators – Ref: Anicic, D., Fodor, P., Rudolph, S., & Stojanovic, N. EP-SPARQL: a unified language for event processing and stream reasoning. In WWW 2011, pages 635–644. • TEF-SPARQL – What: adds "facts" as first class elements – Ref: https://www.merlin.uzh.ch/publication/show/8467 SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 25
  • 26. Work in progress • RSP-QL – Syntax • https://github.com/streamreasoning/RSP-QL/blob/master/RSP- QL%20Sample%20Queries.md – Proposed semantics • D.Dell'Aglio, E.Della Valle, J.-P.Calbimonte, Ó. Corcho: RSP-QL Semantics: A Unifying Query Model to Explain Heterogeneity of RDF Stream Processing Systems. Int. J. Semantic Web Inf. Syst. 10(4): 17-44 (2014) – Semantics (work in progress) • https://github.com/streamreasoning/RSP- QL/blob/master/Semantics.md – Quick ref. • D. Dell'Aglio, J.-P. Calbimonte, E. Della Valle, Ó. Corcho: Towards a Unified Language for RDF Stream Query Processing. ESWC (Satellite Events) 2015: 353-363 SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 26
  • 27. GRAPH LEVEL UNDER OTHER ENTAILMENT REGIME SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 27
  • 28. continuous deductive reasoning • DL Ontology Stream ST – A ontology stream with respect to a static Tbox T is a sequence of Abox axioms ST(i) • A Windowed Ontology Stream ST(o,c] – A windowed ontology stream with respect to a static Tbox T is the union of the Abox axioms ST(i) where o<i≤c • Reasoning on a Windowed Ontology Stream ST(o,c] is as reasoning on a static DL KB SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 28 Emanuele Della Valle, Stefano Ceri, Davide Francesco Barbieri, Daniele Braga, Alessandro Campi: A First Step Towards Stream Reasoning. FIS 2008: 72-81
  • 29. discusses discusses discusses discusses discusses discusses discusses Example of continuous deductive reasoning What impact has been my micropost p1 creating in the last hour? Let’s count the number of microposts that discuss it … REGISTER STREAM ImpactMeter AS SELECT (count(?p) AS ?impact) FROM STREAM <http://…/fb> [RANGE 60m STEP 10m] WHERE { :Alice posts [ sr:discusses ?p ] } p1 p3 p5 p8 p2 p4 p7 p6 Transitive property Alice posts p1 . SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 29
  • 30. Not so naïve approach to stream reasoning • The problem is that materialization (the result of data-driven processing) are very difficult to decrement efficiently. – State-of-the-art: DRed algorithm • Over delete • Re-derive • Insert Reasoner Inferred data ontology window insertions deletions Incremental !!! SPARQL SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 30
  • 31. Is DRed needed? • DRed works with random insertions and deletions • In a streaming setting, when a triple enters the window, given the size of the window, the reasoner knows already when it will be deleted! • E.g., – if the window is 40 minutes long, and, – it is 10:00, the triple(s) entering now – will exit on 10:40. • Conclusion – deletions are predictable Time Enter window Exit window Explicitly in window Infer win 10:00 AßB 10:10 BßC 10:20 AßE 10:30 EßC 10:40 AßB 10:50 BßC 11:00 AßE A B A B C A A B C E A A B C E A A C E A A B C E A C E SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 31
  • 32. IMaRS algorithm • Idea: – add an expiration time to each triple and – use an hash table to index triples by their expiration time • The algorithm 1. deletes expired triples 2. Adds the new derivations that are consequences of insertions annotating each inferred triple with an expiration time (the min of those of the triple it is derived from), and 3. when multiple derivations occur, for each multiple derivation, it keeps the max expiration time. SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 32
  • 33. IMaRS algorithm • Incremental Reasoning on RDF streams (IMaRS): new reasoning algorithm optimized for reactive query answering D.F. Barbieri, D. Braga, S.Ceri, E. Della Valle, M. Grossniklaus: Incremental Reasoning on Streams and Rich Background Knowledge. ESWC (1) 2010: 1-15 D. Dell'Aglio, E. Della Valle: Incremental Reasoning on RDF Streams. In A.Harth, K.Hose, R.Schenkel (Eds.) Linked Data Management, CRC Press 2014, ISBN 9781466582408 § Re-materialize after each window slide § Use DRed § IMaRS % of deletions w.r.t. the content of the window SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 33
  • 34. IMaRS algorithm • comparison of the average time needed to answer a C-SPARQL query, when 2% of the content exits the window each time it slides, using – A backward reasoner on the window content – DRed + standard SPARQL on the materialization – IMaRS + standard SPARQL on the materialization SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 34
  • 35. Some alternatives to IMaRS • TROWL – How: DRed in the context of approximate reasoning – Ref: Y. Ren, J. Z. Pan. Optimising ontology stream reasoning with truth maintenance system. In CIKM (2011) • Sparkwave – How: extended RETE algorithm for windows and RDFS – Ref: Sparkwave: Continuous Schema-Enhanced Pattern Matching over RDF Data Streams. Komazec S, Cerri D. DEBS 2012 • The Backward/Forward Algorithm – How: optimizing DRed – B. Motik, Y. Nenov, R.E.F. Piro, I. Horrocks: Incremental Update of Datalog Materialisation: the Backward/Forward Algorithm. AAAI 2015: 1560-1568 SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 35
  • 36. WINDOW LEVEL ENTAILMENT SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 36
  • 37. window level entailment • Graph-level entailment considers data item contents, but it does not use the temporal annotations • Window-level entailment applies the inference process on the non-merged stream items. • E.g., – A door cannot be open and close at the same time – A window contains: door A is open @1, door A is close @2 – At graph-level the reasoner tells that there is an inconsistency in the window because it ignore the parts in italics – At window-level the reasoner does not • Best approaches I saw so far – ÖL Özçep, R Möller. Ontology Based Data Access on Temporal and Streaming Data. Reasoning Web, 2014 – D. Dell'Aglio et al. :A Query Model to Capture Event Pattern Matching in RDF Stream Processing Query Languages EKAW 2016: 145-162 SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 37
  • 38. STREAM LEVEL ENTAILMENT SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 38
  • 39. Stream-level entailment • Window-level entailment only considers a recent portion of the stream • Stream-level entailment aims at considering the entire stream • This is not just a theoretical dream, CEP does so – E.g., rise C for every A that follows a B without a C in the middle • Best approaches I saw so far – ETALIS • Anicic, D., Rudolph, S., Fodor, P., & Stojanovic, N. Stream reasoning and complex event processing in ETALIS Semantic Web,3(4), 2012, 397–407. – LARS • H. Beck, M. Dao-Tran, T. Eiter, M. Fink: LARS: A Logic-Based Framework for Analyzing Reasoning over Streams. AAAI 2015: 1431-1438H. SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 39
  • 40. OPEN PROBLEMS AND CHALLENGES FOR THE NEXT YEARS SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 40
  • 41. Requirements vs. state-of-the-art Requirement SotA@2017Q2 volume velocity variety incompleteness noise reactive answers fine-grained information access complex domain models high-level languages SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 41
  • 42. Towards queries that capture user needs • Capturing a wider set of tasks in queries – Descriptive analytics • Which electricity-producing turbine has sensor readings similar (i.e., Pearson correlated by at least 0.75) to any turbine that subsequently had a critical failure in the past year? – Predictive analytics (Machine Learning) • Where am I likely going to run into a traffic jam during my commute tonight and how long will it take, given current weather and traffic conditions? – Geo-spatial reasoning – Including preferences – Exploiting the graph nature of the data SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 42
  • 43. Towards queries that capture user needs • Tightening CEP and ontological reasoning – ETALIS/EP-SPARQL is the corner stone with high- expressivity but very low efficiency – Work in progress: SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 43 CEP DL RDF STREAM PROCESSING RAW STREAM PROCESSING Abstraction Selection Annotation Reasoning Querying Rewriting R. Tommasini, P. Bonte, E. Della Valle, E. Mannens, F. De Turck, F. Ongenae: Towards Ontology-Based Event Processing. OWLED 2016: 115-127
  • 44. Towards queries that capture user needs • Forgetting knowledge – Is IMaRS semantics what users want? – Consumption ≠ expiration • Consumed facts are not useful for processing • Expired facts are not true anymore – Should we introduce • a notion of semantic importance? • a notion of time-annotated facts? • … SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 44
  • 45. Towards sophisticated Stream Reasoning • Extending the range of logical formalisms – Pioneers: • Metric Temporal logic – F. Heintz and P. Doherty, DyKnow: An approach to middleware for knowledge processing, Journal of Intelligent and 11 Fuzzy Systems 15(1) (2004), 3–13. • Answer Set Programming (ASP) – A. Mileo, A. Abdelrahman, S. Policarpio and M. Hauswirth, StreamRule: A nonmonotonic stream reasoning system for the semantic web, in: RR, LNCS, Vol. 7994, Springer, 2013, pp. 247–252. – More alternatives: • action logics • step logics • active logics • event calculus SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 45
  • 46. Towards sophisticated Stream Reasoning • Integrating other types of reasoning/processing – My contribution SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 46 D.F. Barbieri, D. Braga, S. Ceri, E. Della Valle, Y. Huang, V. Tresp, A. Rettinger, H. Wermser: Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics. IEEE Intelligent Systems 25(6): 32-41 (2010)
  • 47. Towards sophisticated Stream Reasoning • Integrating other types of reasoning/processing – Other options • More on inductive stream reasoning • Probabilistic reasoning • Planning • Natural Language Processing • Sentiment analysis • … SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 47
  • 48. Towards sophisticated Stream Reasoning • Towards semantic streams – Describe data stream to make them discoverable, composable, etc. – My work in progress • Y. A. Sedira, R.Tommasini, E. Della Valle: Towards VoIS: A Vocabulary of Interlinked Streams. DeSemWeb@ISWC 2017 – There is much more to describe, e.g. • Stream of state (the door is close) vs. Stream of actions (the door was closed) SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 48
  • 49. Towards sophisticated Stream Reasoning • Towards scalable stream reasoners – Rewriting • e.g., MorphStream – How: rewriting in DSMS languages (one at a time) – Ref: Calbimonte, J.-P., Corcho, O., & Gray, A. J. G. Enabling ontology- based access to streaming data sources. In ISWC, 2010, pages 96–111. – Using parallelization and distribution • e.g., DynamiTE – How: Truth maintenance for DF (a fragment of RDFS) – J. Urbani, A. Margara, C. J. H. Jacobs, F. van Harmelen, H.E. Bal: DynamiTE: Parallel Materialization of Dynamic RDF Data. ISWC (1) 2013: 657-672 – Using approximation • e.g., TR-OWL – How: Truth maintenance for EL++ with syntactic approximations – Ref: Y. Ren, J. Z. Pan. Optimising ontology stream reasoning with truth maintenance system. In CIKM (2011) SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 49
  • 50. Towards robustness to imperfect data • Overcoming types of heterogeneity that do not exist in the static reasoning settings – It is not just extending OBDA to continuous queries – There are more sources of heterogeneity than in the static database settings • Different execution semantics • No possibility for a priori homogenous – Synchronisation* – discretization • … * Pioneered in F.Heintz, DyKnow: A stream Based Knowledge Processing Middleware Framework, PhD thesis,Linköping University, Sweden, 2009. SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 50
  • 51. Towards robustness to imperfect data • Copying with noise – Data as signals instead of data as truths! – Easy to state, hard to achieve – Still of paramount importance! – Notable approaches I run into • Matthias Nickles, Alessandra Mileo: Web Stream Reasoning Using Probabilistic Answer Set Programming. RR 2014: 197-205 • Anastasios Skarlatidis, Georgios Paliouras, Alexander Artikis, George A. Vouros: Probabilistic Event Calculus for Event Recognition. ACM Trans. Comput. Log. 16(2): 11:1-11:37 (2015) • Anni-Yasmin Turhan, Erik Zenker: Towards Temporal Fuzzy Query Answering on Stream-based Data. HiDeSt@KI 2015: 56-69 SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 51
  • 52. Conclusions SR 2015, Austria - 3.11.2015 @manudellavalle - http://emanueledellavalle.org 52 Stream Processing Event ProcessingWindow merge Window operator Streams Graph-level entailment Window-level entailment Stream-level entailment Application StreamReasoning Very well only some initial studies still very open New abstractions to capture user needs Application
  • 53. Stream Reasoning: a summary of ten years of research and a vision for the next decade http://content.iospress.com/articles/data-science/ds006 Q/A Emanuele Della Valle DEIB - Politecnico di Milano @manudellavalle emanuele.dellavalle@polimi.it http://emanueledellavalle.org Linköping University, Sweden - 5.10.2017

Editor's Notes

  1. È possibile dare un senso in tempo reale a multipli stream di dati eterogenei, enormi ed inevitabilmente rumorosi e incompleti per supportare I processi decisionali di un gran numero di utenti?
  2. È possibile dare un senso in tempo reale a multipli stream di dati eterogenei, enormi ed inevitabilmente rumorosi e incompleti per supportare I processi decisionali di un gran numero di utenti?
  3. https://twitter.com/BarackObama/status/266031293945503744
  4. RICORDARE CAMBIO SEMANTICA!!!! Csparql language extends sparql in every 3 parts of query forms Query form -> STREAM CLAUSE to create a RDF stream as query results Datasert clause -> FROM STREAM clause added to let engine get data from RDF streams specified by URI Where Clause -> built in timestamp function to retrieve the timestamp of every single triple in the engine