SlideShare una empresa de Scribd logo
1 de 64
Descargar para leer sin conexión
Process Mining
Past, Present, and Open Challenges
Dirk Fahland (d.fahland@tue.nl)
@dfahland
0
Design vs Actual Use
1
Process Design vs Actual Use
1
2
Process Design vs Actual Use
2
3
Process Design vs Actual Use
“1 returned”
“refund 1”
4
Process Design vs Actual Use
“refund 2”
“refund 1”
5
Actual use… unknown
Hey… what’s
your return
order process?
Just use our app,
send item,
receive money
6
What is Process Mining?
7
process
miningstochastics
operations
manage-
ment &
research
business
process
management
process
automation
&
optimi-
zation
formal methods
& concurrency
theory
business
process
improve-
ment
workflow
manage-
ment
process
science
+
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
… the link between Process Science and Data Science
8©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Discover actual use of a system: read the traces
9
OrderID Activity Time Source Product …
302 Receive Order 09.02 22:15 Web 1, 2, 3 …
412 Receive Order 14.02 22:21 … … …
302 Create Return # 15.02 11:25 App 1 …
302 Create Return # 15.02 11:27 App 2
… … … … … …
302 Receive Package 17.02 9:24 Cam1 1
412 … … … … …
302 Customer Call 18.02 20:13 Anna 2 …
… … … … … …
Traces left by (Information) Systems
event
log
What?
When? Who?
Which case?
10
Traces left by (Information) Systems
event
log
…
Create Return #
Create Return #
Receive Package
Receive Order
Customer Call
Receive Order
…
…
11
OrderID Activity Time
302 Receive Order 09.02 22:15
412 Receive Order 14.02 22:21
302 Create Return # 15.02 11:25
302 Create Return # 15.02 11:27
… … …
302 Receive Package 17.02 9:24
412 … …
302 Customer Call 18.02 20:13
… … …
Process Discovery
event
log
Create Order
Receive Return
Create Return
discover
process
modeldescribes
simple…
…
Customer Call
12
…
Create Return #
Create Return #
Receive Package
Receive Order
Customer Call
Create Order
…
…
Process Mining…
13
A
B
C
DE
p2
end
p4
p3p1
start
Past Present
Open
Challenges
Learning Automata
14
Directly-Follows-Graph
Learning Automata
15
K-TailsDirectly-Follows-Graph
state = “sequences of next k activities”
Mining =
find structure
in these relations
[Cook, Wolf 1995-1998], [Cohen, Maoz 2014]
Learning Concurrency
16
Inductive Miner: B and C concurrent
Learning Concurrency
17
Inductive Miner: B and C concurrent
reveals true frequencies, local repetitions, …
zoom in
Learning Models with Concurrency: ILP Miner
[Werf, Dongen, Hurkens, Serebrenik 2009]
18
A
B
C
DE
ABCD
ACBD
AED
D must happen before B
 prevents traces #1 and #2
 don’t add placeA must happen before B or E
 allows all traces
 add place
 encode as ILP problem
Learning Models with Concurrency: ILP Miner
[Werf, Dongen, Hurkens, Serebrenik 2009]
19
A
B
C
DE
p2
end
p4
p3p1
start
ABCD
ACBD
AED
Alpha Algorithm: construct places based on binary relations (derived from directly-follows graph)
[Aalst, Weijters, Maruster 2004]
Precise Semantics and “Messy” Data
20
Road
Traffic
Fines Log
ILP Miner: fitting, but complexAlpha Miner: “unsound” (no proper behavior)
Less precise: the Visual Approach
21
Directly & Eventually Follows Relation:
thresholds for filtering edges + structural simplification
Heuristics Miner
[Agrawal, Gunopulos, Leymann 1998]
[Weijters, Aalst 2001]
Road
Traffic
Fines Log
Many Process Discovery Algorithms…
alpha ILP
Heuristics
Transition
System
Fuzzy Disco
22
… and the Challenges of Real-Life Data
ILP
Transition
Systemalpha
Heuristics
Fuzzy Disco
show/hide
details
23
…but concurrency matters for…
frequencies, performance analysis, simplicity
24
2hrs
7hrs
1.6hrs
4.5hrs
How to get correct models on real data?
25
A
B
C
DE
p2
end
p4
p3p1
start
Past Present
Open
Challenges
Quality and Forces in Process Discovery
log
process
model
positive
examples
only
26
Quality and Forces in Process Discovery
[Buijs, van Dongen, Aalst 2014]
log
process
model
ensure fitness
generalize
increase
precision
simple models
27
The Process Discovery Problem
event
log
discover process
model
fitting and precise
can rediscover (generalizes)
Simple,
Sound,
Semantics
Analysis
29
Basic Process Discovery Principle
extract
behavioral
specification
synthesize
process
model
process
model
30
event
log
Bottom-Up Discovery: Directly-Follows Relation
ACDE
ADCE
ADECFDE
BDEC
BCDEFDE
BDEFCDE
A
B
C
D E
F
33
Dominant Behavioral Relation: Sequence Cut
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F

A B C
D E
F
34
Split Along Cut & Recurse
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F

A B C
D E
F
35
Choice Cut & Base Case
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F


A B C
D E
F
36
C
D E
F
Parallel Cut
A
B


A B
C
D E
F
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
37

D E
F
C
Loop Cut
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F


A B

38
D E
F
C
Loop Cut
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F


A B

39

D E
F
C
… until All Bases Reached
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F


A B


40

D E
F
C
Sequence, Choice, Parallel, Loop (or “Flower”)
A CDE
A DCE
A DECFDE
B DEC
B CDEFDE
B DEFCDE
A
B
C
D E
F


A B


41

D E
F
C
Process Tree = Block-Structured Model


A B



A B
F

A B


C

C
D
E
D
E


F
42
Inductive Miner
sound, fitting models (+/- filtering)
 allows for reliable analysis of behavior
[Leemans, Fahland, Aalst 2013-2015]
44
Inductive Miner
sound, fitting models (+/- filtering)
 allows for reliable analysis of behavior
[Leemans, Fahland, Aalst 2013-2015]
adding details
59000x
credit collection
4000x
appeal
45
Inductive Miner
sound, fitting models (+/- filtering)
 allows for reliable analysis of behavior
[Leemans, Fahland, Aalst 2013-2015]
Animate flow of cases
46
Highlight deviations
47
Inductive Miner
sound, fitting models (+/- filtering)
 allows for reliable analysis of behavior
[Leemans, Fahland, Aalst 2013-2015]
Analyze performance
87 days until
fine is sent
Combining Process Mining and Data Mining
[Leoni et al 2013]
48
conditions for choices:
“Appeal to Judge” if amount  36 EUR
Where are we now? Process Mining Software
 www.promtools.org
49
1500+ plug-ins available covering the whole
process mining spectrum
>150k downloads
©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
Commercial Uptake
50©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
IEEE CIS Taskforce on Process Mining
https://www.win.tue.nl/ieeetfpm/
51
So, has the problem been cracked?
52
A
B
C
DE
p2
end
p4
p3p1
start
Past
Open
Challenges
So, has the problem been cracked?
53
A
B
C
DE
p2
end
p4
p3p1
start
Past Present
Processes may follow many
different variants
54
Purchasing
Process
All variants in one model  very imprecise
55
Cluster traces based
on similarity of event context
[Lu et al. 2015-2017]
56
Cluster traces based
on similarity of event context
[Lu et al. 2015-2017]
Put events into data context: decompose
[van Eck, Sidorova, Aalst 2016]
57
Create Sales Order Position
Creating Invoice
Put events into data context: decompose
[Aalst, Kalenkova, Rubin, Verbeek 2014]
59
Register, Select Flight, Select Hotel, Book Flight, Book Hotel, Pay
Register, Select Flight, Select Hotel, Book Hotel, Book Flight, Pay
Register, Select Flight, Book Flight, Select Hotel, Book Hotel, Pay
Register, Select Flight, Select Hotel, Cancel
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Book Hotel, Pay
Put events into data context: decompose & recompose
[Aalst, Kalenkova, Rubin, Verbeek 2014]
60
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Book Hotel, Pay
Put events into data context: decompose & recompose
[Aalst, Kalenkova, Rubin, Verbeek 2014]
61
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Book Flight, Pay
Register, Select Flight, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Cancel
Register, Select Hotel, Book Hotel, Pay
Register, Select Hotel, Book Hotel, Pay
Allows discovering non-block structured models!
Post-Process
62
restructure output of Heuristics Miner
[Augusto et al. 2016]
Process Mining = Discovery + Conformance + Extension +
Log Preprocessing + …
event
log
discover
model of
actual
process
model of
intended
process
check
conformance Deviations between
actual and intended
process
model of
actual
process
model of
intended
process
enriched
model
extend
• Filtering
• Clustering
• Activity identification
• Deviation detection
• Partially ordered
event data
• Event log
visualization
• Database tables
• Database logs
• Event streams
• IoT devices
63
 www.promtools.org
 Find patterns and contexts
• identify variants
• identify independence  concurrency
• aggregate sets of low-level events to high-level activities
 Learn prediction models
• outcomes of a process based on case features
• detect deviations/risks early on
 Mine and integrate domain-knowledge
• Identify patterns/variants/views that fit domain expectations
• Enrich models with domain concepts
Opportunities for Data Mining in Process Mining
64
 Get ProM
• www.promtools.org
 Get event logs
• Real-life event logs
https://data.4tu.nl/repository/collection:event_logs_real
• Synthetic event logs
https://data.4tu.nl/repository/collection:event_logs_synthetic
 Read up on analyses
• Case studies
https://www.win.tue.nl/ieeetfpm/doku.php?id=shared:process_mining_case_studies
• BPI Challenge 2017 (and all previous editions)
https://www.win.tue.nl/bpi/doku.php?id=2017:challenge
 Take a free online course on Process Mining
• https://www.coursera.org/learn/process-mining/
• https://www.futurelearn.com/courses/process-mining
• https://www.futurelearn.com/courses/process-mining-healthcare
 Check the literature list on the next page
How to get started?
65
1. Cook, Jonathan E. and Alexander L. Wolf. “Automating Process Discovery through Event-Data Analysis.” 1995 17th International Conference on Software Engineering (1995): 73-73.
2. Cook, Jonathan E. and Alexander L. Wolf. “Discovering Models of Software Processes from Event-Based Data.” ACM Trans. Softw. Eng. Methodol. 7 (1998): 215-249.
3. Cook, Jonathan E. and Alexander L. Wolf. “Event-Based Detection of Concurrency.” (1998). ACM SIGSOFT’98
4. Agrawal, Rakesh, Dimitrios Gunopulos and Frank Leymann. “Mining Process Models from Workflow Logs.” EDBT (1998).
5. Weijters, A J M M and W M P Van Der Aalst. “Process Mining Discovering Workflow Models from Event-Based Data.” (2001).
6. Maruster, Laura, A. J. M. M. Weijters, Wil M. P. van der Aalst and Antal van den Bosch. “Process Mining: Discovering Direct Successors in Process Logs.” Discovery Science (2002).
7. Aalst, Wil M. P. van der, A. J. M. M. Weijters and Laura Maruster. “Workflow mining: discovering process models from event logs.” IEEE Transactions on Knowledge and Data
Engineering 16 (2004): 1128-1142.
8. Jan Martijn E. M. van der Werf, Boudewijn F. van Dongen, Cor A. J. Hurkens, Alexander Serebrenik: Process Discovery using Integer Linear Programming. Fundam. Inform. 94(3-4):
387-412 (2009)
9. Sander J. J. Leemans, Dirk Fahland, Wil M. P. van der Aalst: Discovering Block-Structured Process Models from Event Logs - A Constructive Approach. Petri Nets 2013: 311-329
10. Adriano Augusto, Raffaele Conforti, Marlon Dumas, Marcello La Rosa, Giorgio Bruno: Automated Discovery of Structured Process Models: Discover Structured vs. Discover and
Structure. ER 2016: 313-329
11. Wil M. P. van der Aalst, Anna A. Kalenkova, Vladimir A. Rubin, Eric Verbeek: Process Discovery Using Localized Events. Petri Nets 2015: 287-308
12. Cohen, Hila and Shahar Maoz. “The confidence in our k-tails.” ASE (2014).
13. Xixi Lu, Dirk Fahland, Wil M. P. van der Aalst: Interactively Exploring Logs and Mining Models with Clustering, Filtering, and Relabeling. BPM (Demos) 2016: 44-49
14. Xixi Lu, Dirk Fahland: A Conceptual Framework for Understanding Event Data Quality for Behavior Analysis. ZEUS 2017: 11-14
15. Xixi Lu, Dirk Fahland, Frank J. H. M. van den Biggelaar, Wil M. P. van der Aalst: Detecting Deviating Behaviors Without Models. Business Process Management Workshops 2015:
126-139
16. Maikel L. van Eck, Natalia Sidorova, Wil M. P. van der Aalst: Discovering and Exploring State-Based Models for Multi-perspective Processes. BPM 2016: 142-157
17. Massimiliano de Leoni, Marlon Dumas, Luciano García-Bañuelos: Discovering Branching Conditions from Business Process Execution Logs. FASE 2013: 114-129
18. Massimiliano de Leoni, Wil M. P. van der Aalst: Data-aware process mining: discovering decisions in processes using alignments. SAC 2013: 1454-1461
19. Joos C. A. M. Buijs, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Quality Dimensions in Process Discovery: The Importance of Fitness, Precision, Generalization and Simplicity.
Int. J. Cooperative Inf. Syst. 23(1) (2014)
20. Arya Adriansyah, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Conformance Checking Using Cost-Based Fitness Analysis. EDOC 2011: 55-64
21. Jorge Munoz-Gama, Josep Carmona: A Fresh Look at Precision in Process Conformance. BPM 2010: 211-226
22. Arya Adriansyah, Jorge Munoz-Gama, Josep Carmona, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Measuring precision of modeled behavior. Inf. Syst. E-Business
Management 13(1): 37-67 (2015)
Literature
66
Process Mining
Past, Present, and Open Challenges
Dirk Fahland (d.fahland@tue.nl)
@dfahland
67

Más contenido relacionado

Similar a Process Mining Past, Present and Open Challenges

Charting New Waters: Data Integration Excellence for Port & Marine Operations
Charting New Waters: Data Integration Excellence for Port & Marine OperationsCharting New Waters: Data Integration Excellence for Port & Marine Operations
Charting New Waters: Data Integration Excellence for Port & Marine Operationsmarketing932765
 
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Keeping Your DevOps Transformation From Crushing Your Ops Capacity Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Keeping Your DevOps Transformation From Crushing Your Ops Capacity Rundeck
 
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...InfluxData
 
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...Amazon Web Services
 
How to run a user-centered, requirements gathering workshop
How to run a user-centered, requirements gathering workshopHow to run a user-centered, requirements gathering workshop
How to run a user-centered, requirements gathering workshopFergus Roche
 
Iwsm2014 transforming dust into pots of gold (alain abran)
Iwsm2014   transforming dust into pots of gold (alain abran)Iwsm2014   transforming dust into pots of gold (alain abran)
Iwsm2014 transforming dust into pots of gold (alain abran)Nesma
 
Life Cycle of Metrics, Alerting, and Performance Monitoring in Microservices
Life Cycle of Metrics, Alerting, and Performance Monitoring in MicroservicesLife Cycle of Metrics, Alerting, and Performance Monitoring in Microservices
Life Cycle of Metrics, Alerting, and Performance Monitoring in MicroservicesSean Chittenden
 
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...DVClub
 
Critical software developement
Critical software developementCritical software developement
Critical software developementnedseb
 
Reactive Stream Processing Using DDS and Rx
Reactive Stream Processing Using DDS and RxReactive Stream Processing Using DDS and Rx
Reactive Stream Processing Using DDS and RxSumant Tambe
 
The art of the event streaming application: streams, stream processors and sc...
The art of the event streaming application: streams, stream processors and sc...The art of the event streaming application: streams, stream processors and sc...
The art of the event streaming application: streams, stream processors and sc...confluent
 
Kafka summit SF 2019 - the art of the event-streaming app
Kafka summit SF 2019 - the art of the event-streaming appKafka summit SF 2019 - the art of the event-streaming app
Kafka summit SF 2019 - the art of the event-streaming appNeil Avery
 
Soa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crSoa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crVasily Demin
 
Kansas Elsas Klint 2011
Kansas Elsas Klint 2011Kansas Elsas Klint 2011
Kansas Elsas Klint 2011Philip Elsas
 
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...Docker, Inc.
 
Rhf2019 how totackle barriersofapplicationmodernization_ap16_en
Rhf2019 how totackle barriersofapplicationmodernization_ap16_enRhf2019 how totackle barriersofapplicationmodernization_ap16_en
Rhf2019 how totackle barriersofapplicationmodernization_ap16_enMasahiko Umeno
 
Python business intelligence (PyData 2012 talk)
Python business intelligence (PyData 2012 talk)Python business intelligence (PyData 2012 talk)
Python business intelligence (PyData 2012 talk)Stefan Urbanek
 
Value stream mapping for DevOps
Value stream mapping for DevOpsValue stream mapping for DevOps
Value stream mapping for DevOpsMarc Hornbeek
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systemsnet2-project
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeDataWorks Summit
 

Similar a Process Mining Past, Present and Open Challenges (20)

Charting New Waters: Data Integration Excellence for Port & Marine Operations
Charting New Waters: Data Integration Excellence for Port & Marine OperationsCharting New Waters: Data Integration Excellence for Port & Marine Operations
Charting New Waters: Data Integration Excellence for Port & Marine Operations
 
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Keeping Your DevOps Transformation From Crushing Your Ops Capacity Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
 
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
 
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...
Status Quo is Death: nib health funds’ Innovative Journey to the Cloud: AWS S...
 
How to run a user-centered, requirements gathering workshop
How to run a user-centered, requirements gathering workshopHow to run a user-centered, requirements gathering workshop
How to run a user-centered, requirements gathering workshop
 
Iwsm2014 transforming dust into pots of gold (alain abran)
Iwsm2014   transforming dust into pots of gold (alain abran)Iwsm2014   transforming dust into pots of gold (alain abran)
Iwsm2014 transforming dust into pots of gold (alain abran)
 
Life Cycle of Metrics, Alerting, and Performance Monitoring in Microservices
Life Cycle of Metrics, Alerting, and Performance Monitoring in MicroservicesLife Cycle of Metrics, Alerting, and Performance Monitoring in Microservices
Life Cycle of Metrics, Alerting, and Performance Monitoring in Microservices
 
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...
Topics in Verification: Reuse, Coverage, Regression Engineering, Planning, Qu...
 
Critical software developement
Critical software developementCritical software developement
Critical software developement
 
Reactive Stream Processing Using DDS and Rx
Reactive Stream Processing Using DDS and RxReactive Stream Processing Using DDS and Rx
Reactive Stream Processing Using DDS and Rx
 
The art of the event streaming application: streams, stream processors and sc...
The art of the event streaming application: streams, stream processors and sc...The art of the event streaming application: streams, stream processors and sc...
The art of the event streaming application: streams, stream processors and sc...
 
Kafka summit SF 2019 - the art of the event-streaming app
Kafka summit SF 2019 - the art of the event-streaming appKafka summit SF 2019 - the art of the event-streaming app
Kafka summit SF 2019 - the art of the event-streaming app
 
Soa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng crSoa12c launch 5 event processing shmakov eng cr
Soa12c launch 5 event processing shmakov eng cr
 
Kansas Elsas Klint 2011
Kansas Elsas Klint 2011Kansas Elsas Klint 2011
Kansas Elsas Klint 2011
 
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
DCEU 18: From Legacy Mainframe to the Cloud: The Finnish Railways Evolution w...
 
Rhf2019 how totackle barriersofapplicationmodernization_ap16_en
Rhf2019 how totackle barriersofapplicationmodernization_ap16_enRhf2019 how totackle barriersofapplicationmodernization_ap16_en
Rhf2019 how totackle barriersofapplicationmodernization_ap16_en
 
Python business intelligence (PyData 2012 talk)
Python business intelligence (PyData 2012 talk)Python business intelligence (PyData 2012 talk)
Python business intelligence (PyData 2012 talk)
 
Value stream mapping for DevOps
Value stream mapping for DevOpsValue stream mapping for DevOps
Value stream mapping for DevOps
 
Borders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic SystemsBorders of Decidability in Verification of Data-Centric Dynamic Systems
Borders of Decidability in Verification of Data-Centric Dynamic Systems
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application code
 

Más de Dirk Fahland

Object-Centric Processes - from cases to objects and relations… and beyond
Object-Centric Processes - from cases to objects and relations… and beyondObject-Centric Processes - from cases to objects and relations… and beyond
Object-Centric Processes - from cases to objects and relations… and beyondDirk Fahland
 
Multi-Dimensional Process Analysis
Multi-Dimensional Process Analysis Multi-Dimensional Process Analysis
Multi-Dimensional Process Analysis Dirk Fahland
 
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...Dirk Fahland
 
Describing, Discovering, and Understanding Multi-Dimensional Processes
Describing, Discovering, and Understanding Multi-Dimensional ProcessesDescribing, Discovering, and Understanding Multi-Dimensional Processes
Describing, Discovering, and Understanding Multi-Dimensional ProcessesDirk Fahland
 
Where did I go wrong? Explaining errors in process models
Where did I go wrong? Explaining errors in process modelsWhere did I go wrong? Explaining errors in process models
Where did I go wrong? Explaining errors in process modelsDirk Fahland
 
Mining Branch-Time Scenarios From Execution Logs
Mining Branch-Time Scenarios From Execution LogsMining Branch-Time Scenarios From Execution Logs
Mining Branch-Time Scenarios From Execution LogsDirk Fahland
 
From Live Sequence Chart Specifications to Distributed Components
From Live Sequence Chart Specifications to Distributed ComponentsFrom Live Sequence Chart Specifications to Distributed Components
From Live Sequence Chart Specifications to Distributed ComponentsDirk Fahland
 
LSC Revisited - From Scenarios to Distributed Components
LSC Revisited - From Scenarios to Distributed ComponentsLSC Revisited - From Scenarios to Distributed Components
LSC Revisited - From Scenarios to Distributed ComponentsDirk Fahland
 
Repairing Process Models to Match Reality
Repairing Process Models to Match RealityRepairing Process Models to Match Reality
Repairing Process Models to Match RealityDirk Fahland
 
Process Mining for ERP Systems
Process Mining for ERP SystemsProcess Mining for ERP Systems
Process Mining for ERP SystemsDirk Fahland
 
Simplifying Mined Process Models
Simplifying Mined Process ModelsSimplifying Mined Process Models
Simplifying Mined Process ModelsDirk Fahland
 
The Process of Process Modeling
The Process of Process ModelingThe Process of Process Modeling
The Process of Process ModelingDirk Fahland
 
Behavioral Conformance of Artifact-Centric Process Models
Behavioral Conformance of Artifact-Centric Process ModelsBehavioral Conformance of Artifact-Centric Process Models
Behavioral Conformance of Artifact-Centric Process ModelsDirk Fahland
 
Many-to-Many: Interactions in Artifact-Centric Choreographies
Many-to-Many: Interactions in Artifact-Centric ChoreographiesMany-to-Many: Interactions in Artifact-Centric Choreographies
Many-to-Many: Interactions in Artifact-Centric ChoreographiesDirk Fahland
 
Artifacts - Processes with Multiple Instances
Artifacts - Processes with Multiple InstancesArtifacts - Processes with Multiple Instances
Artifacts - Processes with Multiple InstancesDirk Fahland
 

Más de Dirk Fahland (15)

Object-Centric Processes - from cases to objects and relations… and beyond
Object-Centric Processes - from cases to objects and relations… and beyondObject-Centric Processes - from cases to objects and relations… and beyond
Object-Centric Processes - from cases to objects and relations… and beyond
 
Multi-Dimensional Process Analysis
Multi-Dimensional Process Analysis Multi-Dimensional Process Analysis
Multi-Dimensional Process Analysis
 
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...
Artifacts and Databases - the Need for Event Relation Graphs and Synchronous ...
 
Describing, Discovering, and Understanding Multi-Dimensional Processes
Describing, Discovering, and Understanding Multi-Dimensional ProcessesDescribing, Discovering, and Understanding Multi-Dimensional Processes
Describing, Discovering, and Understanding Multi-Dimensional Processes
 
Where did I go wrong? Explaining errors in process models
Where did I go wrong? Explaining errors in process modelsWhere did I go wrong? Explaining errors in process models
Where did I go wrong? Explaining errors in process models
 
Mining Branch-Time Scenarios From Execution Logs
Mining Branch-Time Scenarios From Execution LogsMining Branch-Time Scenarios From Execution Logs
Mining Branch-Time Scenarios From Execution Logs
 
From Live Sequence Chart Specifications to Distributed Components
From Live Sequence Chart Specifications to Distributed ComponentsFrom Live Sequence Chart Specifications to Distributed Components
From Live Sequence Chart Specifications to Distributed Components
 
LSC Revisited - From Scenarios to Distributed Components
LSC Revisited - From Scenarios to Distributed ComponentsLSC Revisited - From Scenarios to Distributed Components
LSC Revisited - From Scenarios to Distributed Components
 
Repairing Process Models to Match Reality
Repairing Process Models to Match RealityRepairing Process Models to Match Reality
Repairing Process Models to Match Reality
 
Process Mining for ERP Systems
Process Mining for ERP SystemsProcess Mining for ERP Systems
Process Mining for ERP Systems
 
Simplifying Mined Process Models
Simplifying Mined Process ModelsSimplifying Mined Process Models
Simplifying Mined Process Models
 
The Process of Process Modeling
The Process of Process ModelingThe Process of Process Modeling
The Process of Process Modeling
 
Behavioral Conformance of Artifact-Centric Process Models
Behavioral Conformance of Artifact-Centric Process ModelsBehavioral Conformance of Artifact-Centric Process Models
Behavioral Conformance of Artifact-Centric Process Models
 
Many-to-Many: Interactions in Artifact-Centric Choreographies
Many-to-Many: Interactions in Artifact-Centric ChoreographiesMany-to-Many: Interactions in Artifact-Centric Choreographies
Many-to-Many: Interactions in Artifact-Centric Choreographies
 
Artifacts - Processes with Multiple Instances
Artifacts - Processes with Multiple InstancesArtifacts - Processes with Multiple Instances
Artifacts - Processes with Multiple Instances
 

Último

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 

Último (20)

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 

Process Mining Past, Present and Open Challenges

  • 1. Process Mining Past, Present, and Open Challenges Dirk Fahland (d.fahland@tue.nl) @dfahland 0
  • 3. Process Design vs Actual Use 1 2
  • 4. Process Design vs Actual Use 2 3
  • 5. Process Design vs Actual Use “1 returned” “refund 1” 4
  • 6. Process Design vs Actual Use “refund 2” “refund 1” 5
  • 7. Actual use… unknown Hey… what’s your return order process? Just use our app, send item, receive money 6
  • 8. What is Process Mining? 7 process miningstochastics operations manage- ment & research business process management process automation & optimi- zation formal methods & concurrency theory business process improve- ment workflow manage- ment process science + ©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
  • 9. … the link between Process Science and Data Science 8©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
  • 10. Discover actual use of a system: read the traces 9
  • 11. OrderID Activity Time Source Product … 302 Receive Order 09.02 22:15 Web 1, 2, 3 … 412 Receive Order 14.02 22:21 … … … 302 Create Return # 15.02 11:25 App 1 … 302 Create Return # 15.02 11:27 App 2 … … … … … … 302 Receive Package 17.02 9:24 Cam1 1 412 … … … … … 302 Customer Call 18.02 20:13 Anna 2 … … … … … … … Traces left by (Information) Systems event log What? When? Who? Which case? 10
  • 12. Traces left by (Information) Systems event log … Create Return # Create Return # Receive Package Receive Order Customer Call Receive Order … … 11 OrderID Activity Time 302 Receive Order 09.02 22:15 412 Receive Order 14.02 22:21 302 Create Return # 15.02 11:25 302 Create Return # 15.02 11:27 … … … 302 Receive Package 17.02 9:24 412 … … 302 Customer Call 18.02 20:13 … … …
  • 13. Process Discovery event log Create Order Receive Return Create Return discover process modeldescribes simple… … Customer Call 12 … Create Return # Create Return # Receive Package Receive Order Customer Call Create Order … …
  • 16. Learning Automata 15 K-TailsDirectly-Follows-Graph state = “sequences of next k activities” Mining = find structure in these relations [Cook, Wolf 1995-1998], [Cohen, Maoz 2014]
  • 18. Learning Concurrency 17 Inductive Miner: B and C concurrent reveals true frequencies, local repetitions, … zoom in
  • 19. Learning Models with Concurrency: ILP Miner [Werf, Dongen, Hurkens, Serebrenik 2009] 18 A B C DE ABCD ACBD AED D must happen before B  prevents traces #1 and #2  don’t add placeA must happen before B or E  allows all traces  add place  encode as ILP problem
  • 20. Learning Models with Concurrency: ILP Miner [Werf, Dongen, Hurkens, Serebrenik 2009] 19 A B C DE p2 end p4 p3p1 start ABCD ACBD AED Alpha Algorithm: construct places based on binary relations (derived from directly-follows graph) [Aalst, Weijters, Maruster 2004]
  • 21. Precise Semantics and “Messy” Data 20 Road Traffic Fines Log ILP Miner: fitting, but complexAlpha Miner: “unsound” (no proper behavior)
  • 22. Less precise: the Visual Approach 21 Directly & Eventually Follows Relation: thresholds for filtering edges + structural simplification Heuristics Miner [Agrawal, Gunopulos, Leymann 1998] [Weijters, Aalst 2001] Road Traffic Fines Log
  • 23. Many Process Discovery Algorithms… alpha ILP Heuristics Transition System Fuzzy Disco 22
  • 24. … and the Challenges of Real-Life Data ILP Transition Systemalpha Heuristics Fuzzy Disco show/hide details 23
  • 25. …but concurrency matters for… frequencies, performance analysis, simplicity 24 2hrs 7hrs 1.6hrs 4.5hrs
  • 26. How to get correct models on real data? 25 A B C DE p2 end p4 p3p1 start Past Present Open Challenges
  • 27. Quality and Forces in Process Discovery log process model positive examples only 26
  • 28. Quality and Forces in Process Discovery [Buijs, van Dongen, Aalst 2014] log process model ensure fitness generalize increase precision simple models 27
  • 29. The Process Discovery Problem event log discover process model fitting and precise can rediscover (generalizes) Simple, Sound, Semantics Analysis 29
  • 30. Basic Process Discovery Principle extract behavioral specification synthesize process model process model 30 event log
  • 31. Bottom-Up Discovery: Directly-Follows Relation ACDE ADCE ADECFDE BDEC BCDEFDE BDEFCDE A B C D E F 33
  • 32. Dominant Behavioral Relation: Sequence Cut A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F  A B C D E F 34
  • 33. Split Along Cut & Recurse A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F  A B C D E F 35
  • 34. Choice Cut & Base Case A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F   A B C D E F 36
  • 35. C D E F Parallel Cut A B   A B C D E F A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE 37 
  • 36. D E F C Loop Cut A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F   A B  38
  • 37. D E F C Loop Cut A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F   A B  39 
  • 38. D E F C … until All Bases Reached A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F   A B   40 
  • 39. D E F C Sequence, Choice, Parallel, Loop (or “Flower”) A CDE A DCE A DECFDE B DEC B CDEFDE B DEFCDE A B C D E F   A B   41 
  • 40. D E F C Process Tree = Block-Structured Model   A B    A B F  A B   C  C D E D E   F 42
  • 41. Inductive Miner sound, fitting models (+/- filtering)  allows for reliable analysis of behavior [Leemans, Fahland, Aalst 2013-2015]
  • 42. 44 Inductive Miner sound, fitting models (+/- filtering)  allows for reliable analysis of behavior [Leemans, Fahland, Aalst 2013-2015] adding details 59000x credit collection 4000x appeal
  • 43. 45 Inductive Miner sound, fitting models (+/- filtering)  allows for reliable analysis of behavior [Leemans, Fahland, Aalst 2013-2015] Animate flow of cases
  • 45. 47 Inductive Miner sound, fitting models (+/- filtering)  allows for reliable analysis of behavior [Leemans, Fahland, Aalst 2013-2015] Analyze performance 87 days until fine is sent
  • 46. Combining Process Mining and Data Mining [Leoni et al 2013] 48 conditions for choices: “Appeal to Judge” if amount  36 EUR
  • 47. Where are we now? Process Mining Software  www.promtools.org 49 1500+ plug-ins available covering the whole process mining spectrum >150k downloads ©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
  • 48. Commercial Uptake 50©Wil van der Aalst & TU/e (use only with permission & acknowledgements)
  • 49. IEEE CIS Taskforce on Process Mining https://www.win.tue.nl/ieeetfpm/ 51
  • 50. So, has the problem been cracked? 52 A B C DE p2 end p4 p3p1 start Past Open Challenges
  • 51. So, has the problem been cracked? 53 A B C DE p2 end p4 p3p1 start Past Present
  • 52. Processes may follow many different variants 54 Purchasing Process All variants in one model  very imprecise
  • 53. 55 Cluster traces based on similarity of event context [Lu et al. 2015-2017]
  • 54. 56 Cluster traces based on similarity of event context [Lu et al. 2015-2017]
  • 55. Put events into data context: decompose [van Eck, Sidorova, Aalst 2016] 57 Create Sales Order Position Creating Invoice
  • 56. Put events into data context: decompose [Aalst, Kalenkova, Rubin, Verbeek 2014] 59 Register, Select Flight, Select Hotel, Book Flight, Book Hotel, Pay Register, Select Flight, Select Hotel, Book Hotel, Book Flight, Pay Register, Select Flight, Book Flight, Select Hotel, Book Hotel, Pay Register, Select Flight, Select Hotel, Cancel Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Book Hotel, Pay
  • 57. Put events into data context: decompose & recompose [Aalst, Kalenkova, Rubin, Verbeek 2014] 60 Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Book Hotel, Pay
  • 58. Put events into data context: decompose & recompose [Aalst, Kalenkova, Rubin, Verbeek 2014] 61 Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Book Flight, Pay Register, Select Flight, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Cancel Register, Select Hotel, Book Hotel, Pay Register, Select Hotel, Book Hotel, Pay Allows discovering non-block structured models!
  • 59. Post-Process 62 restructure output of Heuristics Miner [Augusto et al. 2016]
  • 60. Process Mining = Discovery + Conformance + Extension + Log Preprocessing + … event log discover model of actual process model of intended process check conformance Deviations between actual and intended process model of actual process model of intended process enriched model extend • Filtering • Clustering • Activity identification • Deviation detection • Partially ordered event data • Event log visualization • Database tables • Database logs • Event streams • IoT devices 63  www.promtools.org
  • 61.  Find patterns and contexts • identify variants • identify independence  concurrency • aggregate sets of low-level events to high-level activities  Learn prediction models • outcomes of a process based on case features • detect deviations/risks early on  Mine and integrate domain-knowledge • Identify patterns/variants/views that fit domain expectations • Enrich models with domain concepts Opportunities for Data Mining in Process Mining 64
  • 62.  Get ProM • www.promtools.org  Get event logs • Real-life event logs https://data.4tu.nl/repository/collection:event_logs_real • Synthetic event logs https://data.4tu.nl/repository/collection:event_logs_synthetic  Read up on analyses • Case studies https://www.win.tue.nl/ieeetfpm/doku.php?id=shared:process_mining_case_studies • BPI Challenge 2017 (and all previous editions) https://www.win.tue.nl/bpi/doku.php?id=2017:challenge  Take a free online course on Process Mining • https://www.coursera.org/learn/process-mining/ • https://www.futurelearn.com/courses/process-mining • https://www.futurelearn.com/courses/process-mining-healthcare  Check the literature list on the next page How to get started? 65
  • 63. 1. Cook, Jonathan E. and Alexander L. Wolf. “Automating Process Discovery through Event-Data Analysis.” 1995 17th International Conference on Software Engineering (1995): 73-73. 2. Cook, Jonathan E. and Alexander L. Wolf. “Discovering Models of Software Processes from Event-Based Data.” ACM Trans. Softw. Eng. Methodol. 7 (1998): 215-249. 3. Cook, Jonathan E. and Alexander L. Wolf. “Event-Based Detection of Concurrency.” (1998). ACM SIGSOFT’98 4. Agrawal, Rakesh, Dimitrios Gunopulos and Frank Leymann. “Mining Process Models from Workflow Logs.” EDBT (1998). 5. Weijters, A J M M and W M P Van Der Aalst. “Process Mining Discovering Workflow Models from Event-Based Data.” (2001). 6. Maruster, Laura, A. J. M. M. Weijters, Wil M. P. van der Aalst and Antal van den Bosch. “Process Mining: Discovering Direct Successors in Process Logs.” Discovery Science (2002). 7. Aalst, Wil M. P. van der, A. J. M. M. Weijters and Laura Maruster. “Workflow mining: discovering process models from event logs.” IEEE Transactions on Knowledge and Data Engineering 16 (2004): 1128-1142. 8. Jan Martijn E. M. van der Werf, Boudewijn F. van Dongen, Cor A. J. Hurkens, Alexander Serebrenik: Process Discovery using Integer Linear Programming. Fundam. Inform. 94(3-4): 387-412 (2009) 9. Sander J. J. Leemans, Dirk Fahland, Wil M. P. van der Aalst: Discovering Block-Structured Process Models from Event Logs - A Constructive Approach. Petri Nets 2013: 311-329 10. Adriano Augusto, Raffaele Conforti, Marlon Dumas, Marcello La Rosa, Giorgio Bruno: Automated Discovery of Structured Process Models: Discover Structured vs. Discover and Structure. ER 2016: 313-329 11. Wil M. P. van der Aalst, Anna A. Kalenkova, Vladimir A. Rubin, Eric Verbeek: Process Discovery Using Localized Events. Petri Nets 2015: 287-308 12. Cohen, Hila and Shahar Maoz. “The confidence in our k-tails.” ASE (2014). 13. Xixi Lu, Dirk Fahland, Wil M. P. van der Aalst: Interactively Exploring Logs and Mining Models with Clustering, Filtering, and Relabeling. BPM (Demos) 2016: 44-49 14. Xixi Lu, Dirk Fahland: A Conceptual Framework for Understanding Event Data Quality for Behavior Analysis. ZEUS 2017: 11-14 15. Xixi Lu, Dirk Fahland, Frank J. H. M. van den Biggelaar, Wil M. P. van der Aalst: Detecting Deviating Behaviors Without Models. Business Process Management Workshops 2015: 126-139 16. Maikel L. van Eck, Natalia Sidorova, Wil M. P. van der Aalst: Discovering and Exploring State-Based Models for Multi-perspective Processes. BPM 2016: 142-157 17. Massimiliano de Leoni, Marlon Dumas, Luciano García-Bañuelos: Discovering Branching Conditions from Business Process Execution Logs. FASE 2013: 114-129 18. Massimiliano de Leoni, Wil M. P. van der Aalst: Data-aware process mining: discovering decisions in processes using alignments. SAC 2013: 1454-1461 19. Joos C. A. M. Buijs, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Quality Dimensions in Process Discovery: The Importance of Fitness, Precision, Generalization and Simplicity. Int. J. Cooperative Inf. Syst. 23(1) (2014) 20. Arya Adriansyah, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Conformance Checking Using Cost-Based Fitness Analysis. EDOC 2011: 55-64 21. Jorge Munoz-Gama, Josep Carmona: A Fresh Look at Precision in Process Conformance. BPM 2010: 211-226 22. Arya Adriansyah, Jorge Munoz-Gama, Josep Carmona, Boudewijn F. van Dongen, Wil M. P. van der Aalst: Measuring precision of modeled behavior. Inf. Syst. E-Business Management 13(1): 37-67 (2015) Literature 66
  • 64. Process Mining Past, Present, and Open Challenges Dirk Fahland (d.fahland@tue.nl) @dfahland 67