Predictive modelling is a complex process that requires a number of steps to transform raw data into predictions. Preprocessing of the input data is a key step in such process, and the selection of proper preprocessing methods is often a labour intensive task. Such methods are usually trained offline and their parameters remain fixed during the whole model deployment lifetime. However, preprocessing of non-stationary data streams is more challenging since the lack of adaptation of such preprocessing methods may degrade system performance. In addition, dependencies between different predictive system components make the adaptation process more challenging. In this paper we discuss the effects of change propagation resulting from using adaptive preprocessing in a Multicomponent Predictive System (MCPS). To highlight various issues we present four scenarios with different levels of adaptation. A number of experiments have been performed with a range of datasets to compare the prediction error in all four scenarios. Results show that well managed adaptation considerably improves the prediction performance. However, the model can become inconsistent if adaptation in one component is not correctly propagated throughout the rest of system components. Sometimes, such inconsistency may not cause an obvious deterioration in the system performance, therefore being difficult to detect. In some other cases it may even lead to a system failure as was observed in our experiments.
Effects of change propagation resulting from adaptive preprocessing in multicomponent predictive systems
1. Effects of change propagation resulting from adaptive
preprocessing in multicomponent predictive systems
Manuel Martín Salvador, Marcin Budka, Bogdan Gabrys
{msalvador,mbudka,bgabrys}@bournemouth.ac.uk
Data Science Institute. Bournemouth University
KES-2016, York, UK
September 7th, 2016
6. Data streams
“Infinite” number of records
Continuously arriving to the system
at different or same rates
Can be stationary or evolving
7. Data streams
Examples:
● Sensors in manufacturing industry
● Traffic monitoring sensors
● Event logs in websites
● Transactions in the financial sector
“Infinite” number of records
Continuously arriving to the system
at different or same rates
Can be stationary or evolving
A single engine of Airbus A320
has more than 1000 sensors
generating 10GB/s!!
9. Data Stream
Data stream learning for online prediction
Predictive
Model
Online Supervised Learning Algorithm
Predictions
True labels
t+k
t
10. Data Stream
Data stream learning for online prediction
Predictive
Model
PredictionsPreprocessing Postprocessing
Multicomponent Predictive System (MCPS)
11. MCPS composition
Manual
● WEKA
● RapidMiner
● Knime
● IBM SPSS
Automatic
● Auto-WEKA (Bayesian optimisation)
● Auto-sklearn (Bayesian optimisation + Meta-learning)
● TPOT (Genetic programming)
● e-Lico IDA (Ontologies + Planning)
Example of WEKA workflow
13. Formalising MCPS
o prediction
i
place
transition
Well-handled and Acyclic Workflow Petri net (WA-WF-net)
MCPS = (P, T, F)
“Automatic composition and optimisation of multicomponent predictive systems”
@ IEEE TNNLS (under review) http://bit.ly/automatic-mcps-tnnls
17. Need of model adaptation
Streaming error (mean over last 10 samples)
SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Wrongly classified
18. Need of preprocessing adaptation
Streaming error (mean over last 10 samples)
SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Wrongly classified
(out of [0,1])
New hyperboxes
19. Main strategies for MCPS adaptation
Adaptation strategies GLOBAL LOCAL
Re-composition Full Partial
Hyperparameter optimisation (keep components) Full Partial
Parameterisation (keep components and hyperparameters) Full Partial
20. Main strategies for MCPS adaptation
Adaptation strategies GLOBAL LOCAL
Re-composition Full Partial
Hyperparameter optimisation (keep components) Full Partial
Parameterisation (keep components and hyperparameters) Full Partial
“Adapting Multicomponent Predictive Systems using Hybrid Adaptation
Strategies with Auto-WEKA in Process Industry” @ AutoML / ICML 2016
http://bit.ly/adapting-mcps-paper
This work!
21. Need of change propagation
Streaming error (mean over last 10 samples)
SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Inconsistent hyperboxes
due to a different input space
28. Experiments
Name # Attr # Class Type
SYN 2 2 Synthetic
ELEC 7 2 Real
COVERTYPE 54 7 Real
GAS 128 6 Real
Datasets Scenarios
Id
Adap.
Model
Adap.
Prepro.
Change
Propagation
#1 No No No
#2 Yes No No
#3 Yes Yes No
#4 Yes Yes Yes
First 200 samples for initial training,
rest 400 for testing and online learning
GFMMZ-Score PCA Min-Max
31. Conclusion
Only model adaptation may not be enough to cope with evolving data streams,
adaptive preprocessing should be considered.
However, “blind” adaptation of components can result in inconsistent models or
even in a system crash.
Local adaptation of a component may require adapting further components.
Therefore, a system must be reactive and propagate changes.
The definition of MCPS has been extended to support change propagation using
a new token for meta-data in a coloured Petri net (cMCPS).
32. Future work
Large study to measure the actual cost of adaptation.
Open questions:
● How to handle propagation requiring changes of the Petri net structure?
● How to handle transformations in systems with nonlinear components?
● How to order components to reduce the cost of adaptation?
● Can a meta-data token be removed at an early stage instead of being fully
propagated?