A Mathematical Model for Evaluation of Data Analytics Implementation Alternatives
1. 1
A Mathematical Model for Evaluation of
Data Analytics Implementation
Alternatives
Jānis Grabis, Rūta Pirta
Department of Management Information Technology
Riga Technical University
Kalku 1, Riga, Latvia
grabis@rtu.lv, ruta.pirta@rtu.lv
TEAR: Trends in Enterprise
Architecture Research
2. 22
Problem area and related work
Objective and evaluation process
Optimization model
Evaluation example
Conclusion
Outline
2
3. 33
Importance of data analytics greatly
increases in enterprise information
systems
Proliferation of technological platforms for
data analytics
Problem Area
3
5. 55
Data analytics planning challenges
– Mohammad et al. (2014)
Enterprise architecture and data analytics
– Koehler and Alter (2016)
– Dokhanchi and Nazemi (2015)
Analysis of enterprise architecture models
– Goikoetxea (2004)
– Plataniotis et al. (2013)
Related Work
5
6. 66
To elaborate a model for selecting an
optimal alternative for implementation of
data analytics functionality requested by
users in accordance with EA management
principles
Objective
6
7. 77
Enterprise architecture represents
applications, data entities and
analytical components (e.g., reports),
and EA management principles
– Data entities are maintained by the
applications
– Reports are implemented on the basis
of applications
– Reports require access to data entities.
A number of requests for new reports
has been received
To select on the most suitable
applications for implementation of the
reports
Problem Statement
7
8. 88
Architectural principles
Identify new reports required
Estimate development, maintenance, integration
costs and other parameters
Optimize reports to applications assignments
Implement the reports
Evaluation Process
8
9. 99
Application Description
Enterprise applications Reporting is an internal component, which uses proprietary
technologies. Easy often end-user driven development of
simple reports with limited integration of external sources
and sharing.
Web-based reporting Reports are a part of various web pages and are developed
using open technologies such as HTML5 and JavaScript.
Often used for stand-alone and ad-hoc reports.
Data warehouse An information system specifically built for data analysis and
integrates data from various sources. The technology is
mainly intended for batch processing and requires significant
ramp-up time for new data sources.
Big data A set of technologies for developing advanced reports using
large data volumes and large variety of data formats. It often
requires customer development and specialized knowledge.
Spreadsheets Quick development of simple reports often for individual
users with limited scalability though sharing is possible using
cloud-based spreadsheet development tools.
Technological Platforms for Data
Analytics
9
10. 1010
Decision
– Assign reports (i) to applications (j) - Xij
Objective
– Minimize costs, decentralization penalty and
maximize user preferences
Constraints
– All reports should be implemented
– Applications should have access to data
entities (k) represented in the reports
Model Formulation: Informal
10
11. 1111
Objective function
Constraints
Model Formulation: Formal
11
TC DC MC IC DP UP
11 1
1
I J D
ij j iji j
DC c R X
21 1
1
I J M
ij j iji j
MC c R X
1 1
K J I
kj kjk j
IC c Y
1 1
I J D
j iji j
DP p X
1 1
I J U
ij iji j
UP p X
, , ,ik ij kjX Y i j k
1 1
1
I J
iji j
X
12. 1212
• Relative measures, e.g., planning poker
Development
cost
• EA network complexity measuresIntegration cost
• Is platform accessed by many users?
• Does platform maintain many data entities?
Decentralization
penalty
• Stated preferences
• Related systemsUser preferences
Estimation of Parameters’ Values
12
13. 13
Objective is to demonstrate
application of the proposed model
and to highlight dependence of
decision-making results on
architectural and implementation
considerations
Four new reports and five
implementation platforms
(applications)
Focus on a fleet management
solution
– Customers
– Service request
– Delivery routes
– Invoices
– Payments
– Drivers
– Customer feedback
– Delivery confirmations
– Insurance claims
– Accident claims
– Absence
– Traffic data
– Road safety data
13
Experimental Analysis
15. 1515
# New report Entities
1 Service quality
by driver
Customers, Service request, Delivery routes,
Invoices, Payments, Drivers, Customer feedback,
Delivery confirmations
2 Road safety
impact on
anxiety
Delivery routes, Invoices, Payments, Drivers,
Customer feedback, Delivery confirmations,
Insurance claims, Accident claims, Absence, Traffic
data, Road safety data
3 Route execution
delay causes
Customers, Service request, Delivery routes,
Invoices, Payments, Drivers, Customer feedback,
Delivery confirmations, Insurance claims, Accident
claims, Absence, Traffic data, Road safety data
4 Service on-time
fulfillment
Customers, Service request, Delivery routes,
Invoices, Payments, Drivers, Customer feedback,
Delivery confirmations
Requested Reports
15
16. 1616
Baseline scenario
Emphasis on meeting user preferences
– UP is multiplied by three
Emphasis on satisfying the centralization
principle
– DP is multiplied by two
Low cost of integration
– IC is divided by two
Increased economies of scale
– R1 and R2 are increased two times
Scenarios
16