SlideShare una empresa de Scribd logo
1 de 11
Big Data Technology Insights
Andreas Metzger
(paluno, TT Technical Coordinator)
TT Methodology
Rationale
• “No free lunch”
– Each data set, domain, use case is different
– Using a single data analytics solution will
most probably not work
 For each of the 13 Pilots
– Data analytics solutions best suited for requirement and datasets
– Dedicated infrastructures best linked to data sources
Reuse of best practices, common requirements,
lessons learned, …
– Within pilots, across pilots, beyond project
TT, A. Metzger, Riga 2019 2
TT Methodology
3-Stage validation and scale-up
Stage Embedding Scale of Data
Technology
Validation
Problem understanding and
validation of key solution ideas
(Historic) data pinpointing
problems and opportunities
Large-scale
Experiments
Controlled environment (not
productive environment)
Large historic and real-time data,
possibly anonymized / simulated
In-situ (on site)
trials
Trials in the field, involving actual
end-users
Real-time, live production data
complementing historic data
TT, A. Metzger, Riga 2019 3
TT Technical Results
TT, A. Metzger, Riga 2019 4
5 main technical priorities
with individual sub-topics
Coverage of BDVA SRIA
[S. Zillner, E. Curry, A. Metzger, R. Seidl (Eds.), “European big
data value strategic research and innovation agenda (SRIA),”
Version 4.0, October, 2017]
TT, A. Metzger, Riga 2019 5
D10.4
Data ManagementSemantic Annotation of unstructured and
semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3
Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3
Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data
governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business
processes 3 2 3 4 4 4 4 4 4 4 4 4 4
Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data
management 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Processing Architectures
Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4
Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at-
rest 4 4 4 4 4 4 4 4 4 4 4 4 4
Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4
Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new
types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4
Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data Analytics
Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2
Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4
Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and
intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2
Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics
(HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2
Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data ProtectionGeneric and easy to use data protection
approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party
computation) 4 4 4 4 4 4 4 4 4 4 4 4 4
Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Visualisation and User Interaction
Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple
scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive
visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and
querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2
Other (specify)
D9.4D5.4 D6.4 D7.4D4.4 D8.4
1 = Main focus
2 = Topic addressed
(but not main focus)
3 = Topic marginally
addressed
4 = Topic not addressed
Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Visualisation and User Interaction
Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple
scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive
visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and
querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2
Other (specify)
Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4
Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at-
rest 4 4 4 4 4 4 4 4 4 4 4 4 4
Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4
Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new
types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4
Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data Analytics
Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2
Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4
Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and
intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2
Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics
(HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2
Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data ProtectionGeneric and easy to use data protection
approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party
computation) 4 4 4 4 4 4 4 4 4 4 4 4 4
Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Visualisation and User Interaction
Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple
scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive
visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and
querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2
Other (specify)
D10.4
Data ManagementSemantic Annotation of unstructured and
semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3
Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3
Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data
governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business
processes 3 2 3 4 4 4 4 4 4 4 4 4 4
Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data
management 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Processing Architectures
Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4
Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at-
rest 4 4 4 4 4 4 4 4 4 4 4 4 4
Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4
Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new
types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4
Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data Analytics
Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2
Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4
Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and
intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2
Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics
(HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2
Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data ProtectionGeneric and easy to use data protection
approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party
computation) 4 4 4 4 4 4 4 4 4 4 4 4 4
Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Visualisation and User Interaction
Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple
scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive
D9.4D5.4 D6.4 D7.4D4.4 D8.4
semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3
Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3
Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data
governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business
processes 3 2 3 4 4 4 4 4 4 4 4 4 4
Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data
management 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Processing Architectures
Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4
Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at-
rest 4 4 4 4 4 4 4 4 4 4 4 4 4
Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4
Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new
types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4
Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data Analytics
Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2
Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4
Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and
intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2
Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics
(HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2
Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3
Other (specify)
Data ProtectionGeneric and easy to use data protection
approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party
computation) 4 4 4 4 4 4 4 4 4 4 4 4 4
Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Visualisation and User Interaction
Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple
scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive
visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and
querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2
Other (specify)
D10.4
Data ManagementSemantic Annotation of unstructured and
semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3
Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3
Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data
governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business
processes 3 2 3 4 4 4 4 4 4 4 4 4 4
Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data
management 4 4 4 4 4 4 4 4 4 4 4 4 4
Other (specify)
Data Processing Architectures
Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4
D9.4D5.4 D6.4 D7.4D4.4 D8.4
2
1
3
Technical Lessons Learned
Analytics
“Garbage-in – garbage-out”
• Check and cope with missing data,
data accuracy, data timeliness,
different time-zones (clocks), …
Deep Learning works very well
“out of the box”
• Use Deep learning to make more
efficient development and engineering
of big data applications (no need for
extensive hyper-parametrization)
TT, A. Metzger, Riga 2019 6
[A. Metzger & A. Neubauer, “Considering non-sequential
control flows for process prediction with recurrent
neural networks,” in SEAA 2018, Prague, Czech Republic,
IEEE Computer Society]
0,00000
0,10000
0,20000
0,30000
0,40000
0,50000
0,60000
0,70000
1 2 3 4 5 6 7 8 9 10
Diagrammtitel
Datenreihen1 Datenreihen2
Datenreihen3 Datenreihen4
MLP
RNN
Checkpoint
Accuracy [MCC]
Technical Lessons Learned
Analytics
Operators benefit from
knowing data accuracy
• Augment data (actual or
predicted) with
confidence intervals,
error ranges, reliability
estimates, …
TT, A. Metzger, Riga 2019 7
[A. Metzger et al., “Proactive process adaptation using deep
learning ensembles,” in CAiSE 2019, Rome, Italy, Springer;
Open Access: https://doi.org/10.1007/978-3-030-21290-2_34]
Alarm
Reliability Estimate
Terminal Productivity Cockpit
Technical Lessons Learned
Visualization
Do not show too much information
• Show information hierarchically
(top-down: summary  details)
• Use quickly to grasp and intuitive widgets
• Only show critical and validated events
Static UIs may be limiting
• Easy and ad-hoc customization of
visualization
TT, A. Metzger, Riga 2019 8
Technical Lessons Learned
Data Management
Data availability does not mean fit for purpose
• Define data analytics / visualization goal and then
determine which data and how to access
(or combination: bottom-up + top-down)
Data quality and integration takes around 80% of
effort/time
• Plan sufficient time at project start for data refinement
and fine-tuning of data collection
TT, A. Metzger, Riga 2019 9
Finally…
What’s next?
AI in Transport
TT, A. Metzger, Riga 2019
Autonomic
Enactment
• Reinforcement
learning for solving
complex planning and
decision problems
• Actuation driven by AI
decisions
• Safety and
trustworthiness as key
requirements for
adoption
Improved
Decision Making
• Deep learning for high
accuracy descriptive and
predictive analytics
10
[S. Zillner, J.A. Gomez, A. Garcia, E. Curry (Eds.), “Data for
Artificial Intelligence for European economic competitiveness
and societal progress – BDVA position statement,” 2018]
Thanks!
TT, A. Metzger, Riga 2019 11
Research leading to these results has received
funding from the EU’s Horizon 2020 research and
innovation programme under grant agreements no.
731932 – http://www.transformingtransport.eu
732630 – http://www.big-data-value.eu

Más contenido relacionado

La actualidad más candente

A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentIJDKP
 
Unstructured Data Fact Sheet
Unstructured Data Fact SheetUnstructured Data Fact Sheet
Unstructured Data Fact SheetConnexica
 
IRJET- Improved Vault based Tokenization to Boost Vault Lookup Performance
IRJET-  	  Improved Vault based Tokenization to Boost Vault Lookup PerformanceIRJET-  	  Improved Vault based Tokenization to Boost Vault Lookup Performance
IRJET- Improved Vault based Tokenization to Boost Vault Lookup PerformanceIRJET Journal
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools forIJDKP
 
Privacy Preserving Data Mining
Privacy Preserving Data MiningPrivacy Preserving Data Mining
Privacy Preserving Data MiningVrushali Malvadkar
 

La actualidad más candente (10)

Bi assignment
Bi assignmentBi assignment
Bi assignment
 
A Study On Red Box Data Mining Approach
A Study On Red Box Data Mining ApproachA Study On Red Box Data Mining Approach
A Study On Red Box Data Mining Approach
 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environment
 
Testing
TestingTesting
Testing
 
Unstructured Data Fact Sheet
Unstructured Data Fact SheetUnstructured Data Fact Sheet
Unstructured Data Fact Sheet
 
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
Intelligent Data Extraction, Turning Content into Data, A Look at Advanced Ca...
 
IRJET- Improved Vault based Tokenization to Boost Vault Lookup Performance
IRJET-  	  Improved Vault based Tokenization to Boost Vault Lookup PerformanceIRJET-  	  Improved Vault based Tokenization to Boost Vault Lookup Performance
IRJET- Improved Vault based Tokenization to Boost Vault Lookup Performance
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools for
 
Data Mining
Data MiningData Mining
Data Mining
 
Privacy Preserving Data Mining
Privacy Preserving Data MiningPrivacy Preserving Data Mining
Privacy Preserving Data Mining
 

Similar a Big Data Technology Insights

What's new in SQL Server 2016
What's new in SQL Server 2016What's new in SQL Server 2016
What's new in SQL Server 2016Onomi
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroDenodo
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
GDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data VirtualizationGDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data VirtualizationDenodo
 
Unit 3 3 architectural design
Unit 3 3 architectural designUnit 3 3 architectural design
Unit 3 3 architectural designHiren Selani
 
Forensic drive correlation
Forensic drive correlationForensic drive correlation
Forensic drive correlationRamesh Gubba
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
ESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXCatalogic Software
 
brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)Chet Fincke
 
brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)Vince Garr
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?Xpand IT
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Denodo
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
A Machine learning based Data Quality Analysis Approach
A Machine learning based Data Quality Analysis ApproachA Machine learning based Data Quality Analysis Approach
A Machine learning based Data Quality Analysis ApproachNandeep Nagarkar
 
A Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfA Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfCassie Romero
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraMolly Alexander
 

Similar a Big Data Technology Insights (20)

What's new in SQL Server 2016
What's new in SQL Server 2016What's new in SQL Server 2016
What's new in SQL Server 2016
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
Data Virtualization: From Zero to Hero
Data Virtualization: From Zero to HeroData Virtualization: From Zero to Hero
Data Virtualization: From Zero to Hero
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
GDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data VirtualizationGDPR Compliance Made Easy with Data Virtualization
GDPR Compliance Made Easy with Data Virtualization
 
Unit 3 3 architectural design
Unit 3 3 architectural designUnit 3 3 architectural design
Unit 3 3 architectural design
 
Forensic drive correlation
Forensic drive correlationForensic drive correlation
Forensic drive correlation
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
ESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPXESG Lab Report - Catalogic Software DPX
ESG Lab Report - Catalogic Software DPX
 
brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)
 
brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)brocade-five-pillars-federal-data-centers-wp (1)
brocade-five-pillars-federal-data-centers-wp (1)
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
 
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
A Machine learning based Data Quality Analysis Approach
A Machine learning based Data Quality Analysis ApproachA Machine learning based Data Quality Analysis Approach
A Machine learning based Data Quality Analysis Approach
 
A Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfA Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdf
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
 

Más de Andreas Metzger

Explainable Online Reinforcement Learning for Adaptive Systems
Explainable Online Reinforcement Learning for Adaptive SystemsExplainable Online Reinforcement Learning for Adaptive Systems
Explainable Online Reinforcement Learning for Adaptive SystemsAndreas Metzger
 
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...Andreas Metzger
 
Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
Explaining Online Reinforcement Learning Decisions of Self-Adaptive SystemsExplaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
Explaining Online Reinforcement Learning Decisions of Self-Adaptive SystemsAndreas Metzger
 
Antrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAntrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAndreas Metzger
 
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesFeature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesAndreas Metzger
 
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Andreas Metzger
 
Data-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsData-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsAndreas Metzger
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management Andreas Metzger
 
Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Andreas Metzger
 
Data-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsData-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsAndreas Metzger
 
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Andreas Metzger
 
Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Andreas Metzger
 
Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Andreas Metzger
 
Predictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskPredictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskAndreas Metzger
 
Risk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationRisk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationAndreas Metzger
 
Predictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesPredictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesAndreas Metzger
 

Más de Andreas Metzger (16)

Explainable Online Reinforcement Learning for Adaptive Systems
Explainable Online Reinforcement Learning for Adaptive SystemsExplainable Online Reinforcement Learning for Adaptive Systems
Explainable Online Reinforcement Learning for Adaptive Systems
 
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...
Data Quality Issues in Online Reinforcement Learning for Self-Adaptive System...
 
Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
Explaining Online Reinforcement Learning Decisions of Self-Adaptive SystemsExplaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
 
Antrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptxAntrittsvorlesung - APL.pptx
Antrittsvorlesung - APL.pptx
 
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive ServicesFeature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
Feature Model-Guided Online Reinforcement Learning for Self-Adaptive Services
 
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
Triggering Proactive Business Process Adaptations via Online Reinforcement Le...
 
Data-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software SystemsData-driven AI for Self-Adaptive Software Systems
Data-driven AI for Self-Adaptive Software Systems
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management
 
Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles Proactive Process Adaptation using Deep Learning Ensembles
Proactive Process Adaptation using Deep Learning Ensembles
 
Data-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information SystemsData-driven AI for Self-adaptive Information Systems
Data-driven AI for Self-adaptive Information Systems
 
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
Towards an End-to-End Architecture for Run-time Data Protection in the Cloud
 
Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...Considering Non-sequential Control Flows for Process Prediction with Recurren...
Considering Non-sequential Control Flows for Process Prediction with Recurren...
 
Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics
 
Predictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and RiskPredictive Business Process Monitoring considering Reliability and Risk
Predictive Business Process Monitoring considering Reliability and Risk
 
Risk-based Proactive Process Adaptation
Risk-based Proactive Process AdaptationRisk-based Proactive Process Adaptation
Risk-based Proactive Process Adaptation
 
Predictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability EstimatesPredictive Process Monitoring Considering Reliability Estimates
Predictive Process Monitoring Considering Reliability Estimates
 

Último

Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxAleenaJamil4
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 

Último (20)

Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 

Big Data Technology Insights

  • 1. Big Data Technology Insights Andreas Metzger (paluno, TT Technical Coordinator)
  • 2. TT Methodology Rationale • “No free lunch” – Each data set, domain, use case is different – Using a single data analytics solution will most probably not work  For each of the 13 Pilots – Data analytics solutions best suited for requirement and datasets – Dedicated infrastructures best linked to data sources Reuse of best practices, common requirements, lessons learned, … – Within pilots, across pilots, beyond project TT, A. Metzger, Riga 2019 2
  • 3. TT Methodology 3-Stage validation and scale-up Stage Embedding Scale of Data Technology Validation Problem understanding and validation of key solution ideas (Historic) data pinpointing problems and opportunities Large-scale Experiments Controlled environment (not productive environment) Large historic and real-time data, possibly anonymized / simulated In-situ (on site) trials Trials in the field, involving actual end-users Real-time, live production data complementing historic data TT, A. Metzger, Riga 2019 3
  • 4. TT Technical Results TT, A. Metzger, Riga 2019 4 5 main technical priorities with individual sub-topics Coverage of BDVA SRIA [S. Zillner, E. Curry, A. Metzger, R. Seidl (Eds.), “European big data value strategic research and innovation agenda (SRIA),” Version 4.0, October, 2017]
  • 5. TT, A. Metzger, Riga 2019 5 D10.4 Data ManagementSemantic Annotation of unstructured and semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3 Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3 Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business processes 3 2 3 4 4 4 4 4 4 4 4 4 4 Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data management 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Processing Architectures Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4 Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at- rest 4 4 4 4 4 4 4 4 4 4 4 4 4 Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4 Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4 Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data Analytics Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2 Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4 Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2 Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics (HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2 Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data ProtectionGeneric and easy to use data protection approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party computation) 4 4 4 4 4 4 4 4 4 4 4 4 4 Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Visualisation and User Interaction Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2 Other (specify) D9.4D5.4 D6.4 D7.4D4.4 D8.4 1 = Main focus 2 = Topic addressed (but not main focus) 3 = Topic marginally addressed 4 = Topic not addressed Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Visualisation and User Interaction Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2 Other (specify) Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4 Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at- rest 4 4 4 4 4 4 4 4 4 4 4 4 4 Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4 Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4 Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data Analytics Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2 Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4 Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2 Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics (HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2 Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data ProtectionGeneric and easy to use data protection approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party computation) 4 4 4 4 4 4 4 4 4 4 4 4 4 Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Visualisation and User Interaction Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2 Other (specify) D10.4 Data ManagementSemantic Annotation of unstructured and semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3 Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3 Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business processes 3 2 3 4 4 4 4 4 4 4 4 4 4 Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data management 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Processing Architectures Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4 Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at- rest 4 4 4 4 4 4 4 4 4 4 4 4 4 Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4 Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4 Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data Analytics Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2 Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4 Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2 Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics (HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2 Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data ProtectionGeneric and easy to use data protection approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party computation) 4 4 4 4 4 4 4 4 4 4 4 4 4 Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Visualisation and User Interaction Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive D9.4D5.4 D6.4 D7.4D4.4 D8.4 semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3 Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3 Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business processes 3 2 3 4 4 4 4 4 4 4 4 4 4 Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data management 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Processing Architectures Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4 Scalability 3 3 3 3 3 3 3 3 3 3 3 3 3Processing of data-in-motion and data-at- rest 4 4 4 4 4 4 4 4 4 4 4 4 4 Decentralizatrion 4 4 4 4 4 4 4 4 4 4 4 4 4 Performance 4 4 4 4 4 4 4 4 4 4 4 4 4Novel architectures for enabling new types of big data workloads 3 3 4 4 4 4 4 4 4 4 4 4 4 Introduction of new hardware capabilities 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data Analytics Semantic and knowledge-based analysis 3 2 3 3 2 2 3 3 2 2 2 2 2 Content validation 4 4 4 4 3 3 4 4 3 3 4 4 4 Analytics frameworks & processing 2 3 3 3 3 3 3 3 3 3 3 3 3Advanced business analytics and intelligence 3 2 2 1 1 1 2 2 3 3 2 2 2 Predictive and prescriptive analytics 1 1 1 2 1 1 1 1 2 2 1 1 1High Performance Data Analytics (HPDA) 2 2 2 2 1 1 2 2 2 2 3 3 2 Data analytics and Artificial Intelligence 4 4 4 3 4 4 4 4 4 4 4 4 3 Other (specify) Data ProtectionGeneric and easy to use data protection approaches 4 4 4 4 4 4 4 4 4 4 4 4 4Robust Data privacy (incl. multi-party computation) 4 4 4 4 4 4 4 4 4 4 4 4 4 Risk based approaches 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Visualisation and User Interaction Visual data discovery 3 3 3 2 2 2 3 3 3 3 3 3 3Interactive visual analytics of multiple scale data 2 2 3 2 2 2 2 2 3 3 2 2 2Collaborative, intuitive and interactive visual interfaces 2 2 2 2 2 2 2 2 3 3 2 2 2Interactive visual data exploration and querying in a multi-device context 2 2 2 2 2 2 2 2 3 3 2 2 2 Other (specify) D10.4 Data ManagementSemantic Annotation of unstructured and semi- structured data 2 2 3 3 3 3 3 3 3 3 4 4 3 Semantic interoperability 3 3 3 4 3 3 3 3 3 3 4 4 3 Data quality 3 3 4 4 2 2 4 4 4 4 4 4 4Data lifecycle management and data governance 4 4 4 4 4 4 4 4 3 3 3 3 3Integration of data and business processes 3 2 3 4 4 4 4 4 4 4 4 4 4 Data-as-a service 4 4 4 4 4 4 4 4 4 4 4 4 3Distributed trust infrastructures for data management 4 4 4 4 4 4 4 4 4 4 4 4 4 Other (specify) Data Processing Architectures Heterogeneity 4 4 4 4 4 4 4 4 4 4 3 3 4 D9.4D5.4 D6.4 D7.4D4.4 D8.4 2 1 3
  • 6. Technical Lessons Learned Analytics “Garbage-in – garbage-out” • Check and cope with missing data, data accuracy, data timeliness, different time-zones (clocks), … Deep Learning works very well “out of the box” • Use Deep learning to make more efficient development and engineering of big data applications (no need for extensive hyper-parametrization) TT, A. Metzger, Riga 2019 6 [A. Metzger & A. Neubauer, “Considering non-sequential control flows for process prediction with recurrent neural networks,” in SEAA 2018, Prague, Czech Republic, IEEE Computer Society] 0,00000 0,10000 0,20000 0,30000 0,40000 0,50000 0,60000 0,70000 1 2 3 4 5 6 7 8 9 10 Diagrammtitel Datenreihen1 Datenreihen2 Datenreihen3 Datenreihen4 MLP RNN Checkpoint Accuracy [MCC]
  • 7. Technical Lessons Learned Analytics Operators benefit from knowing data accuracy • Augment data (actual or predicted) with confidence intervals, error ranges, reliability estimates, … TT, A. Metzger, Riga 2019 7 [A. Metzger et al., “Proactive process adaptation using deep learning ensembles,” in CAiSE 2019, Rome, Italy, Springer; Open Access: https://doi.org/10.1007/978-3-030-21290-2_34] Alarm Reliability Estimate Terminal Productivity Cockpit
  • 8. Technical Lessons Learned Visualization Do not show too much information • Show information hierarchically (top-down: summary  details) • Use quickly to grasp and intuitive widgets • Only show critical and validated events Static UIs may be limiting • Easy and ad-hoc customization of visualization TT, A. Metzger, Riga 2019 8
  • 9. Technical Lessons Learned Data Management Data availability does not mean fit for purpose • Define data analytics / visualization goal and then determine which data and how to access (or combination: bottom-up + top-down) Data quality and integration takes around 80% of effort/time • Plan sufficient time at project start for data refinement and fine-tuning of data collection TT, A. Metzger, Riga 2019 9
  • 10. Finally… What’s next? AI in Transport TT, A. Metzger, Riga 2019 Autonomic Enactment • Reinforcement learning for solving complex planning and decision problems • Actuation driven by AI decisions • Safety and trustworthiness as key requirements for adoption Improved Decision Making • Deep learning for high accuracy descriptive and predictive analytics 10 [S. Zillner, J.A. Gomez, A. Garcia, E. Curry (Eds.), “Data for Artificial Intelligence for European economic competitiveness and societal progress – BDVA position statement,” 2018]
  • 11. Thanks! TT, A. Metzger, Riga 2019 11 Research leading to these results has received funding from the EU’s Horizon 2020 research and innovation programme under grant agreements no. 731932 – http://www.transformingtransport.eu 732630 – http://www.big-data-value.eu