SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Semantic Data Management
2
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Actors and their Requirements
Data Provider
Data Consumer
▪ Knows his data and use cases
▪ Provides data so others can work with them
▪ Needs a way to share his knowledge
▪ Wants to work with data
▪ Needs to find the right data for his use case
▪ Needs to access the data
▪ Needs to understand the data
Different actors have different assumptions, goals and challenges when working with
data
3
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Current Situation
▪ Data scientists spend almost 80% of their
time with collecting, cleaning and
organizing data
▪ Common challenges:
▪ Quality, comprehensiveness, availability,
retrievability, and expandability of data
▪ To make the most of your data, they need
to be available in a way that not only allows
but supports further processing and future
use cases
▪ Semantic data management effectively
reduces the time-to-application and
prepares your data for future use cases
Source: Forbes, 2016
Building training setsCleaning and organizing data
Collecting data sets
Mining data for patterns
Refining algorithms
other
What data scientists spend the most time doing
4
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Challenges
XML
Heterogeneous
Provision
Relational HierachyHeterogeneous
Models
Heterogeneous
Formats
°C °F K
Heterogeneous
Semantics
Heterogeneous
Quality
Heterogeneous
Use Cases
1
2
3
5
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Schematic Representation of Initial Situation
Data IntegrationData Provision Data Extraction
DataProvider
Weather
Data
Customer Data
Supplier
Data
Social
Media
Data
Product
Data
DataProvider
Data Consumer
6
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Schematic Representation of Semantic Data Integration & Management
DataProvider
Weather
Data
Customer Data
Supplier
Data
Social
Media
Data
Product
Data
DataProvider
Semantic Query
& Access
Semantic
Processing
Semantic
Integration
Semantic Data Integration & Management
Data Provision
Data Consumer
7
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Our Mission
▪ Google finds and "understands" websites without the need for people to describe
them explicitly
▪ How can enterprise data be provided and found without complex and possibly
incomprehensible descriptions?
▪ Our aim is to reduce the Time-to-Application of enterprise data by supporting data
scientist to answer pressing questions like:
▪ Where to find the data needed for an application, analysis etc.?
▪ What information is contained in the available data?
▪ Are there other data sources that might be relevant to solve a challenge?
8
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Research Goals
Bridge the gap between physical & virtual world by
developing intuitive concepts for Big Data collection
Improve the usability & accessibility of data with the help of
semantic querying
Enable semantic processing for data based on evolving
Knowledge Graphs
Develop new approaches for constructing and evolving
Knowledge Graphs
Develop semantic mapping approaches that leverage
machine learning & external knowledge
9
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Big Data Collection
Bridge the gap between physical & virtual world by
developing intuitive concepts for Big Data collection
HTTP Push
Agent
OPC-UA
AgentProvider
Provider MQTT
Agent
Data Ingestion
Industrial Data Lake
▪ Plug & Ingest: Autonomous agents monitor data sources and collect the data in
heterogeneous company-wide environments
▪ Agents proactively react to changing data sources and interfaces
▪ Data is ingested into a central location, e.g., into a data lake
10
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Continuous Ontology & Knowledge Graph Creation
Develop new approaches for constructing and evolving
Knowledge Graphs
Knowledge graph learns continuously from interacting with its environment
Defines Semantic Models
Data Provider System
Autonomous Evolution
Data Consumer
Queries Data Sets
▪ Knowledge Graph grows and strengthens continuously
▪ Knowledge Graph allows statements about relationships of data
sets
▪ Advanced AI algorithms monitor and support the evolution of the
knowledge graph
11
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Continuous Ontology & Knowledge Graph Creation
Develop new approaches for constructing and evolving
Knowledge Graphs
Data Description
Local Knowledge Representation
Data collection and access
Construction Industrial Data Lake
Learning from Data
Global Knowledge Graph
12
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Semantic Mapping: External Knowledge Recommendation
Develop semantic mapping approaches that leverage
machine learning & external knowledge
Timestamp (92,2%)
Identifier (4,2%)
…
Knowledge Graph
External Knowledge 1
External Knowledge 2
13
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Semantic Mapping: Machine Learning Recommendation
Develop semantic mapping approaches that leverage
machine learning & external knowledge
Knowledge Graph
Machine Learning Layer
Recommendation
…
Timestamp (87,2%)
Number (9,1%)
…
14
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Semantic Processing
Enable semantic processing for data based on evolving
Knowledge Graphs
▪ Consider the continuously changing knowledge graph and enable the processing of
data based on the provided semantics
▪ Prepare the data so that they are directly useable for analytics or applications
15
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Developing Intuitive Concepts for Big Data Collection
Improve the usability & accessibility of data with the help of
semantic querying
▪ Enable data set queries based on natural language and visual interfaces
▪ Leverage external knowledge databases to expand queries
▪ Learn from users queries in order to improve the personalized search experience
Natural Language &
Visual Queries
Personalize the search
experience
Expand queries
Selected Research Projects
and Applications
17
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Cross-site Enterprise Data Lake in Automotive Industry
Cross-site Enterprise Data Lake
▪ Functional cross-site data acquisition and management
infrastructure for five plants on two continents
▪ Over 1000 different data sources integrated
▪ Conceptual design of cross-site, scalable, non-invasive
data acquisition and -management
▪ Integration of heterogeneous data sources in a common
long-term data storage for each site through configurable
connector modules
▪ Automatic metadata enrichment
▪ Cross-site transparent consolidation of local data lakes
based on information models
▪ Provision of consolidated data through common methods
and familiar tools (Tableau, Power BI, RapidMiner,…)
Results
▪ Construction of a cross-site enterprise data lake that
continuously collects and integrates data from multiple
factories
Approach
Goal
Configurable, intelligent software connectors
Live operation for more than 2 years
currently more than 6 TB/year
Connection of five
plants on two continents
▪ Logistics
▪ Construction
▪ Paint Shop
▪ Assembly
▪ Rework
Optimization of throughput times on
the conveyor belt
18
Semantic Data Management
Chair of Technologies and Management of Digital Transformation, University of Wuppertal
Semantic Data Management
Knowledge Graph-based Data Management Platform (KGBDM Platform)
KGBDM Platform
▪ Available state of the art semantic data management
platform that is used in various research and
development projects
▪ Provide different connector modules for adding data
sources
▪ Data providers create semantic models for their data sets
▪ Semantic model creation is supported with the help of
recommendation systems and artificial intelligence
▪ Semantic Data Integration based on the provided semantic
models
▪ Data access via the semantic search
▪ No need to define ontologies. The KGBDM platform learns
a knowledge graph based on the user provided semantic
models and queries
Results
▪ Reduce the time-to-application by handling heterogeneous
data sources on an information level
Approach
Goal
Your Contact Person:
André Pomp, M.Sc.
Tel: +49 (0)202 439 1153
pomp@uni-wuppertal.de
Chair for Technologies and Management of Digital Transformation
Univ. Prof. Dr. Ing. Tobias Meisen
https://www.tmdt.uni-wuppertal.de/
Campus Freudenberg
Rainer-Gruenter-Str. 21
D-42119 Wuppertal
Germany
University of Wuppertal
School of Electrical, Information and Media Engineering

Más contenido relacionado

La actualidad más candente

Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsRick Perret
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingm_hepburn
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsGov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsDataWorks Summit
 
Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013Erdenebayar Erdenebileg
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics ArchitectureArvind Sathi
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Vasu S
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big DataDataWorks Summit
 
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr..."Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...Dataconomy Media
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersDataWorks Summit
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeDataWorks Summit
 
Making Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeMaking Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeDataWorks Summit
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKUlf Mattsson
 
Sudhir hadoop and Data warehousing resume
Sudhir hadoop and Data warehousing resume Sudhir hadoop and Data warehousing resume
Sudhir hadoop and Data warehousing resume Sudhir Saxena
 
10 things you need to know about Spark
10 things you need to know about Spark10 things you need to know about Spark
10 things you need to know about SparkIBM Analytics
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7mmathipra
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB
 

La actualidad más candente (20)

Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & Analytics
 
Apache hadoop bigdata-in-banking
Apache hadoop bigdata-in-bankingApache hadoop bigdata-in-banking
Apache hadoop bigdata-in-banking
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address RequirementsGov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
 
Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics Architecture
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Deutsche Telekom on Big Data
Deutsche Telekom on Big DataDeutsche Telekom on Big Data
Deutsche Telekom on Big Data
 
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr..."Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
"Empower Developers with HPE Machine Learning and Augmented Intelligence", Dr...
 
Monitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service ProvidersMonitizing Big Data at Telecom Service Providers
Monitizing Big Data at Telecom Service Providers
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application code
 
Making Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeMaking Bank Predictive and Real-Time
Making Bank Predictive and Real-Time
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
 
Sudhir hadoop and Data warehousing resume
Sudhir hadoop and Data warehousing resume Sudhir hadoop and Data warehousing resume
Sudhir hadoop and Data warehousing resume
 
Destroying Data Silos
Destroying Data SilosDestroying Data Silos
Destroying Data Silos
 
10 things you need to know about Spark
10 things you need to know about Spark10 things you need to know about Spark
10 things you need to know about Spark
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 

Similar a Semantic Data Management

How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...Denodo
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIDenodo
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital TransformationMukund Babbar
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
Pratik Patel Python/ Big Data Analyst
Pratik Patel Python/ Big Data AnalystPratik Patel Python/ Big Data Analyst
Pratik Patel Python/ Big Data AnalystPratik Patel
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Denodo
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bankChungsik Yun
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your PortfolioDenodo
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)Denodo
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...PwC
 

Similar a Semantic Data Management (20)

How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AI
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital Transformation
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Pratik Patel Python/ Big Data Analyst
Pratik Patel Python/ Big Data AnalystPratik Patel Python/ Big Data Analyst
Pratik Patel Python/ Big Data Analyst
 
Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)Advanced Analytics and Machine Learning with Data Virtualization (India)
Advanced Analytics and Machine Learning with Data Virtualization (India)
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
 
3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio3 Reasons Data Virtualization Matters in Your Portfolio
3 Reasons Data Virtualization Matters in Your Portfolio
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
 

Último

怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATIONLakpaYanziSherpa
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxchadhar227
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxParas Gupta
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制vexqp
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制vexqp
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........EfruzAsilolu
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 

Último (20)

怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
怎样办理旧金山城市学院毕业证(CCSF毕业证书)成绩单学校原版复制
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 

Semantic Data Management

  • 2. 2 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Actors and their Requirements Data Provider Data Consumer ▪ Knows his data and use cases ▪ Provides data so others can work with them ▪ Needs a way to share his knowledge ▪ Wants to work with data ▪ Needs to find the right data for his use case ▪ Needs to access the data ▪ Needs to understand the data Different actors have different assumptions, goals and challenges when working with data
  • 3. 3 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Current Situation ▪ Data scientists spend almost 80% of their time with collecting, cleaning and organizing data ▪ Common challenges: ▪ Quality, comprehensiveness, availability, retrievability, and expandability of data ▪ To make the most of your data, they need to be available in a way that not only allows but supports further processing and future use cases ▪ Semantic data management effectively reduces the time-to-application and prepares your data for future use cases Source: Forbes, 2016 Building training setsCleaning and organizing data Collecting data sets Mining data for patterns Refining algorithms other What data scientists spend the most time doing
  • 4. 4 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Challenges XML Heterogeneous Provision Relational HierachyHeterogeneous Models Heterogeneous Formats °C °F K Heterogeneous Semantics Heterogeneous Quality Heterogeneous Use Cases 1 2 3
  • 5. 5 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Schematic Representation of Initial Situation Data IntegrationData Provision Data Extraction DataProvider Weather Data Customer Data Supplier Data Social Media Data Product Data DataProvider Data Consumer
  • 6. 6 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Schematic Representation of Semantic Data Integration & Management DataProvider Weather Data Customer Data Supplier Data Social Media Data Product Data DataProvider Semantic Query & Access Semantic Processing Semantic Integration Semantic Data Integration & Management Data Provision Data Consumer
  • 7. 7 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Our Mission ▪ Google finds and "understands" websites without the need for people to describe them explicitly ▪ How can enterprise data be provided and found without complex and possibly incomprehensible descriptions? ▪ Our aim is to reduce the Time-to-Application of enterprise data by supporting data scientist to answer pressing questions like: ▪ Where to find the data needed for an application, analysis etc.? ▪ What information is contained in the available data? ▪ Are there other data sources that might be relevant to solve a challenge?
  • 8. 8 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Research Goals Bridge the gap between physical & virtual world by developing intuitive concepts for Big Data collection Improve the usability & accessibility of data with the help of semantic querying Enable semantic processing for data based on evolving Knowledge Graphs Develop new approaches for constructing and evolving Knowledge Graphs Develop semantic mapping approaches that leverage machine learning & external knowledge
  • 9. 9 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Big Data Collection Bridge the gap between physical & virtual world by developing intuitive concepts for Big Data collection HTTP Push Agent OPC-UA AgentProvider Provider MQTT Agent Data Ingestion Industrial Data Lake ▪ Plug & Ingest: Autonomous agents monitor data sources and collect the data in heterogeneous company-wide environments ▪ Agents proactively react to changing data sources and interfaces ▪ Data is ingested into a central location, e.g., into a data lake
  • 10. 10 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Continuous Ontology & Knowledge Graph Creation Develop new approaches for constructing and evolving Knowledge Graphs Knowledge graph learns continuously from interacting with its environment Defines Semantic Models Data Provider System Autonomous Evolution Data Consumer Queries Data Sets ▪ Knowledge Graph grows and strengthens continuously ▪ Knowledge Graph allows statements about relationships of data sets ▪ Advanced AI algorithms monitor and support the evolution of the knowledge graph
  • 11. 11 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Continuous Ontology & Knowledge Graph Creation Develop new approaches for constructing and evolving Knowledge Graphs Data Description Local Knowledge Representation Data collection and access Construction Industrial Data Lake Learning from Data Global Knowledge Graph
  • 12. 12 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Semantic Mapping: External Knowledge Recommendation Develop semantic mapping approaches that leverage machine learning & external knowledge Timestamp (92,2%) Identifier (4,2%) … Knowledge Graph External Knowledge 1 External Knowledge 2
  • 13. 13 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Semantic Mapping: Machine Learning Recommendation Develop semantic mapping approaches that leverage machine learning & external knowledge Knowledge Graph Machine Learning Layer Recommendation … Timestamp (87,2%) Number (9,1%) …
  • 14. 14 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Semantic Processing Enable semantic processing for data based on evolving Knowledge Graphs ▪ Consider the continuously changing knowledge graph and enable the processing of data based on the provided semantics ▪ Prepare the data so that they are directly useable for analytics or applications
  • 15. 15 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Developing Intuitive Concepts for Big Data Collection Improve the usability & accessibility of data with the help of semantic querying ▪ Enable data set queries based on natural language and visual interfaces ▪ Leverage external knowledge databases to expand queries ▪ Learn from users queries in order to improve the personalized search experience Natural Language & Visual Queries Personalize the search experience Expand queries
  • 17. 17 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Cross-site Enterprise Data Lake in Automotive Industry Cross-site Enterprise Data Lake ▪ Functional cross-site data acquisition and management infrastructure for five plants on two continents ▪ Over 1000 different data sources integrated ▪ Conceptual design of cross-site, scalable, non-invasive data acquisition and -management ▪ Integration of heterogeneous data sources in a common long-term data storage for each site through configurable connector modules ▪ Automatic metadata enrichment ▪ Cross-site transparent consolidation of local data lakes based on information models ▪ Provision of consolidated data through common methods and familiar tools (Tableau, Power BI, RapidMiner,…) Results ▪ Construction of a cross-site enterprise data lake that continuously collects and integrates data from multiple factories Approach Goal Configurable, intelligent software connectors Live operation for more than 2 years currently more than 6 TB/year Connection of five plants on two continents ▪ Logistics ▪ Construction ▪ Paint Shop ▪ Assembly ▪ Rework Optimization of throughput times on the conveyor belt
  • 18. 18 Semantic Data Management Chair of Technologies and Management of Digital Transformation, University of Wuppertal Semantic Data Management Knowledge Graph-based Data Management Platform (KGBDM Platform) KGBDM Platform ▪ Available state of the art semantic data management platform that is used in various research and development projects ▪ Provide different connector modules for adding data sources ▪ Data providers create semantic models for their data sets ▪ Semantic model creation is supported with the help of recommendation systems and artificial intelligence ▪ Semantic Data Integration based on the provided semantic models ▪ Data access via the semantic search ▪ No need to define ontologies. The KGBDM platform learns a knowledge graph based on the user provided semantic models and queries Results ▪ Reduce the time-to-application by handling heterogeneous data sources on an information level Approach Goal
  • 19. Your Contact Person: André Pomp, M.Sc. Tel: +49 (0)202 439 1153 pomp@uni-wuppertal.de Chair for Technologies and Management of Digital Transformation Univ. Prof. Dr. Ing. Tobias Meisen https://www.tmdt.uni-wuppertal.de/ Campus Freudenberg Rainer-Gruenter-Str. 21 D-42119 Wuppertal Germany University of Wuppertal School of Electrical, Information and Media Engineering