SlideShare una empresa de Scribd logo
1 de 10
Enabling Data Analytics from Knowledge Graphs
Henrique Santos
Universidade de Fortaleza, Fortaleza, CE, Brazil
The 16th
International Semantic Web Conference (ISWC 2017) – Doctoral Consortium
Vienna, Austria – 22 October 2017
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs2
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Problem statement
●
Datasets are the most common source of scientifc data for data analysis
●
Lack of metadata, not clean, can’t be directly combined or compared
●
Knowledge Graphs for scientifc data are on the rise
●
Many approaches, multiple uses: but data scientists are still using datasets
●
Consequence: data preparation takes around 80% of the time of the
whole analytical process (PATIL, 2012)
●
How to maintain enough metadata related to scientifc data?
●
How to exploit that knowledge to foster data analytics activities?
●
How integrate data from scientifc KGs with regular data tools like R,
Python or BI softwares?
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs3
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Related work
●
W3C’s CSV on the Web
●
Scientifc ontologies
●
SSN – Semantic Sensor Network
●
VSTO – Virtual Solar-Terrestrial Observatory
●
HAScO – Human-Aware Science Ontology
●
Indicators
●
GCI Ontology
●
Scientifc Knowledge graphs
●
Gene Ontology, Bio2RDF, The Graph of Things
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs4
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Research questions & Hypothesis
Q1 Can ontologies be used to successfully bridge the knowledge gap between acquired scientifc data and
data users? If so, how?
Q2 Will data users and applications beneft from the use of knowledge behind each scientifc data point?
Q3 How to provide data access for scientifc KGs in a way that can be consumed by routine data tools
while making use of the attached data knowledge to facilitate analytics?
H1 The reuse of scientifc data ontologies with proper extensions and their alignments to domain
ontologies can mitigate the current loss of knowledge during data acquisition
H2 Providing data points together with their knowledge (e.g. provenance, contextual knowledge) to data
users and applications can facilitate data analytics compared to current dataset usage.
H3 A hybrid RDF serialization format that suits the needs of existing data tools but also is able to convey
knowledge can be used to serialize data from KGs together with its associated metadata.
H4 A query API for scientifc KGs can be used to output data together with its associated metadata in a
better way than current tools for querying RDF data for data tools.
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs5
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Approach
Data
annotation
KG
building
KG
browsing
KG
serialization
Intelligent
applications
C
HAScO
VSTO-I
HACitO
prov:Activityprov:Activity
hasco: Studyhasco: Study hasco:
DataAcquisition
hasco:
DataAcquisition
vstoi:
Deployment
vstoi:
Deployment
xsd:dateTime
xsd:dateTime
isData
AcquisitionOf hasDeployment
prov: startedAtTime
prov: endedAtTime
vstoi:
Instrument
vstoi:
Instrument
vstoi:
Platform
vstoi:
Platform
vstoi:
Detector
vstoi:
Detector
hasDetectorhasInstrument hasPlatform
C
●
Automatic data
visualization
●
Data cleansing
●
Infer semantic
diference between
data points
●
...
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs6
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Preliminary results
SANTOS, H. et al. Contextual Data Collection for Smart Cities.
In: Proceedings of the Sixth Workshop on Semantics for
Smarter Cities. Bethlehem, PA, USA. 2015.
SANTOS, H. et al. From Data to City Indicators: A Knowledge Graph for
Supporting Automatic Generation of Dashboards. In: The Semantic Web -
Proceedings of the 14th Extended Semantic Web Conference (ESWC 2017).
Portorož, Slovenia. 2017.
Data
annotation
KG
building
KG
browsing
KG
serialization
Intelligent
applications
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs7
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Evaluation plan
KG evaluation (H1): state-of-the-art KG evaluation approaches discussed
in (PAULHEIM, 2017).
Metadata evaluation (H2): gathering data analytics use cases and
assessing how the associated metadata facilitates the use of the data.
KG querying & serialization (H3, H4): tests with data scientists and
feld specialists acting as users of our proposed KG and processes. Using
their data (preferably from diferent studies and sources), we intend to
build a scientifc KG adding the relevant metadata and then provide them
tools for querying the data and preparing datasets for their routine data
analytics. Then, questionnaires will be applied to measure how much our
approach has eased their tasks in contrast with their regular processes.
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs8
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Relevancy
●
We expect this research to bring straight benefts to data
scientists and feld specialists, by providing specifcations and
tools that we claim will ease their data preparation tasks
●
KG serialization technique will promote interoperability
between scientifc data in KGs and existing non-semantic data
tools which we believe will broaden the use of KGs to even
more knowledge areas
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs9
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
Refections
●
Promoting data analytics from scientifc data in KGs is still in its early
stages
●
Difcult to query the needed data
●
Lack of methods and tools to easily cope data tools with data from KGs
●
Knowledge exploitation to foster data analysis is minimal
●
Our contributions
●
KG specifcation aligned with data analytics requirements
●
Data fle format able to convey both data and metadata
●
Method for data access and retrieval in scientifc KGs based on user queries
●
Our resources
●
Indicator and domain ontologies for developed use-cases
●
Implementations of the proposed method for data access
22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs10
The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium
hos@edu.unifor.br
@hansidm
http://henriquesantos.org
Enabling Data Analytics from Knowledge Graphs
Henrique Santos
Thank you for your attention
Advisor: Prof. João José Vasco Peixoto Furtado, Docteur

Más contenido relacionado

La actualidad más candente

Geostatistics Portal - the multitool for statistics on maps
Geostatistics Portal - the multitool for statistics on mapsGeostatistics Portal - the multitool for statistics on maps
Geostatistics Portal - the multitool for statistics on maps
Mirosław Migacz
 

La actualidad más candente (17)

Resume 2017
Resume 2017Resume 2017
Resume 2017
 
OpenCube Workshop at eGov2015 & ePart2015 dual conference
OpenCube Workshop at eGov2015 & ePart2015 dual conferenceOpenCube Workshop at eGov2015 & ePart2015 dual conference
OpenCube Workshop at eGov2015 & ePart2015 dual conference
 
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...
 
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data MethodologyUsing Road Sensor Data for Official Statistics: towards a Big Data Methodology
Using Road Sensor Data for Official Statistics: towards a Big Data Methodology
 
"Dude, where's my graph?" RDF Data Cubes for Clinical Trials Data
"Dude, where's my graph?" RDF Data Cubes for Clinical Trials Data"Dude, where's my graph?" RDF Data Cubes for Clinical Trials Data
"Dude, where's my graph?" RDF Data Cubes for Clinical Trials Data
 
Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...
Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...
Verso le trusted smart statistics - prospettive di sviluppo e risultati del e...
 
Prague Hacks 2015
Prague Hacks 2015Prague Hacks 2015
Prague Hacks 2015
 
Publishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case studyPublishing Linked Statistical Data: Aragón, a case study
Publishing Linked Statistical Data: Aragón, a case study
 
Opportunities and methodological challenges of Big Data for official statist...
Opportunities and methodological challenges of  Big Data for official statist...Opportunities and methodological challenges of  Big Data for official statist...
Opportunities and methodological challenges of Big Data for official statist...
 
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
Adam Bartusiak and Jörg Lässig | Semantic Processing for the Conversion of Un...
 
Integrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City EventsIntegrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City Events
 
Nick_Farrel_Resume
Nick_Farrel_ResumeNick_Farrel_Resume
Nick_Farrel_Resume
 
Stream processing: The Matrix Revolutions
Stream processing: The Matrix RevolutionsStream processing: The Matrix Revolutions
Stream processing: The Matrix Revolutions
 
Location analytics by Marc Planaguma at Big Data Spain 2014
 Location analytics by Marc Planaguma at Big Data Spain 2014 Location analytics by Marc Planaguma at Big Data Spain 2014
Location analytics by Marc Planaguma at Big Data Spain 2014
 
Geo-statistical Exploration of Milano Datasets
Geo-statistical Exploration of Milano DatasetsGeo-statistical Exploration of Milano Datasets
Geo-statistical Exploration of Milano Datasets
 
Geostatistics Portal - the multitool for statistics on maps
Geostatistics Portal - the multitool for statistics on mapsGeostatistics Portal - the multitool for statistics on maps
Geostatistics Portal - the multitool for statistics on maps
 
Startup Bootcamp - Online Payments Trends in the GCC
Startup Bootcamp - Online Payments Trends in the GCCStartup Bootcamp - Online Payments Trends in the GCC
Startup Bootcamp - Online Payments Trends in the GCC
 

Similar a Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium

Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Memory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective ViewMemory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective View
ijtsrd
 
Team 05 linked data generation
Team 05 linked data generationTeam 05 linked data generation
Team 05 linked data generation
plan4all
 

Similar a Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium (20)

Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slides
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
STATVIEW: a web platform for visualisation and dissemination of statistical d...
STATVIEW: a web platform for visualisation and dissemination of statistical d...STATVIEW: a web platform for visualisation and dissemination of statistical d...
STATVIEW: a web platform for visualisation and dissemination of statistical d...
 
Intact danish workshop_20171001
Intact danish workshop_20171001Intact danish workshop_20171001
Intact danish workshop_20171001
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_
 
Toward FAIR Semantic Resources
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic Resources
 
Data mining projects
Data mining projectsData mining projects
Data mining projects
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
Memory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective ViewMemory Management in BigData: A Perpective View
Memory Management in BigData: A Perpective View
 
4th International Conference on Data Mining and NLP (DNLP 2023)
4th International Conference on Data Mining and NLP (DNLP 2023) 4th International Conference on Data Mining and NLP (DNLP 2023)
4th International Conference on Data Mining and NLP (DNLP 2023)
 
Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...Identifying semantics characteristics of user’s interactions datasets through...
Identifying semantics characteristics of user’s interactions datasets through...
 
Observlets
Observlets Observlets
Observlets
 
Inspire hack 2017-linked-data
Inspire hack 2017-linked-dataInspire hack 2017-linked-data
Inspire hack 2017-linked-data
 
Team 05 linked data generation
Team 05 linked data generationTeam 05 linked data generation
Team 05 linked data generation
 
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)ESSnet Big Data WP8 Methodology (+ Quality, +IT)
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
 
Building a semantic-based decision support system to optimize the energy use ...
Building a semantic-based decision support system to optimize the energy use ...Building a semantic-based decision support system to optimize the energy use ...
Building a semantic-based decision support system to optimize the energy use ...
 
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
Setting Up a Qualitative or Mixed Methods Research Project in NVivo 10 to Cod...
 

Último

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
chadhar227
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
gajnagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 

Último (20)

Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf20240412-SmartCityIndex-2024-Full-Report.pdf
20240412-SmartCityIndex-2024-Full-Report.pdf
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
High Profile Call Girls Service in Jalore { 9332606886 } VVIP NISHA Call Girl...
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Indore [ 7014168258 ] Call Me For Genuine Models We...
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 

Enabling Data Analytics from Knowledge Graphs @ ISWC 2017 Doctoral Consortium

  • 1. Enabling Data Analytics from Knowledge Graphs Henrique Santos Universidade de Fortaleza, Fortaleza, CE, Brazil The 16th International Semantic Web Conference (ISWC 2017) – Doctoral Consortium Vienna, Austria – 22 October 2017
  • 2. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs2 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Problem statement ● Datasets are the most common source of scientifc data for data analysis ● Lack of metadata, not clean, can’t be directly combined or compared ● Knowledge Graphs for scientifc data are on the rise ● Many approaches, multiple uses: but data scientists are still using datasets ● Consequence: data preparation takes around 80% of the time of the whole analytical process (PATIL, 2012) ● How to maintain enough metadata related to scientifc data? ● How to exploit that knowledge to foster data analytics activities? ● How integrate data from scientifc KGs with regular data tools like R, Python or BI softwares?
  • 3. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs3 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Related work ● W3C’s CSV on the Web ● Scientifc ontologies ● SSN – Semantic Sensor Network ● VSTO – Virtual Solar-Terrestrial Observatory ● HAScO – Human-Aware Science Ontology ● Indicators ● GCI Ontology ● Scientifc Knowledge graphs ● Gene Ontology, Bio2RDF, The Graph of Things
  • 4. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs4 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Research questions & Hypothesis Q1 Can ontologies be used to successfully bridge the knowledge gap between acquired scientifc data and data users? If so, how? Q2 Will data users and applications beneft from the use of knowledge behind each scientifc data point? Q3 How to provide data access for scientifc KGs in a way that can be consumed by routine data tools while making use of the attached data knowledge to facilitate analytics? H1 The reuse of scientifc data ontologies with proper extensions and their alignments to domain ontologies can mitigate the current loss of knowledge during data acquisition H2 Providing data points together with their knowledge (e.g. provenance, contextual knowledge) to data users and applications can facilitate data analytics compared to current dataset usage. H3 A hybrid RDF serialization format that suits the needs of existing data tools but also is able to convey knowledge can be used to serialize data from KGs together with its associated metadata. H4 A query API for scientifc KGs can be used to output data together with its associated metadata in a better way than current tools for querying RDF data for data tools.
  • 5. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs5 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Approach Data annotation KG building KG browsing KG serialization Intelligent applications C HAScO VSTO-I HACitO prov:Activityprov:Activity hasco: Studyhasco: Study hasco: DataAcquisition hasco: DataAcquisition vstoi: Deployment vstoi: Deployment xsd:dateTime xsd:dateTime isData AcquisitionOf hasDeployment prov: startedAtTime prov: endedAtTime vstoi: Instrument vstoi: Instrument vstoi: Platform vstoi: Platform vstoi: Detector vstoi: Detector hasDetectorhasInstrument hasPlatform C ● Automatic data visualization ● Data cleansing ● Infer semantic diference between data points ● ...
  • 6. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs6 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Preliminary results SANTOS, H. et al. Contextual Data Collection for Smart Cities. In: Proceedings of the Sixth Workshop on Semantics for Smarter Cities. Bethlehem, PA, USA. 2015. SANTOS, H. et al. From Data to City Indicators: A Knowledge Graph for Supporting Automatic Generation of Dashboards. In: The Semantic Web - Proceedings of the 14th Extended Semantic Web Conference (ESWC 2017). Portorož, Slovenia. 2017. Data annotation KG building KG browsing KG serialization Intelligent applications
  • 7. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs7 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Evaluation plan KG evaluation (H1): state-of-the-art KG evaluation approaches discussed in (PAULHEIM, 2017). Metadata evaluation (H2): gathering data analytics use cases and assessing how the associated metadata facilitates the use of the data. KG querying & serialization (H3, H4): tests with data scientists and feld specialists acting as users of our proposed KG and processes. Using their data (preferably from diferent studies and sources), we intend to build a scientifc KG adding the relevant metadata and then provide them tools for querying the data and preparing datasets for their routine data analytics. Then, questionnaires will be applied to measure how much our approach has eased their tasks in contrast with their regular processes.
  • 8. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs8 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Relevancy ● We expect this research to bring straight benefts to data scientists and feld specialists, by providing specifcations and tools that we claim will ease their data preparation tasks ● KG serialization technique will promote interoperability between scientifc data in KGs and existing non-semantic data tools which we believe will broaden the use of KGs to even more knowledge areas
  • 9. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs9 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium Refections ● Promoting data analytics from scientifc data in KGs is still in its early stages ● Difcult to query the needed data ● Lack of methods and tools to easily cope data tools with data from KGs ● Knowledge exploitation to foster data analysis is minimal ● Our contributions ● KG specifcation aligned with data analytics requirements ● Data fle format able to convey both data and metadata ● Method for data access and retrieval in scientifc KGs based on user queries ● Our resources ● Indicator and domain ontologies for developed use-cases ● Implementations of the proposed method for data access
  • 10. 22 October 2017Henrique Santos - Enabling Data Analytics from Knowledge Graphs10 The 16th International Semantic Web Conference (ISWC 2017) – Vienna, Austria – Doctoral Consortium hos@edu.unifor.br @hansidm http://henriquesantos.org Enabling Data Analytics from Knowledge Graphs Henrique Santos Thank you for your attention Advisor: Prof. João José Vasco Peixoto Furtado, Docteur