SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
Source: http://lod-cloud.net/versions/2011-09-19/lod-cloud_colored.png
QA systems
Quality
assessment
of the LOD
datasets
The answer lies here!
•
•
Digging into the QA system
Typical IR system performances
measures
● Overall Performance
○ F1
○ Precision
○ Recall
Digging into the QA system
Data & Component/Module
oriented measures
● Search & retrieval module
○ Indexer
○ Retriever
● Preprocessing / Linguistic
○ NLP - POS tags, NER, etc
○ Entity linking & annotation - semantics
○ Relation extraction & annotation
● Query formulation
○ SPARQL conversion
● Datasource/knowledge base
○ Data
Typical IR system performances
measures
● Overall Performance
○ F1
○ Precision
○ Recall
Digging into the QA system
Data & Component/Module
oriented measures
● Search & retrieval module
○ Indexer
■ Top K words accuracy; P@10,
P@1000, etc
○ Retriever
■ Ranking, Re-ranking, MRR, etc
● Preprocessing / Linguistic
○ NLP - POS tags, NER, etc
○ Entity linking & annotation - semantics
○ Relation extraction & annotation
■ annotation accuracy/precision
■ consistency, interlinking, etc
● Query formulation
○ SPARQL conversion
■ conversion accuracy/precision
● Datasource/knowledge base
○ Completeness
○ Data diversity
○ Trust and Provenance
○ Coverage
○ Timeliness (up to date)
○ etc
Typical IR system performances
measures
● Overall Performance
○ F1
○ Precision
○ Recall
Digging into the QA system
Data & Component/Module
oriented measures
● Search & retrieval module
○ Indexer
■ Top K words accuracy; P@10,
P@1000, etc
○ Retriever
■ Ranking, Re-ranking, MRR, etc
● Preprocessing / Linguistic
○ NLP - POS tags, NER, etc
○ Entity linking & annotation - semantics
○ Relation extraction & annotation
■ annotation accuracy/precision
■ consistency, interlinking, etc
● Query formulation
○ SPARQL conversion
■ conversion accuracy/precision
● Datasource/Knowledge base
○ Completeness
○ Data diversity
○ Trust and Provenance
○ Coverage
○ Timeliness (up to date)
○ etc
Typical IR system performances
measures
● Overall Performance
○ F1
○ Precision
○ Recall
•
•
Evaluated in this study
•
owl:DatatypeProperty
dc:creator dc:publisher
●
○
○
●
○
■
■
■
■
●
○
○
●
○
○
○
●
DBpedia data slice sizes (in MB)Wikidata data slice sizes (in MB)
Dimension Metric DB_Rest DB_Poli DB_Film DB_Soc
Availability
EstimatedDereferenceabilityMetric 0.013 0.013 0.012 0.012
EstimatedDereferenceabilityForwardLinksMetric 0.027 0.027 0.027 0.027
NoMisreportedContentTypesMetric 0 1 1 1
RDFAvailabilityMetric 0 0 0 0
EndPointAvailabilityMetric 0 0 0 0
Interlinking
EstimatedInterlinkDetectionMetric - - - -
EstimatedLinkExternalDataProviders - - - -
EstimatedDereferenceBackLinks 0.012 0.014 0.015 0.022
Semantic
accuracy
OntologyHijacking 1 1 1 1
MisusedOwlDatatypeOrObjectProperties 1 1 1 1
Data diversity
HumanReadableLabelling 0.953 0.985 0.997 1
MultipleLanguageUsageMteric 1 2 3 3
Trust and
Provenance
Basic Provenance 0 0 0 0
Extended Provenance 0 0 0 0
Provenance Richness 0 0 0 0
DBPEDIA SLICE ASSESSMENT RESULTS
WIKIDATA SLICE ASSESSMENT RESULTS
Dimension Metric Wiki_Rest Wiki_Poli Wiki_Film Wiki_Soc
Availability
EstimatedDereferenceabilityMetric 0.051 0.063 0.048 0.062
EstimatedDereferenceabilityForwardLinksMetric 0.093 0.053 0.050 0.064
NoMisreportedContentTypesMetric 0 1 0 1
RDFAvailabilityMetric 0 0 0 0
EndPointAvailabilityMetric 0 0 0 0
Interlinking
EstimatedInterlinkDetectionMetric - - - -
EstimatedLinkExternalDataProviders 5 11 9 8
EstimatedDereferenceBackLinks 0.013 0.098 0.089 0.083
Semantic
accuracy
OntologyHijacking 1 1 1 1
MisusedOwlDatatypeOrObjectProperties 1 1 1 1
Data diversity
HumanReadableLabelling 0.175 0.076 0.091 0.102
MultipleLanguageUsageMteric 2 3 2 3
Trust and
Provenance
Basic Provenance 0 0 0 0
Extended Provenance 0 0 0 0
Provenance Richness 0.055 0.083 0.010 0.025
●
○
○
○
●
○
○ …
○
QUESTIONS?
<hthakkar@uni-bonn.de>

Más contenido relacionado

La actualidad más candente

Henning agt talk-caise-semnet
Henning agt   talk-caise-semnetHenning agt   talk-caise-semnet
Henning agt talk-caise-semnet
caise2013vlc
 
Achieving time effective federated information from scalable rdf data using s...
Achieving time effective federated information from scalable rdf data using s...Achieving time effective federated information from scalable rdf data using s...
Achieving time effective federated information from scalable rdf data using s...
తేజ దండిభట్ల
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
Franz Inc. - AllegroGraph
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
giurca
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
National Institute of Informatics
 

La actualidad más candente (20)

The Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open DataThe Power of Semantic Technologies to Explore Linked Open Data
The Power of Semantic Technologies to Explore Linked Open Data
 
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sfSparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
Sparql querying of-property-graphs-harsh thakkar-graph day 2017 sf
 
ETL All The Things with Ruby
ETL All The Things with RubyETL All The Things with Ruby
ETL All The Things with Ruby
 
Henning agt talk-caise-semnet
Henning agt   talk-caise-semnetHenning agt   talk-caise-semnet
Henning agt talk-caise-semnet
 
Achieving time effective federated information from scalable rdf data using s...
Achieving time effective federated information from scalable rdf data using s...Achieving time effective federated information from scalable rdf data using s...
Achieving time effective federated information from scalable rdf data using s...
 
Proposal for open government data
Proposal for open government dataProposal for open government data
Proposal for open government data
 
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
Tutorial "Linked Data Query Processing" Part 4 "Execution Process" (WWW 2013 ...
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
 
Pandas
PandasPandas
Pandas
 
Normalizing Data for Migrations
Normalizing Data for MigrationsNormalizing Data for Migrations
Normalizing Data for Migrations
 
Towards Flexible Indices for Distributed Graph Data: The Formal Schema-level...
Towards Flexible Indices for  Distributed Graph Data: The Formal Schema-level...Towards Flexible Indices for  Distributed Graph Data: The Formal Schema-level...
Towards Flexible Indices for Distributed Graph Data: The Formal Schema-level...
 
LinkML presentation to Yosemite Group
LinkML presentation to Yosemite GroupLinkML presentation to Yosemite Group
LinkML presentation to Yosemite Group
 
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
 
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4jExplicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
 
JSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge GraphsJSON-LD and SHACL for Knowledge Graphs
JSON-LD and SHACL for Knowledge Graphs
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
 
NoSql evaluation
NoSql evaluationNoSql evaluation
NoSql evaluation
 
Introduction to data analysis using R
Introduction to data analysis using RIntroduction to data analysis using R
Introduction to data analysis using R
 
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as KnowledgeRDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
 
Clustering output of Apache Nutch using Apache Spark
Clustering output of Apache Nutch using Apache SparkClustering output of Apache Nutch using Apache Spark
Clustering output of Apache Nutch using Apache Spark
 

Similar a Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment

print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
lathass5
 

Similar a Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment (20)

Anatomy of Data Frame API : A deep dive into Spark Data Frame API
Anatomy of Data Frame API :  A deep dive into Spark Data Frame APIAnatomy of Data Frame API :  A deep dive into Spark Data Frame API
Anatomy of Data Frame API : A deep dive into Spark Data Frame API
 
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
AnzoGraph DB: Driving AI and Machine Insights with Knowledge Graphs in a Conn...
 
Data pipelines observability: OpenLineage & Marquez
Data pipelines observability:  OpenLineage & MarquezData pipelines observability:  OpenLineage & Marquez
Data pipelines observability: OpenLineage & Marquez
 
CNCF opa
CNCF opaCNCF opa
CNCF opa
 
print mod 2.pdf
print mod 2.pdfprint mod 2.pdf
print mod 2.pdf
 
Pivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalRPivotal OSS meetup - MADlib and PivotalR
Pivotal OSS meetup - MADlib and PivotalR
 
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in PythonThe Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python
 
Machine learning pipeline with spark ml
Machine learning pipeline with spark mlMachine learning pipeline with spark ml
Machine learning pipeline with spark ml
 
Heterogenous Persistence
Heterogenous PersistenceHeterogenous Persistence
Heterogenous Persistence
 
MongoDB .local Munich 2019: A Complete Methodology to Data Modeling for MongoDB
MongoDB .local Munich 2019: A Complete Methodology to Data Modeling for MongoDBMongoDB .local Munich 2019: A Complete Methodology to Data Modeling for MongoDB
MongoDB .local Munich 2019: A Complete Methodology to Data Modeling for MongoDB
 
Introducing Datawave
Introducing DatawaveIntroducing Datawave
Introducing Datawave
 
Preparing Your Legacy Data for Automation in S1000D
Preparing Your Legacy Data for Automation in S1000DPreparing Your Legacy Data for Automation in S1000D
Preparing Your Legacy Data for Automation in S1000D
 
IoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDBIoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDB
 
Instant search - A hands-on tutorial
Instant search  - A hands-on tutorialInstant search  - A hands-on tutorial
Instant search - A hands-on tutorial
 
dipLODocus[RDF]: Short and Long-Tail RDF Analytics for Massive Webs of Data
dipLODocus[RDF]: Short and Long-Tail RDF Analytics for Massive Webs of DatadipLODocus[RDF]: Short and Long-Tail RDF Analytics for Massive Webs of Data
dipLODocus[RDF]: Short and Long-Tail RDF Analytics for Massive Webs of Data
 
Lessons learned from designing a QA Automation for analytics databases (big d...
Lessons learned from designing a QA Automation for analytics databases (big d...Lessons learned from designing a QA Automation for analytics databases (big d...
Lessons learned from designing a QA Automation for analytics databases (big d...
 
Spark Summit EU 2015: Combining the Strengths of MLlib, scikit-learn, and R
Spark Summit EU 2015: Combining the Strengths of MLlib, scikit-learn, and RSpark Summit EU 2015: Combining the Strengths of MLlib, scikit-learn, and R
Spark Summit EU 2015: Combining the Strengths of MLlib, scikit-learn, and R
 
Real-time analytics with Druid at Appsflyer
Real-time analytics with Druid at AppsflyerReal-time analytics with Druid at Appsflyer
Real-time analytics with Druid at Appsflyer
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)
 
Big Data processing with Apache Spark
Big Data processing with Apache SparkBig Data processing with Apache Spark
Big Data processing with Apache Spark
 

Último

Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
dharasingh5698
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
MsecMca
 

Último (20)

Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 

Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment

  • 1.
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13. QA systems Quality assessment of the LOD datasets The answer lies here!
  • 15. Digging into the QA system Typical IR system performances measures ● Overall Performance ○ F1 ○ Precision ○ Recall
  • 16. Digging into the QA system Data & Component/Module oriented measures ● Search & retrieval module ○ Indexer ○ Retriever ● Preprocessing / Linguistic ○ NLP - POS tags, NER, etc ○ Entity linking & annotation - semantics ○ Relation extraction & annotation ● Query formulation ○ SPARQL conversion ● Datasource/knowledge base ○ Data Typical IR system performances measures ● Overall Performance ○ F1 ○ Precision ○ Recall
  • 17. Digging into the QA system Data & Component/Module oriented measures ● Search & retrieval module ○ Indexer ■ Top K words accuracy; P@10, P@1000, etc ○ Retriever ■ Ranking, Re-ranking, MRR, etc ● Preprocessing / Linguistic ○ NLP - POS tags, NER, etc ○ Entity linking & annotation - semantics ○ Relation extraction & annotation ■ annotation accuracy/precision ■ consistency, interlinking, etc ● Query formulation ○ SPARQL conversion ■ conversion accuracy/precision ● Datasource/knowledge base ○ Completeness ○ Data diversity ○ Trust and Provenance ○ Coverage ○ Timeliness (up to date) ○ etc Typical IR system performances measures ● Overall Performance ○ F1 ○ Precision ○ Recall
  • 18. Digging into the QA system Data & Component/Module oriented measures ● Search & retrieval module ○ Indexer ■ Top K words accuracy; P@10, P@1000, etc ○ Retriever ■ Ranking, Re-ranking, MRR, etc ● Preprocessing / Linguistic ○ NLP - POS tags, NER, etc ○ Entity linking & annotation - semantics ○ Relation extraction & annotation ■ annotation accuracy/precision ■ consistency, interlinking, etc ● Query formulation ○ SPARQL conversion ■ conversion accuracy/precision ● Datasource/Knowledge base ○ Completeness ○ Data diversity ○ Trust and Provenance ○ Coverage ○ Timeliness (up to date) ○ etc Typical IR system performances measures ● Overall Performance ○ F1 ○ Precision ○ Recall
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 28.
  • 31. DBpedia data slice sizes (in MB)Wikidata data slice sizes (in MB)
  • 32. Dimension Metric DB_Rest DB_Poli DB_Film DB_Soc Availability EstimatedDereferenceabilityMetric 0.013 0.013 0.012 0.012 EstimatedDereferenceabilityForwardLinksMetric 0.027 0.027 0.027 0.027 NoMisreportedContentTypesMetric 0 1 1 1 RDFAvailabilityMetric 0 0 0 0 EndPointAvailabilityMetric 0 0 0 0 Interlinking EstimatedInterlinkDetectionMetric - - - - EstimatedLinkExternalDataProviders - - - - EstimatedDereferenceBackLinks 0.012 0.014 0.015 0.022 Semantic accuracy OntologyHijacking 1 1 1 1 MisusedOwlDatatypeOrObjectProperties 1 1 1 1 Data diversity HumanReadableLabelling 0.953 0.985 0.997 1 MultipleLanguageUsageMteric 1 2 3 3 Trust and Provenance Basic Provenance 0 0 0 0 Extended Provenance 0 0 0 0 Provenance Richness 0 0 0 0 DBPEDIA SLICE ASSESSMENT RESULTS
  • 33. WIKIDATA SLICE ASSESSMENT RESULTS Dimension Metric Wiki_Rest Wiki_Poli Wiki_Film Wiki_Soc Availability EstimatedDereferenceabilityMetric 0.051 0.063 0.048 0.062 EstimatedDereferenceabilityForwardLinksMetric 0.093 0.053 0.050 0.064 NoMisreportedContentTypesMetric 0 1 0 1 RDFAvailabilityMetric 0 0 0 0 EndPointAvailabilityMetric 0 0 0 0 Interlinking EstimatedInterlinkDetectionMetric - - - - EstimatedLinkExternalDataProviders 5 11 9 8 EstimatedDereferenceBackLinks 0.013 0.098 0.089 0.083 Semantic accuracy OntologyHijacking 1 1 1 1 MisusedOwlDatatypeOrObjectProperties 1 1 1 1 Data diversity HumanReadableLabelling 0.175 0.076 0.091 0.102 MultipleLanguageUsageMteric 2 3 2 3 Trust and Provenance Basic Provenance 0 0 0 0 Extended Provenance 0 0 0 0 Provenance Richness 0.055 0.083 0.010 0.025
  • 34.