Creating Knowledge out of Interlinked Data
          JIST 2012 – Page 1                                                      http://lod2.eu




  Improving the Performance of the
 DL-Learner SPARQL Component for
     Semantic Web Applications
                  Didier Cherix, Sebastian Hellmann, Jens Lehmann

                         http://slideshare.net/kurzum


                                                          http://dl-learner.org
                                                             http://lod2.eu


                                                            AKSW, Universität Leipzig
LOD2 Presentation . 02.09.2010 . Page                                      http://lod2.eu




              Motivation: 2007 - 2012

DL-Learner has been developed in parallel to DBpedia at Universität Leipzig since 2007

DL-Learner is a tool for learning concepts in Description Logics (DLs) from
user-provided examples.

Worked very well for small to medium-sized data sets, e.g. Carcinogenesis and other
ML problems from the UCI ML repository

The limit is the capacity of current OWL-DL reasoners

The challenge was (and is) to do reasoning-based, supervised Machine Learning on
the DBpedia dataset (> 200 million triples) or larger datasets




 Introduction DL-Learner




                           Very large search space

                           Reasoner instance checks




              Introduction DL-Learner
DL-Learner relies heavily on instance checks for machine learning, so the OWL
reasoner is the bottleneck

Underlying idea:
Select only the data relevant to the Machine Learning problem, based on the user-given
examples

→ Reduces the number of triples that have to be given to a reasoner
→ Reduces complexity and size of the OWL schema

Brute-force approach:
Load all data into the OWL reasoner, then do instance checks
→ infeasible for DBpedia

Iterative approach (old component):
Iterate over all instances and fetch the data recursively
→ inefficient even with caching




 Introduction DL-Learner




                           Challenge:
   What is the most efficient way to retrieve such a fragment?




           Improvements of the New Component

•   Step 1: Indexing the T-Box:
     • Download the OWL Schema and index it in memory
     • either via SPARQL or OWL file
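
A minimal sketch of what an in-memory T-Box index could look like. The loading step (SPARQL or OWL file) is replaced here by a plain list of triples, and the function name, the index layout, and the toy class names are assumptions made for illustration, not the component's actual code.

```python
from collections import defaultdict

RDFS_SUBCLASS = "rdfs:subClassOf"
OWL_EQUIVALENT = "owl:equivalentClass"


def build_tbox_index(schema_triples):
    """Index schema axioms by class so later lookups need no endpoint queries."""
    index = {"super": defaultdict(set), "equiv": defaultdict(set)}
    for s, p, o in schema_triples:
        if p == RDFS_SUBCLASS:
            index["super"][s].add(o)
        elif p == OWL_EQUIVALENT:
            # Equivalence is symmetric, so record both directions.
            index["equiv"][s].add(o)
            index["equiv"][o].add(s)
    return index


# Toy schema standing in for a downloaded ontology (illustrative data only).
schema = [
    ("dbo:City", RDFS_SUBCLASS, "dbo:Settlement"),
    ("dbo:Settlement", RDFS_SUBCLASS, "dbo:PopulatedPlace"),
    ("dbo:PopulatedPlace", RDFS_SUBCLASS, "dbo:Place"),
    ("dbo:Place", OWL_EQUIVALENT, "schema:Place"),
]
tbox_index = build_tbox_index(schema)
```

Once the index is built, superclass and equivalence lookups are dictionary reads, so the schema does not have to be queried again while a fragment is assembled.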




              Improvements of the New Component

 •   Step 2: A-Box Queries




Parameter recursion depth:
Retrieve newly discovered bindings to ?o until a certain depth is reached.
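
The effect of the recursion depth parameter can be sketched as a level-by-level expansion. The query shown on the slide is not part of this transcript, so the sketch assumes a hypothetical `fetch(subject)` callback that returns (?p, ?o) pairs for one subject; a toy dictionary stands in for the SPARQL endpoint, and all names are illustrative.

```python
def is_resource(term):
    """Toy convention: literals are written with a leading double quote."""
    return not term.startswith('"')


def retrieve_fragment(examples, fetch, depth):
    """Collect triples around the example instances, up to `depth` hops away."""
    fragment = set()
    seen = set(examples)
    frontier = set(examples)
    for _ in range(depth):
        next_frontier = set()
        for s in frontier:
            for p, o in fetch(s):
                fragment.add((s, p, o))
                # Only newly discovered resources are expanded in the next round.
                if is_resource(o) and o not in seen:
                    seen.add(o)
                    next_frontier.add(o)
        frontier = next_frontier
    return fragment


# Toy data standing in for a SPARQL endpoint (illustrative only).
STORE = {
    "dbr:Leipzig": [("dbo:country", "dbr:Germany"), ("rdfs:label", '"Leipzig"')],
    "dbr:Germany": [("dbo:capital", "dbr:Berlin")],
    "dbr:Berlin": [("dbo:country", "dbr:Germany")],
}


def toy_fetch(subject):
    return STORE.get(subject, [])


depth1 = retrieve_fragment(["dbr:Leipzig"], toy_fetch, depth=1)
depth2 = retrieve_fragment(["dbr:Leipzig"], toy_fetch, depth=2)
```

With depth 1 only the triples of the example itself are fetched; depth 2 additionally pulls in the triples of the objects found in the first round.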




           Improvements of the New Component

•   Step 3: Typing the retrieved instances




           Improvements of the New Component

•   Step 4: T-Box Index:
    All “relevant” T-Box information is added to the fragment via the index.
    For each class already in the fragment, all superclasses and their
    equivalentClass axioms are added
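
One way to read this step is as an upward closure over the class hierarchy. The sketch below assumes two plain dictionaries as the index (class to direct superclasses, class to equivalent classes) and adds equivalentClass axioms for every class it visits; both the data layout and that reading of "their" are assumptions.

```python
def relevant_tbox_axioms(fragment_classes, superclasses, equivalents):
    """Walk upwards from the classes in the fragment and collect schema axioms."""
    axioms = set()
    todo = list(fragment_classes)
    visited = set()
    while todo:
        cls = todo.pop()
        if cls in visited:
            continue
        visited.add(cls)
        for eq in equivalents.get(cls, ()):
            axioms.add((cls, "owl:equivalentClass", eq))
        for sup in superclasses.get(cls, ()):
            axioms.add((cls, "rdfs:subClassOf", sup))
            todo.append(sup)  # superclasses are followed transitively
    return axioms


# Toy index (illustrative data only).
superclasses = {
    "dbo:City": {"dbo:Settlement"},
    "dbo:Settlement": {"dbo:PopulatedPlace"},
    "dbo:PopulatedPlace": {"dbo:Place"},
    "dbo:Person": {"dbo:Agent"},
}
equivalents = {"dbo:Place": {"schema:Place"}}

axioms = relevant_tbox_axioms({"dbo:City"}, superclasses, equivalents)
```

Classes that are not reachable from the fragment, such as dbo:Person in the toy data, contribute nothing, which is what keeps the schema handed to the reasoner small.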




             Benchmarking - Speed

For each class in DBpedia Ontology:
- 30 instances as positives
- 30 negatives from a sister class
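
The benchmark setup can be sketched as follows. The slide does not say how the sister class or the instances were chosen, so this sketch assumes that a sister class shares the direct superclass and that instances are sampled at random; all names and the toy data are hypothetical.

```python
import random


def sister_classes(cls, parent_of):
    """Classes sharing the direct superclass of `cls` (assumed meaning of 'sister')."""
    parent = parent_of.get(cls)
    if parent is None:
        return []
    return sorted(c for c, p in parent_of.items() if p == parent and c != cls)


def build_learning_problem(cls, parent_of, instances_of, n=30, seed=0):
    """Return (positives, negatives) with n examples each, or None if impossible."""
    rng = random.Random(seed)
    own = instances_of.get(cls, [])
    if len(own) < n:
        return None
    for sister in sister_classes(cls, parent_of):
        # Negatives must not also be instances of the target class.
        pool = [i for i in instances_of.get(sister, []) if i not in own]
        if len(pool) >= n:
            return rng.sample(own, n), rng.sample(pool, n)
    return None


# Toy hierarchy and instance lists (illustrative data only).
parent_of = {
    "dbo:City": "dbo:Settlement",
    "dbo:Town": "dbo:Settlement",
    "dbo:Village": "dbo:Settlement",
}
instances_of = {
    "dbo:City": [f"ex:city{i}" for i in range(40)],
    "dbo:Town": [f"ex:town{i}" for i in range(40)],
    "dbo:Village": [f"ex:village{i}" for i in range(10)],
}
positives, negatives = build_learning_problem("dbo:City", parent_of, instances_of)
```

Classes with fewer than 30 instances, or without a large enough sister class, yield no learning problem in this sketch.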




 Benchmarking – F-Measure on the training data




             70% of the results for each class
     had an F-measure of 90-100% on the training data
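
For reference, an F-measure on the training data can be computed as below. The slide only says "F-measure", so the balanced F1 variant is an assumption, and the function and variable names are hypothetical.

```python
def f_measure(covered, positives, negatives):
    """Balanced F-measure (F1) of a learned class over the training examples."""
    tp = len(covered & positives)   # positives the learned class covers
    fp = len(covered & negatives)   # negatives it wrongly covers
    fn = len(positives - covered)   # positives it misses
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# 30 positives and 30 negatives, as in the benchmark setup (toy identifiers).
pos = {f"ex:pos{i}" for i in range(30)}
neg = {f"ex:neg{i}" for i in range(30)}
# A hypothetical learned class covering 27 positives and 3 negatives.
covered = {f"ex:pos{i}" for i in range(27)} | {f"ex:neg{i}" for i in range(3)}
score = f_measure(covered, pos, neg)
```

In this toy case precision and recall are both 27/30, so the score falls at the lower edge of the 90-100% band mentioned on the slide.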




              SPARQL Retrieval Component Impact

•    DL-Learner – http://dl-learner.org
•    DBpedia Navigator
•    Tiger Corpus Navigator
•    AutoSPARQL - http://autosparql.dl-learner.org/
•    HANNE – http://hanne.aksw.org
•    ORE - http://aksw.org/Projects/ORE


    Sebastian Hellmann, Jens Lehmann and Sören Auer:
    Learning of OWL Class Descriptions on Very Large Knowledge Bases
    In: International Journal on Semantic Web and Information Systems, 2009


     Web Applications
     Active Learning → User Interaction and Feedback




           Future Work

•   Research Paper in Session 4b (tomorrow at 15:10)
    Navigation-induced Knowledge Engineering by Example
•   Caching + more sophisticated options
•   Large scale learning problems


                          http://slideshare.net/kurzum


                                Homepage: http://dl-learner.org
                                Source code:
                                http://sourceforge.net/projects/dl-learner/




             Example




Sebastian Hellmann, Jens Lehmann, Jörg Unbehauen, Claus Stadler, Thanh Nghia Lam and Markus
Strohmaier: Navigation-induced Knowledge Engineering by Example
In: JIST 2012




          Example




Sebastian Hellmann, Jens Lehmann and Sören Auer:
Learning of OWL Class Descriptions on Very Large Knowledge Bases
In: International Journal on Semantic Web and Information Systems, 2009
