SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
Creating Knowledge out of Interlinked Data
          MultilingualWeb – 2012/06/11 Dublin – Page 1                           http://lod2.eu




        Linked Data in Linguistics
      for NLP and Web Annotation



                                                            http://nlp2rdf.org
                                                              http://lod2.eu
                                                         Sebastian Hellmann
                                                            AKSW, Universität Leipzig
LOD2 Presentation . 02.09.2010 . Page                                      http://lod2.eu
MultilingualWeb – 2012/06/11 Dublin – Page 2   http://lod2.eu




 The Semantic Gap
MultilingualWeb – 2012/06/11 Dublin – Page 3                         http://lod2.eu

          Turning Walled Gardens into Park Networks of
          Semantic Linguistic Data
How can we leverage the Data Web for natural
language processing?
                                                            50 Billion facts covering
                                                            all kinds of domains are
                                                            readily available
                                    1. Use the Data         Leverage the wisdom of
                                        Web as              the crowds
                                      background
                                     knowledge for
                                          NLP

                                                    2. Use Data
                         3. Make the
                                                        Web
                        output of NLP
                                                   technologies
                       tools available
                                                  for integrating    RDF is all about
                         on the Data
On the Web, by                                      NLP tools &      semantic
                             Web
sharing and                                         approaches
                                                                     interoperability
copying the value
of information
increases
MultilingualWeb – 2012/06/11 Dublin – Page 4                            http://lod2.eu

 1. Use the Data Web as
 background knowledge for NLP




                                               Linguistic Data currently filed
                                                   under “cross-domain”
MultilingualWeb – 2012/06/11 Dublin – Page 5        http://lod2.eu

    1. Use the Data Web as
    background knowledge for NLP


Three communities with three resources:
 • Working Group for Open Linguistics Data (OWLG)
    – > http://linguistics.okfn.org
 • DBpedia Internationalization Committee
    – > http://wiki.dbpedia.org/Internationalization
 • Wiktionary2RDF Wrappers
    – > http://dbpedia.org/Wiktionary
All communities are open, please join!
MultilingualWeb – 2012/06/11 Dublin – Page 6   http://lod2.eu




 The Linguistic Linked Open Data Cloud
MultilingualWeb – 2012/06/11 Dublin – Page 7   http://lod2.eu




 Main question
MultilingualWeb – 2012/06/11 Dublin – Page 8                        http://lod2.eu




 Wiktionary2RDF – Mediator Wrapper
                                               http://dbpedia.org/Wiktionary
MultilingualWeb – 2012/06/11 Dublin – Page 9                        http://lod2.eu




 Wiktionary2RDF – Mediator Wrapper
                                               http://dbpedia.org/Wiktionary


                                                                 Mediator
                                                                  Lemon
MultilingualWeb – 2012/06/11 Dublin – Page 10     http://lod2.eu

          2. Use Data Web Technologies for
          Integrating NLP Tools and Approaches


Golden Hammer Anti-pattern



The question is not whether to
use RDF and Linked Data, but when
to use...




  Image from http://pbmo.wordpress.com/2011/09/29/maslows-hammer/
MultilingualWeb – 2012/06/11 Dublin – Page 11   http://lod2.eu
MultilingualWeb – 2012/06/11 Dublin – Page 12   http://lod2.eu

   2. Use Data Web Technologies for
   Integrating NLP Tools and Approaches




• Ontologies provide (formal) documentation (UML, ERD)
• Structure is easy to understand
• Wide range of RDF tools can be used, e.g. LOD2 Stack
• Indexing and querying as Big Picture possible
MultilingualWeb – 2012/06/11 Dublin – Page 13         http://lod2.eu

      2. Use Data Web Technologies for
      Integrating NLP Tools and Approaches

  The NLP Interchange Format (NIF) is an RDF/OWL-based
  format that aims to achieve interoperability between Natural
  Language Processing (NLP) tools, language resources and
  annotations.
• Road map
   • Bootstrapped by LOD2, but a community project
   • First release in September 2011
   • Great resonance
      – Over 50 people joined the mailing list:
         http://lists.okfn.org/mailman/listinfo/open-linguistics
      – First third party implementations and contributions
      – Several project discuss usage
   • Currently setting up advisory board, next draft in July
MultilingualWeb – 2012/06/11 Dublin – Page 14                                         http://lod2.eu




S. Auer and S. Hellmann: The Web of Data: Decentralized, collaborative, interlinked and interoperable
LREC 2012, http://www.lrec-conf.org/proceedings/lrec2012/keynotes/LREC%202012.Keynote%20Speech%201.Soeren%20Auer.pdf
MultilingualWeb – 2012/06/11 Dublin – Page 15   http://lod2.eu

        3. Make the Output of NLP Tools
         available on the Web




Currently there is no standard mechanism to transparently
combine the WWW, GGG and NLP




GGG = Giant Global Graph (basically the Web of Data)
see: http://dig.csail.mit.edu/breadcrumbs/node/215
MultilingualWeb – 2012/06/11 Dublin – Page 16   http://lod2.eu

 3. Make the Output of NLP Tools
  available on the Web
MultilingualWeb – 2012/06/11 Dublin – Page 17            http://lod2.eu

           3. Make the Output of NLP Tools
            available on the Web




http://dbpedia.org/spotlight P. Mendes et. al. DBpedia spotlight: Shedding
           light on the web of documents. In I-Semantics, 2011
MultilingualWeb – 2012/06/11 Dublin – Page 18    http://lod2.eu

 3. Make the Output of NLP Tools
  available on the Web




http://annotateit.org
http://sourceforge.net/projects/fragmentlinks/
MultilingualWeb – 2012/06/11 Dublin – Page 19                http://lod2.eu

         3. Make the Output of NLP Tools
          available on the Web

        NLP Interchange Format (NIF) join the mailing list at:
                         http://nlp2rdf.org




Hellmann et.al.: Towards an Ontology for Representing Strings In: EKAW 2012
    http://svn.aksw.org/papers/2012/WWW_NIF/public/string_ontology.pdf
LOD2 Title . 02.09.2010 . Page 20                               http://lod2.eu




            Contact

            Address

            University of Leipzig
            Faculty of Mathematics and Computer
            Science
            Institute of Computer Science
            Department of Business Information
            Systems

            Postfach 100920
            04009 Leipzig
            Germany



     Project: http://lod2.eu
     Organisation: http://uni-leipzig.de, http://aksw.org
     Presenter: http://bis.informatik.uni-leipzig.de/SebastianHellmann
     NLP2RDF page: http://nlp2rdf.org

                                                     Acknowledgement:
       CC-BY-SA                            some slides are taken from the keynote
  Thanks for your
unless otherwise stated                         of Sören Auer at LREC 2012

Más contenido relacionado

Similar a Linked Data in Linguistics for NLP and Web Annotation

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked DataSebastian Hellmann
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikisSören Auer
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...Sebastian Hellmann
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportSebastian Hellmann
 
Semantic personalisation in networked media: determining the background know...
Semantic personalisation in networked media: determining the  background know...Semantic personalisation in networked media: determining the  background know...
Semantic personalisation in networked media: determining the background know...LinkedTV
 
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois ScharffeDatalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois Scharffewebscience-montpellier
 

Similar a Linked Data in Linguistics for NLP and Web Annotation (20)

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
NIF 2.0 draft for Pisa
NIF 2.0 draft for PisaNIF 2.0 draft for Pisa
NIF 2.0 draft for Pisa
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked Data
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
LOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViewsLOD2 Webinar: UnifiedViews
LOD2 Webinar: UnifiedViews
 
Linked data and semantic wikis
Linked data and semantic wikisLinked data and semantic wikis
Linked data and semantic wikis
 
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General PresentationLOD2 - Creating Knowledge out of Interlinked Data - General Presentation
LOD2 - Creating Knowledge out of Interlinked Data - General Presentation
 
LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge FusionLOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
 
LOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industryLOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industry
 
LOD2 Webinar Series FOX
LOD2 Webinar Series FOXLOD2 Webinar Series FOX
LOD2 Webinar Series FOX
 
Free Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st releaseFree Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st release
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIFLOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
LOD2: State of Play WP3B - Knowledge Extraction, NLP2RDF + NIF
 
Semantic personalisation in networked media: determining the background know...
Semantic personalisation in networked media: determining the  background know...Semantic personalisation in networked media: determining the  background know...
Semantic personalisation in networked media: determining the background know...
 
LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 
Datalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois ScharffeDatalift: A Catalyser for the Web of Data - Francois Scharffe
Datalift: A Catalyser for the Web of Data - Francois Scharffe
 

Más de Sebastian Hellmann

Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkSebastian Hellmann
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016Sebastian Hellmann
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015Sebastian Hellmann
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataSebastian Hellmann
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web Sebastian Hellmann
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationSebastian Hellmann
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by ExampleSebastian Hellmann
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23Sebastian Hellmann
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftSebastian Hellmann
 

Más de Sebastian Hellmann (14)

KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future Work
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015
 
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and Segmentation
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by Example
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23
 
Tool collection as linkeddata
Tool collection as linkeddataTool collection as linkeddata
Tool collection as linkeddata
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draft
 

Último

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Último (20)

Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Linked Data in Linguistics for NLP and Web Annotation

  • 1. Creating Knowledge out of Interlinked Data MultilingualWeb – 2012/06/11 Dublin – Page 1 http://lod2.eu Linked Data in Linguistics for NLP and Web Annotation http://nlp2rdf.org http://lod2.eu Sebastian Hellmann AKSW, Universität Leipzig LOD2 Presentation . 02.09.2010 . Page http://lod2.eu
  • 2. MultilingualWeb – 2012/06/11 Dublin – Page 2 http://lod2.eu The Semantic Gap
  • 3. MultilingualWeb – 2012/06/11 Dublin – Page 3 http://lod2.eu Turning Walled Gardens into Park Networks of Semantic Linguistic Data How can we leverage the Data Web for natural language processing? 50 Billion facts covering all kinds of domains are readily available 1. Use the Data Leverage the wisdom of Web as the crowds background knowledge for NLP 2. Use Data 3. Make the Web output of NLP technologies tools available for integrating RDF is all about on the Data On the Web, by NLP tools & semantic Web sharing and approaches interoperability copying the value of information increases
  • 4. MultilingualWeb – 2012/06/11 Dublin – Page 4 http://lod2.eu 1. Use the Data Web as background knowledge for NLP Linguistic Data currently filed under “cross-domain”
  • 5. MultilingualWeb – 2012/06/11 Dublin – Page 5 http://lod2.eu 1. Use the Data Web as background knowledge for NLP Three communities with three resources: • Working Group for Open Linguistics Data (OWLG) – > http://linguistics.okfn.org • DBpedia Internationalization Committee – > http://wiki.dbpedia.org/Internationalization • Wiktionary2RDF Wrappers – > http://dbpedia.org/Wiktionary All communities are open, please join!
  • 6. MultilingualWeb – 2012/06/11 Dublin – Page 6 http://lod2.eu The Linguistic Linked Open Data Cloud
  • 7. MultilingualWeb – 2012/06/11 Dublin – Page 7 http://lod2.eu Main question
  • 8. MultilingualWeb – 2012/06/11 Dublin – Page 8 http://lod2.eu Wiktionary2RDF – Mediator Wrapper http://dbpedia.org/Wiktionary
  • 9. MultilingualWeb – 2012/06/11 Dublin – Page 9 http://lod2.eu Wiktionary2RDF – Mediator Wrapper http://dbpedia.org/Wiktionary Mediator Lemon
  • 10. MultilingualWeb – 2012/06/11 Dublin – Page 10 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches Golden Hammer Anti-pattern The question is not whether to use RDF and Linked Data, but when to use... Image from http://pbmo.wordpress.com/2011/09/29/maslows-hammer/
  • 11. MultilingualWeb – 2012/06/11 Dublin – Page 11 http://lod2.eu
  • 12. MultilingualWeb – 2012/06/11 Dublin – Page 12 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches • Ontologies provide (formal) documentation (UML, ERD) • Structure is easy to understand • Wide range of RDF tools can be used, e.g. LOD2 Stack • Indexing and querying as Big Picture possible
  • 13. MultilingualWeb – 2012/06/11 Dublin – Page 13 http://lod2.eu 2. Use Data Web Technologies for Integrating NLP Tools and Approaches The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. • Road map • Bootstrapped by LOD2, but a community project • First release in September 2011 • Great resonance – Over 50 people joined the mailing list: http://lists.okfn.org/mailman/listinfo/open-linguistics – First third party implementations and contributions – Several project discuss usage • Currently setting up advisory board, next draft in July
  • 14. MultilingualWeb – 2012/06/11 Dublin – Page 14 http://lod2.eu S. Auer and S. Hellmann: The Web of Data: Decentralized, collaborative, interlinked and interoperable LREC 2012, http://www.lrec-conf.org/proceedings/lrec2012/keynotes/LREC%202012.Keynote%20Speech%201.Soeren%20Auer.pdf
  • 15. MultilingualWeb – 2012/06/11 Dublin – Page 15 http://lod2.eu 3. Make the Output of NLP Tools available on the Web Currently there is no standard mechanism to transparently combine the WWW, GGG and NLP GGG = Giant Global Graph (basically the Web of Data) see: http://dig.csail.mit.edu/breadcrumbs/node/215
  • 16. MultilingualWeb – 2012/06/11 Dublin – Page 16 http://lod2.eu 3. Make the Output of NLP Tools available on the Web
  • 17. MultilingualWeb – 2012/06/11 Dublin – Page 17 http://lod2.eu 3. Make the Output of NLP Tools available on the Web http://dbpedia.org/spotlight P. Mendes et. al. DBpedia spotlight: Shedding light on the web of documents. In I-Semantics, 2011
  • 18. MultilingualWeb – 2012/06/11 Dublin – Page 18 http://lod2.eu 3. Make the Output of NLP Tools available on the Web http://annotateit.org http://sourceforge.net/projects/fragmentlinks/
  • 19. MultilingualWeb – 2012/06/11 Dublin – Page 19 http://lod2.eu 3. Make the Output of NLP Tools available on the Web NLP Interchange Format (NIF) join the mailing list at: http://nlp2rdf.org Hellmann et.al.: Towards an Ontology for Representing Strings In: EKAW 2012 http://svn.aksw.org/papers/2012/WWW_NIF/public/string_ontology.pdf
  • 20. LOD2 Title . 02.09.2010 . Page 20 http://lod2.eu Contact Address University of Leipzig Faculty of Mathematics and Computer Science Institute of Computer Science Department of Business Information Systems Postfach 100920 04009 Leipzig Germany Project: http://lod2.eu Organisation: http://uni-leipzig.de, http://aksw.org Presenter: http://bis.informatik.uni-leipzig.de/SebastianHellmann NLP2RDF page: http://nlp2rdf.org Acknowledgement: CC-BY-SA some slides are taken from the keynote Thanks for your unless otherwise stated of Sören Auer at LREC 2012