www.sti-innsbruck.at© Copyright 2008 STI INNSBRUCK www.sti-innsbruck.at
NLP Interchange Format
José M. García
Outline
• What is NIF?
• Design requirements
• URI schemes
• NIF ontologies
• Use cases
• Relationship with ELRA
• Roadmap for NIF 2.0
• Conclusions
What is NIF?
• Natural Language Processing Interchange Format
• NIF is an RDF/OWL-based format that aims to achieve interoperability
between Natural Language Processing (NLP) tools, language
resources and annotations.
• Building blocks
– URI scheme for identifying elements in texts
– Ontology for describing common NLP terms
• Created and maintained by the AKSW group at the University of Leipzig,
as part of the LOD2 EU project.
• Community project: http://persistence.uni-leipzig.org/nlp2rdf/
NIF design requirements
• Compatibility with RDF
• Coverage
• Structural Interoperability
• Conceptual Interoperability
• Granularity
• Provenance and Confidence
• Simplicity
• Scalability
URI schemes
• Text needs to be referenceable by URIs
• With URI references, text can be used as a resource in RDF statements
• NIF distinguishes:
– Documents
– Text of the document
– Substrings of the text.
• A URI scheme is an algorithm to create IDs for the text and its substrings
• URI elements
– Document URI
– Separator
– Character indices
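
The three URI elements above can be combined mechanically. The following is a minimal sketch, assuming `#` as the separator and a `char=begin,end` notation for the character indices in the style of RFC 5147. The function names `make_string_uri` and `make_text_uri` are hypothetical and not part of NIF.

```python
def make_string_uri(document_uri: str, begin: int, end: int, separator: str = "#") -> str:
    """Build an identifier for the substring text[begin:end] of a document.

    Concatenates the three URI elements: document URI, separator, character indices.
    """
    if begin < 0 or end < begin:
        raise ValueError("expected 0 <= begin <= end")
    return f"{document_uri}{separator}char={begin},{end}"


def make_text_uri(document_uri: str, text: str) -> str:
    """Identify the whole text as the range from 0 to its length in characters."""
    return make_string_uri(document_uri, 0, len(text))
```

Because the identifier is derived only from the document URI and the offsets, two tools that process the same text can arrive at the same URI for the same substring without coordinating.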
RFC 5147
• Canonical URI scheme for NIF is based on RFC 5147
• It standardizes fragment identifiers for text/plain media type
• Examples
– Document: http://www.w3.org/DesignIssues/LinkedData.html
– Text of the document: http://www.w3.org/DesignIssues/LinkedData.html#char=0,26610
– Substring of the text: http://www.w3.org/DesignIssues/LinkedData.html#char=1206,1218
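
The `char=begin,end` ranges in the example URIs above can be resolved against the document text. The sketch below is a simplified reading of RFC 5147 that handles only the two-number range form and ignores the other fragment forms and integrity checks the RFC defines. The name `resolve_char_fragment` is hypothetical, and Python string indices are assumed to match the RFC's character positions.

```python
import re

# Only the "char=<begin>,<end>" form is recognised in this sketch.
_CHAR_RANGE = re.compile(r"^char=(\d+),(\d+)$")


def resolve_char_fragment(uri: str, text: str) -> str:
    """Return the substring of `text` addressed by a `#char=begin,end` fragment."""
    _, _, fragment = uri.partition("#")
    match = _CHAR_RANGE.match(fragment)
    if match is None:
        raise ValueError(f"not a char range fragment: {fragment!r}")
    begin, end = int(match.group(1)), int(match.group(2))
    if not begin <= end <= len(text):
        raise ValueError("range lies outside the text")
    # Positions sit between characters, so a Python slice matches the range.
    return text[begin:end]
```

For the text `"Linked Data is simple"`, the fragment `#char=7,11` would select `"Data"`, and `#char=0,21` would select the whole text.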
NIF Core Ontology
• Classes and properties to describe the relations between
– Documents
– Text
– Substrings
– Corresponding URI schemes
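
To illustrate how documents, text and substrings might be related in RDF, the sketch below emits N-Triples for one substring. The slides do not list individual terms, so the namespace and the names `Context`, `String`, `isString`, `referenceContext`, `beginIndex`, `endIndex` and `anchorOf` are assumptions taken from the published NIF 2.0 Core ontology and may differ from the draft discussed here. The helper names are hypothetical.

```python
NIF = "http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#"  # assumed namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD_INT = "http://www.w3.org/2001/XMLSchema#nonNegativeInteger"


def iri(value: str) -> str:
    return f"<{value}>"


def literal(value, datatype: str = "") -> str:
    """Serialise a literal with the escapes N-Triples needs for common cases."""
    escaped = (str(value).replace("\\", "\\\\").replace('"', '\\"')
               .replace("\n", "\\n").replace("\r", "\\r"))
    return f'"{escaped}"' + (f"^^<{datatype}>" if datatype else "")


def describe_substring(document_uri: str, text: str, begin: int, end: int) -> list:
    """Return N-Triples lines linking a substring to the text it belongs to."""
    context = f"{document_uri}#char=0,{len(text)}"   # the whole text
    substring = f"{document_uri}#char={begin},{end}"  # the annotated span
    triples = [
        (iri(context), iri(RDF_TYPE), iri(NIF + "Context")),
        (iri(context), iri(NIF + "isString"), literal(text)),
        (iri(substring), iri(RDF_TYPE), iri(NIF + "String")),
        (iri(substring), iri(NIF + "referenceContext"), iri(context)),
        (iri(substring), iri(NIF + "beginIndex"), literal(begin, XSD_INT)),
        (iri(substring), iri(NIF + "endIndex"), literal(end, XSD_INT)),
        (iri(substring), iri(NIF + "anchorOf"), literal(text[begin:end])),
    ]
    return [f"{s} {p} {o} ." for s, p, o in triples]
```

Calling `describe_substring("http://example.org/doc", "Linked Data", 7, 11)` would yield seven statements, the last of which records the covered string `"Data"`.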
NIF Core Ontology
• Additional classes and properties (unstable/testing)
– More URI schemes
– Text structure (words, sentences, paragraphs…)
– Part of Speech (POS)
– Annotations with Stanbol
– Confidence
Workflows, Modularity and Extensibility of NIF
• Workflows for NLP integration
– Normalization
– Tokenization
– Merge RDF annotations
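
The merge step can be pictured as a set union over triples. The sketch below assumes that all tools worked on the same normalized text and therefore produce the same URI for the same substring. The function name `merge_annotations` and the prefixed names in the usage note are hypothetical.

```python
from collections import defaultdict


def merge_annotations(*tool_outputs):
    """Union (subject, predicate, object) triples from several tools.

    Returns a dict mapping each subject URI to a sorted list of
    (predicate, object) pairs; duplicate statements collapse into one.
    """
    merged = defaultdict(set)
    for triples in tool_outputs:
        for subject, predicate, obj in triples:
            merged[subject].add((predicate, obj))
    return {subject: sorted(pairs) for subject, pairs in merged.items()}
```

For example, if a tokenizer emits `("doc#char=0,6", "rdf:type", "nif:Word")` and a tagger emits `("doc#char=0,6", "ex:posTag", "NNP")`, both statements end up under the same subject. This only works when normalization and tokenization agree, since differing offsets would produce different URIs.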
Workflows, Modularity and Extensibility of NIF
• NIF ontology logical modules
– Terminological model
– Inference model
– Validation model
• Vocabulary modules
– FISE
– ITS
– OLiA
– NERD
– …
Workflows, Modularity and Extensibility of NIF
• Granularity profiles
ITS Use Case
• The Internationalization Tag Set 2.0 is a W3C working draft that is
becoming a Recommendation.
• ITS standardizes HTML and XML attributes which can be used to
annotate nodes with processing information for language service
providers (i18n, l10n)
• The ITS 2.0 RDF ontology was developed using NIF, including a round-trip
conversion algorithm from ITS to NIF.
• NIF is expected to receive wide adoption by translation and language
service providers
• The ITS 2.0 RDF ontology provides properties that can serve as
best practices for NLP annotations.
OLiA Use Case
• The Ontologies of Linguistic Annotation provide stable identifiers for
morpho-syntactical annotation tag sets, so that NLP tools can use these
ids for better interoperability.
• OLiA provides Annotation Models and a Reference Model, comprising
more than 110 OWL ontologies for over 34 tag sets in 69 languages
• Features
– Documentation
– Flexible Granularity
– Language Independence
• NIF provides two properties
– nif:oliaIndividual (links a nif:String to an OLiA Annotation Model)
– nif:oliaCategory (links to the Reference Model)
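
A small sketch of how the two properties might be used together for a part-of-speech tag. The property names come from the slide. The namespaces for the Penn Annotation Model and the OLiA Reference Model, and the hand-written mapping table, are assumptions made for illustration only; in practice the mapping would come from the OLiA ontologies themselves.

```python
NIF = "http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#"  # assumed namespace
PENN = "http://purl.org/olia/penn.owl#"  # assumed Annotation Model namespace
OLIA = "http://purl.org/olia/olia.owl#"  # assumed Reference Model namespace

# Hypothetical excerpt of a tag-to-category mapping, written by hand.
PENN_TO_OLIA = {"NNP": ["ProperNoun"], "VBZ": ["Verb"]}


def olia_triples(string_uri: str, penn_tag: str) -> list:
    """Link a string to its tag-set individual and its reference categories."""
    triples = [(string_uri, NIF + "oliaIndividual", PENN + penn_tag)]
    for category in PENN_TO_OLIA.get(penn_tag, []):
        triples.append((string_uri, NIF + "oliaCategory", OLIA + category))
    return triples
```

The point of the second property is that a consumer can query for a Reference Model category without knowing which tag set the producing tool used.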
RDFaCE Use Case
• RDFa Content Editor is a rich text editor that supports WYSIWYM
authoring including various views of the semantically enriched textual
content.
• It combines results of different NLP APIs for automatic content
annotation
– Problem: heterogeneous API access, URI generation and output data structures
– Previous solution: server-side proxy with hard-coded input and connection for each API.
• NIF simplified the integration, adding an interoperability layer
What is ELRA?
• European Language Resources Association
• http://www.elra.info
• Effort to make Language Resources (LR) available for language
engineering and to evaluate language engineering technologies.
• LR marketplace
• Related organizations
– ELDA (ELRA’s operational body)
– LREC conferences
Relationship with NIF
• Different objectives
• Written LRs (esp. corpora) can be annotated with NIF for
further interoperability and integration with NLP tools
• ADVANTAGE: Large test data collection to evaluate NLP tools
• DISADVANTAGE: Cost of LR (though there are free ones)
Roadmap for NIF 2.0
• Release of NIF 1.0
– DONE (Nov 2009)
• Release of NIF 2.0 Draft
– CURRENT effort on solving pending issues
– Adoption in ITS 2.0 W3C (soon-to-be) Recommendation
– NIF-Core ontology is becoming stable
– RLOG - an RDF Logging Ontology
– NIF Validator software available
• Release of NIF 2.0 Core
• Release of NIF 2.0 Extensions
– ITS ontology, PROV ontology, Lemon Ontology, NERD, UIMA, MARL opinion
ontology…
Conclusions
• NIF allows NLP tools to be integrated using Linked Data
• Ongoing effort
• Many adopters and supporters
– LOD2 EU project
– Several W3C working groups
– Named Entity Recognition and Disambiguation (NERD)
– Ontologies of Linguistic Annotation (OLiA)
– …
• 27 different implementations and use cases
– Some available at http://persistence.uni-leipzig.org/nlp2rdf/
www.sti-innsbruck.at© Copyright 2012 STI INNSBRUCK www.sti-innsbruck.at
Thanks for your attention
Questions?
References
1. http://persistence.uni-leipzig.org/nlp2rdf/
2. Sebastian Hellmann, Jens Lehmann, Sören Auer, and Martin Brümmer.
Integrating NLP using Linked Data. In: 12th International
Semantic Web Conference, 21-25 October 2013, Sydney, Australia