SlideShare a Scribd company logo
1 of 20
Download to read offline
FRAMEWORK FOR INTELLIGENT VIRTUAL ORGANIZATIONS (FIVO)




       Natural Language based
 Processing of Multilingual Contracts
for Virtual Organizations constitution
          Mikołaj Pastuszko, Bartosz Kryza, Renata Słota, Jacek Kitowski
       Institute of Computer Science, University of Science and Technology AGH
                                                               Kraków, POLAND
Agenda

Background of the problem

Goals and requirements of NLPN system

Architecture of NLPN system

Main processing flow in NLPN system

Technologies and tools used in NLPN system

Example of contract text analysis in NLPN system

Future development proposals for NLPN system
Problem introduction

Assumption
  Organizations
               own resources that are expected to be shared within Virtual
   Organization
  Conditions of cooperation are written down in form of the contract
   document


Problem
  Contracts
           are written in natural language (e.g. Polish)
  Automatization of the Virtual Organization management (FiVO) requires a
   formal and semantic form of the contract (ontology in OWL format)


Solution
  NLP-based  Negotiations (NLPN) System:
   Translating natural language based contracts to ontologies in OWL
   format
Concept of NLPN system
Goals and requirements

Support for multiple languages
  English and Polish as a starting point
  Easily extendable with support for another languages



Output ontology in OWL format           (FiVO requirement)
  Ontology   sturucture easily adjustable


Minimalization of human (supervisor) assistance


Flexible mapping between text phrases and ontology entities
  Human-readable     and easily editable Contract Dictionary


Modularity
  Easy   orchestration for various applications
Data flow in NLPN system
Modular architecture of NLPN system
Contract text analysis


1. Tokenization

2. Sentence Splitting

3. Morphological Analysis and POS Tagging

4. Named Entities Recognition
  ●
    Gazetteer

5. Contract Statemets Recognition
  ●
    Transducer + grammars
Technologies and tools

NLP tools
    GATE – General Architecture for Text Engineering
        Tokenizer
                                 ANNIE – A Nearly-New
        Gazetteer
                                 Information Extraction System
        OntoGazetteer
        JAPE Transducer
          JAPE grammars – Java Annotations Pattern Engine

    LanguageTool
      Sentence Splitter
        Part-of-Speech Tagger
        Disambiguator (tagger part)
        Supports 20 languages including Polish (Morfologik library)
Technologies and tools

Ontologies
    Jena Semantic Web Framework library
        Supports read and write in RDF/XML, N3 and N-Triples formats
        Provides API for OWL and RDF


Configuration files
    YAML format
    SnakeYAML library
Example: Contract text analysis

QoS Statements
Costa Rica Airlines should provide number of seats of Mercedes-Benz H6 equal to 54 and expected average velocity
greater than 60 km/h.
Security Statements
Tour Manager and Client should be able to book seats on Costa Rica Service.
Penalty Clauses
In case of violation of Acela D45 trainset sharing conditions a notification should be sent to John Smith.




Stwierdzenia QoS
Costa Rica Airlines będzie świadczyć ilość miejsc siedzących dla Mercedes-Benz H6 wynoszącą dokładnie 54 i
przewidywaną prędkość średnią ponad 60 km/h.
Stwierdzenia bezpieczeństwa
Tour Manager i Klient powinni być uprawnieni do rezerwowania miejsc poprzez Usługę Costa Rica.
Klauzule kar umownych
W przypadku niedotrzymania warunków świadczenia Acela D45 trainset powinno zostać wysłane powiadomienie do
Johna Smitha.
Tokenization
Sentence Splitting
Morphological Analysis and POS Tagging
Named Entities Recognition
Contract Statements Recognition
Contract Statements Recognition
Summary

NLPN system:
  Translates   natural language based contracts to formal and
   semantic form of ontologies
  Supports English and Polish

    Easily extendable with another languages
  Is modular

    Ease of use in various applications
  Is highly configurable

    Contract Dictionary (including its structure)
    Contract Ontology structure
    Contract Statements forms
    Configuration files for all components
  Has broad perspectives for future development →
Future development

Distributed Negotiations Environment
     Negotiations
      Console
 More
  statement
  forms
 Statistic
  approach
  algorithms
 Noise
  correction
  (typo etc.)
The End



    Thank you


mikolaj.pastuszko@gmail.com

More Related Content

Similar to CGW 2010 - NLPN

Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
kcmani15
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
guestc93331f
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
gauravtest1
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
aaa111333
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
guest993a3d
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
guest993a3d
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
guest993a3d
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
guest993a3d
 
02 c a306-phillips_langtags
02 c a306-phillips_langtags02 c a306-phillips_langtags
02 c a306-phillips_langtags
suvo1111
 

Similar to CGW 2010 - NLPN (20)

How AI can help you build better customer relationships?
How AI can help you build better customer relationships?How AI can help you build better customer relationships?
How AI can help you build better customer relationships?
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
 
Google Cloud Platform Munich
Google Cloud Platform MunichGoogle Cloud Platform Munich
Google Cloud Platform Munich
 
Don Bailey eLearning sample
Don Bailey eLearning sampleDon Bailey eLearning sample
Don Bailey eLearning sample
 
Crawford ubl200212
Crawford ubl200212Crawford ubl200212
Crawford ubl200212
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford Ubl200212
Crawford Ubl200212Crawford Ubl200212
Crawford Ubl200212
 
Crawford U B L200212
Crawford U B L200212Crawford U B L200212
Crawford U B L200212
 
CLTL Software and Web Services
CLTL Software and Web Services CLTL Software and Web Services
CLTL Software and Web Services
 
DeepPavlov 2019
DeepPavlov 2019DeepPavlov 2019
DeepPavlov 2019
 
02 c a306-phillips_langtags
02 c a306-phillips_langtags02 c a306-phillips_langtags
02 c a306-phillips_langtags
 
Google Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow StuttgartGoogle Cloud Platform - Cloud-Native Roadshow Stuttgart
Google Cloud Platform - Cloud-Native Roadshow Stuttgart
 
voice browser
voice browservoice browser
voice browser
 
ACE Logo
ACE LogoACE Logo
ACE Logo
 
Tamil OCR using Tesseract OCR Engine
Tamil OCR using Tesseract OCR EngineTamil OCR using Tesseract OCR Engine
Tamil OCR using Tesseract OCR Engine
 
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
Languages, Ontologies and Automatic Grammar Generation - Prof. Pedro Rangel H...
 
CSCorganization of programming languages
CSCorganization of programming languagesCSCorganization of programming languages
CSCorganization of programming languages
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

CGW 2010 - NLPN

  • 1. FRAMEWORK FOR INTELLIGENT VIRTUAL ORGANIZATIONS (FIVO) Natural Language based Processing of Multilingual Contracts for Virtual Organizations constitution Mikołaj Pastuszko, Bartosz Kryza, Renata Słota, Jacek Kitowski Institute of Computer Science, University of Science and Technology AGH Kraków, POLAND
  • 2. Agenda Background of the problem Goals and requirements of NLPN system Architecture of NLPN system Main processing flow in NLPN system Technologies and tools used in NLPN system Example of contract text analysis in NLPN system Future development proposals for NLPN system
  • 3. Problem introduction Assumption  Organizations own resources that are expected to be shared within Virtual Organization  Conditions of cooperation are written down in form of the contract document Problem  Contracts are written in natural language (e.g. Polish)  Automatization of the Virtual Organization management (FiVO) requires a formal and semantic form of the contract (ontology in OWL format) Solution  NLP-based Negotiations (NLPN) System: Translating natural language based contracts to ontologies in OWL format
  • 5. Goals and requirements Support for multiple languages  English and Polish as a starting point  Easily extendable with support for another languages Output ontology in OWL format (FiVO requirement)  Ontology sturucture easily adjustable Minimalization of human (supervisor) assistance Flexible mapping between text phrases and ontology entities  Human-readable and easily editable Contract Dictionary Modularity  Easy orchestration for various applications
  • 6. Data flow in NLPN system
  • 8. Contract text analysis 1. Tokenization 2. Sentence Splitting 3. Morphological Analysis and POS Tagging 4. Named Entities Recognition ● Gazetteer 5. Contract Statemets Recognition ● Transducer + grammars
  • 9. Technologies and tools NLP tools  GATE – General Architecture for Text Engineering  Tokenizer ANNIE – A Nearly-New  Gazetteer Information Extraction System  OntoGazetteer  JAPE Transducer  JAPE grammars – Java Annotations Pattern Engine  LanguageTool  Sentence Splitter  Part-of-Speech Tagger  Disambiguator (tagger part)  Supports 20 languages including Polish (Morfologik library)
  • 10. Technologies and tools Ontologies  Jena Semantic Web Framework library  Supports read and write in RDF/XML, N3 and N-Triples formats  Provides API for OWL and RDF Configuration files  YAML format  SnakeYAML library
  • 11. Example: Contract text analysis QoS Statements Costa Rica Airlines should provide number of seats of Mercedes-Benz H6 equal to 54 and expected average velocity greater than 60 km/h. Security Statements Tour Manager and Client should be able to book seats on Costa Rica Service. Penalty Clauses In case of violation of Acela D45 trainset sharing conditions a notification should be sent to John Smith. Stwierdzenia QoS Costa Rica Airlines będzie świadczyć ilość miejsc siedzących dla Mercedes-Benz H6 wynoszącą dokładnie 54 i przewidywaną prędkość średnią ponad 60 km/h. Stwierdzenia bezpieczeństwa Tour Manager i Klient powinni być uprawnieni do rezerwowania miejsc poprzez Usługę Costa Rica. Klauzule kar umownych W przypadku niedotrzymania warunków świadczenia Acela D45 trainset powinno zostać wysłane powiadomienie do Johna Smitha.
  • 18. Summary NLPN system:  Translates natural language based contracts to formal and semantic form of ontologies  Supports English and Polish  Easily extendable with another languages  Is modular  Ease of use in various applications  Is highly configurable  Contract Dictionary (including its structure)  Contract Ontology structure  Contract Statements forms  Configuration files for all components  Has broad perspectives for future development →
  • 19. Future development Distributed Negotiations Environment  Negotiations Console  More statement forms  Statistic approach algorithms  Noise correction (typo etc.)
  • 20. The End Thank you mikolaj.pastuszko@gmail.com