SlideShare a Scribd company logo
1 of 25
Natural Intelligence -
Commonsense Question Answering
with Conceptual Graphs

  Fatih Mehmet Güler and Aysenur Birturk
  Department of Computer Engineering, METU
          06531, Ankara/TURKEY
             fmguler@gmail.com
          birturk@ceng.metu.edu.tr
Motivation

   Massive Knowledge found as Natural Language
   Text based Question Answering (no tagging)
   Open Domain Question Answering
   Address Commonsense Reasoning Problem
   Linguistically motivated KRR
       Intelligence is the accumulation of knowledge
   Integrate State of the Art Tools
   Ultimate goal: Getting closer to strong AI
Summary of the System

   Natural Language is parsed
   Utterances are represented using CGs
   Concepts and Relation types are mapped to
    Cyc equivalent counterparts
   Type hierarchies are computed
   Knowledge is accumulated
   If the input is a question
       Search for answer (projection)
Summary of the System (Cont’d)

                       NI




           NLP        KRR       Commonsense




           CCG        CGs        Open Cyc




         C&C Tools   Cogitant
Background

   Combinatory Categorial Grammar (CCG)
       C&C Tools
   Conceptual Graphs
       Cogitant
   Open Cyc
Combinatory Categorial Grammar (CCG)

   Lexicalized Theory of Grammar based on
    Categorial Grammar ( Steedman 2001).
       Functions can be applied or composed
       Arguments can be picked up or turned into
        functors (Type raising)
   Easy for Semantic Representations
       Small number of semantically transparent
        combinatory rules to combine CCG categories.
           Assign semantic representations to the lexical entries
           Interpret combinatory rules
CCG parse for “Mr. Hyde ate two
lemmons”
CCG Parse for “Susan knows that Bob
likes Fred”
DRS for “Susan knows that Bob likes
Fred”
C&C Tools
   Linguistically Motivated Large-Scale NLP with C&C
    and Boxer. (Curran, Clark, Bos, 2007)
       C&C Parser
         POS Tagging, Supertagging

         Parsing, Chunking

         Named Entity Recognition

       Boxer
         Uses CCG parser output

         Generates DRS Semantic Representations

   Freely available for research
       http://svn.ask.it.usyd.edu.au/trac/candc/wiki
C&C Tools

   Large Scale NLP is possible with C&C and
    Boxer
   C&C Parser: state of the art parser for CCG
   Boxer: Semantic representations in DRS
Open Cyc

   Open source version of Cyc system
   Cyc: greatest effort to encode Common Sense
    knowledge in machine processable way
   500.000 concepts 26.000 relations and 5.000.000
    assertions
   CycL language similar to Lisp
   We use Cyc to map parsed words to common sense
    counterparts such as person to #$Person
    (disambiguation)
Open Cyc (cont’d)

   (#$likesAsFriend #$GeorgeWBush #$AlGore)
   #$isa, #$genls
   (#$isa #$GeorgeWBush
    #$UnitedStatesPresident)
   (#$genls #$UnitedStatesPresident #$Person)
Cogitant

   Library for Conceptual Graph operations
   Supports broad CG operations (Genest &
    Salvat, 1998)
       Graph representation
       Conversion from CGIF
       Projection checking
       Rule application
Natural Intelligence – Commonsense
Question Answering with CGs
   Augment Common Sense knowledge
   Modular Approach
       Separation of Concerns
       State of the art tools
Architecture - Modules

   Natural Language Processing (C&C Tools are used
    for implementation)
       Convert natural language to CGIF
   Reasoning (Cogitant library is used for
    implementation)
       CG operations
   Common Sense (Open Cyc is used for
    implementation)
       Common sense mapping
   Storage (Conceptual Graphs are stored in a
    database)
       Persistence of CGs
System Definition
   User enters a sentence from web interface;
   This sentence is converted to CGIF using the NLP module;
   CGIF is converted to CGs using the reasoning module;
   Support is generated to CGs using the common sense module;
   Common sense rules gathered from common sense module are
    applied to CGs using reasoning module;
   CGs are merged to the previous ones using reasoning module;
   If the input sentence is a question sentence, same operations
    take place, except the resulting graph is used to query existing
    CGs using the reasoning service, and if there are projections
    from this query graph to previous CGs, results are displayed to
    the user;
   CGs are persisted using the storage module.
Common Sense Mapping

   Cyc: (prettyString TERM STRING)
   Chain up to #$Thing using #$genls relations
   Same for relations using #$genlPreds
   Relation hierarchies are converted to forward
    rules
       #$performedBy -> #$temporallyRelated
Sample Concept Hierarchy
   #$Place ->
           #$EnduringThing-Localized ->
                        #$Location-Underspecified ->
                                        #$Thing ->
                        #$SomethingExisting ->
                                        #$Individual ->
                                                        #$Thing ^^
                                                        #$Trajector-Underspecified ->
                                                                        #$Location-Underspecified ^^
                                        #$TemporallyExistingThing ->
                                                        #$TemporalThing ->
                                                                        #$Individual ^^
                        #$SpatialThing-Localized ->
                                        #$TemporallyExistingThing ^^
                                        #$SpatialThing ->
                                                        #$Individual ^^
                                        #$Boundary-Underspecified ->
                                                        #$Region-Underspecified ->
                                                                        #$Location-Underspecified ^^
                                                        #$Landmark-Underspecified ->
                                                                        #$Individual ^^
                                                                        #$Location-Underspecified ^^
                        #$SpatialThing-NonSituational ->
                                        #$SpatialThing ^^
                                        #$Individual ^^
           #$Location-Underspecified ^^
Conversion to Cogitant Support

   Convert Cyc hierarcy to Cogitant support
    format
       Concept Types
       Relation Types
       Individuals
       Rules
   Convert assertions to Cogitant graph format
   Apply forward rules
Answering Queries
Significance

   Sentences like;
       What are the intangible things in this situation?
       Was Mr. Hyde there while eating the apples?
       Does Mr. Hyde exist after eating the apples?
       Do the apples exist after Mr. Hyde ate them?
   Deep Natural Language Understanding
   State of the art tools
   Open domain question answering
Difficulties

   Open Cyc API is broken
       Does not work in Turkish locale (fixes are sent to
        maintainers)
       Still, provided API sends one IP packet per character, way
        too slow over network
       Custom socket API is developed and used over TCP
       Custom Lisp functions for generalization hierarchy and
        concept mapping
   Cogitant problematic
       Java API is very limited (compared to C++)
       Only works over XML files
Conclusion

   Central Integrated Common Sense QAS
   CCG for Natural Language Processing
   Conceptual Graphs for KRR
   Cyc for Common Sense
Future Work

   Implement Rule Induction
   Backward Chaining (Resolution)
   Improve NLP module and Common Sense mapping
   Probabilistic Reasoning
   Question Answering System (QAS) to be used in;
       Education (Learning Management Systems)
       Semantic Search (Content Management Systems)
       Intelligent Help

More Related Content

Similar to Natural Intelligence ICCS 2010

Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Lucidworks
 
PHP Development With MongoDB
PHP Development With MongoDBPHP Development With MongoDB
PHP Development With MongoDB
Fitz Agard
 
PHP Development with MongoDB (Fitz Agard)
PHP Development with MongoDB (Fitz Agard)PHP Development with MongoDB (Fitz Agard)
PHP Development with MongoDB (Fitz Agard)
MongoSF
 
Building DSLs On CLR and DLR (Microsoft.NET)
Building DSLs On CLR and DLR (Microsoft.NET)Building DSLs On CLR and DLR (Microsoft.NET)
Building DSLs On CLR and DLR (Microsoft.NET)
Vitaly Baum
 
Introduction into scalable graph analysis with Apache Giraph and Spark GraphX
Introduction into scalable graph analysis with Apache Giraph and Spark GraphXIntroduction into scalable graph analysis with Apache Giraph and Spark GraphX
Introduction into scalable graph analysis with Apache Giraph and Spark GraphX
rhatr
 

Similar to Natural Intelligence ICCS 2010 (20)

Embrace NoSQL and Eventual Consistency with Ripple
Embrace NoSQL and Eventual Consistency with RippleEmbrace NoSQL and Eventual Consistency with Ripple
Embrace NoSQL and Eventual Consistency with Ripple
 
Spatial LDA
Spatial LDASpatial LDA
Spatial LDA
 
Overview of running R in the Oracle Database
Overview of running R in the Oracle DatabaseOverview of running R in the Oracle Database
Overview of running R in the Oracle Database
 
Introduction to Domain-Driven Design
Introduction to Domain-Driven DesignIntroduction to Domain-Driven Design
Introduction to Domain-Driven Design
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
 
PHP Development With MongoDB
PHP Development With MongoDBPHP Development With MongoDB
PHP Development With MongoDB
 
PHP Development with MongoDB (Fitz Agard)
PHP Development with MongoDB (Fitz Agard)PHP Development with MongoDB (Fitz Agard)
PHP Development with MongoDB (Fitz Agard)
 
HPTS talk on micro-sharding with Katta
HPTS talk on micro-sharding with KattaHPTS talk on micro-sharding with Katta
HPTS talk on micro-sharding with Katta
 
Mining Software Archives to Support Software Development
Mining Software Archives to Support Software DevelopmentMining Software Archives to Support Software Development
Mining Software Archives to Support Software Development
 
[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative Filtering[SOCRS2013]Differential Context Modeling in Collaborative Filtering
[SOCRS2013]Differential Context Modeling in Collaborative Filtering
 
Building DSLs On CLR and DLR (Microsoft.NET)
Building DSLs On CLR and DLR (Microsoft.NET)Building DSLs On CLR and DLR (Microsoft.NET)
Building DSLs On CLR and DLR (Microsoft.NET)
 
Code as Data workshop: Using source{d} Engine to extract insights from git re...
Code as Data workshop: Using source{d} Engine to extract insights from git re...Code as Data workshop: Using source{d} Engine to extract insights from git re...
Code as Data workshop: Using source{d} Engine to extract insights from git re...
 
Stuff About CQRS
Stuff About CQRSStuff About CQRS
Stuff About CQRS
 
Compass Framework
Compass FrameworkCompass Framework
Compass Framework
 
R Workshop for Beginners
R Workshop for BeginnersR Workshop for Beginners
R Workshop for Beginners
 
Concurrent and Distributed Applications with Akka, Java and Scala
Concurrent and Distributed Applications with Akka, Java and ScalaConcurrent and Distributed Applications with Akka, Java and Scala
Concurrent and Distributed Applications with Akka, Java and Scala
 
High Performance Ruby: Evented vs. Threaded
High Performance Ruby: Evented vs. ThreadedHigh Performance Ruby: Evented vs. Threaded
High Performance Ruby: Evented vs. Threaded
 
Jquery2012 defs
Jquery2012 defsJquery2012 defs
Jquery2012 defs
 
Introduction into scalable graph analysis with Apache Giraph and Spark GraphX
Introduction into scalable graph analysis with Apache Giraph and Spark GraphXIntroduction into scalable graph analysis with Apache Giraph and Spark GraphX
Introduction into scalable graph analysis with Apache Giraph and Spark GraphX
 
Tips And Tricks For Bioinformatics Software Engineering
Tips And Tricks For Bioinformatics Software EngineeringTips And Tricks For Bioinformatics Software Engineering
Tips And Tricks For Bioinformatics Software Engineering
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 

Natural Intelligence ICCS 2010

  • 1. Natural Intelligence - Commonsense Question Answering with Conceptual Graphs Fatih Mehmet Güler and Aysenur Birturk Department of Computer Engineering, METU 06531, Ankara/TURKEY fmguler@gmail.com birturk@ceng.metu.edu.tr
  • 2. Motivation  Massive Knowledge found as Natural Language  Text based Question Answering (no tagging)  Open Domain Question Answering  Address Commonsense Reasoning Problem  Linguistically motivated KRR  Intelligence is the accumulation of knowledge  Integrate State of the Art Tools  Ultimate goal: Getting closer to strong AI
  • 3. Summary of the System  Natural Language is parsed  Utterances are represented using CGs  Concepts and Relation types are mapped to Cyc equivalent counterparts  Type hierarchies are computed  Knowledge is accumulated  If the input is a question  Search for answer (projection)
  • 4. Summary of the System (Cont’d) NI NLP KRR Commonsense CCG CGs Open Cyc C&C Tools Cogitant
  • 5. Background  Combinatory Categorial Grammar (CCG)  C&C Tools  Conceptual Graphs  Cogitant  Open Cyc
  • 6. Combinatory Categorial Grammar (CCG)  Lexicalized Theory of Grammar based on Categorial Grammar ( Steedman 2001).  Functions can be applied or composed  Arguments can be picked up or turned into functors (Type raising)  Easy for Semantic Representations  Small number of semantically transparent combinatory rules to combine CCG categories.  Assign semantic representations to the lexical entries  Interpret combinatory rules
  • 7. CCG parse for “Mr. Hyde ate two lemmons”
  • 8. CCG Parse for “Susan knows that Bob likes Fred”
  • 9. DRS for “Susan knows that Bob likes Fred”
  • 10. C&C Tools  Linguistically Motivated Large-Scale NLP with C&C and Boxer. (Curran, Clark, Bos, 2007)  C&C Parser  POS Tagging, Supertagging  Parsing, Chunking  Named Entity Recognition  Boxer  Uses CCG parser output  Generates DRS Semantic Representations  Freely available for research  http://svn.ask.it.usyd.edu.au/trac/candc/wiki
  • 11. C&C Tools  Large Scale NLP is possible with C&C and Boxer  C&C Parser: state of the art parser for CCG  Boxer: Semantic representations in DRS
  • 12. Open Cyc  Open source version of Cyc system  Cyc: greatest effort to encode Common Sense knowledge in machine processable way  500.000 concepts 26.000 relations and 5.000.000 assertions  CycL language similar to Lisp  We use Cyc to map parsed words to common sense counterparts such as person to #$Person (disambiguation)
  • 13. Open Cyc (cont’d)  (#$likesAsFriend #$GeorgeWBush #$AlGore)  #$isa, #$genls  (#$isa #$GeorgeWBush #$UnitedStatesPresident)  (#$genls #$UnitedStatesPresident #$Person)
  • 14. Cogitant  Library for Conceptual Graph operations  Supports broad CG operations (Genest & Salvat, 1998)  Graph representation  Conversion from CGIF  Projection checking  Rule application
  • 15. Natural Intelligence – Commonsense Question Answering with CGs  Augment Common Sense knowledge  Modular Approach  Separation of Concerns  State of the art tools
  • 16. Architecture - Modules  Natural Language Processing (C&C Tools are used for implementation)  Convert natural language to CGIF  Reasoning (Cogitant library is used for implementation)  CG operations  Common Sense (Open Cyc is used for implementation)  Common sense mapping  Storage (Conceptual Graphs are stored in a database)  Persistence of CGs
  • 17. System Definition  User enters a sentence from web interface;  This sentence is converted to CGIF using the NLP module;  CGIF is converted to CGs using the reasoning module;  Support is generated to CGs using the common sense module;  Common sense rules gathered from common sense module are applied to CGs using reasoning module;  CGs are merged to the previous ones using reasoning module;  If the input sentence is a question sentence, same operations take place, except the resulting graph is used to query existing CGs using the reasoning service, and if there are projections from this query graph to previous CGs, results are displayed to the user;  CGs are persisted using the storage module.
  • 18. Common Sense Mapping  Cyc: (prettyString TERM STRING)  Chain up to #$Thing using #$genls relations  Same for relations using #$genlPreds  Relation hierarchies are converted to forward rules  #$performedBy -> #$temporallyRelated
  • 19. Sample Concept Hierarchy  #$Place ->  #$EnduringThing-Localized ->  #$Location-Underspecified ->  #$Thing ->  #$SomethingExisting ->  #$Individual ->  #$Thing ^^  #$Trajector-Underspecified ->  #$Location-Underspecified ^^  #$TemporallyExistingThing ->  #$TemporalThing ->  #$Individual ^^  #$SpatialThing-Localized ->  #$TemporallyExistingThing ^^  #$SpatialThing ->  #$Individual ^^  #$Boundary-Underspecified ->  #$Region-Underspecified ->  #$Location-Underspecified ^^  #$Landmark-Underspecified ->  #$Individual ^^  #$Location-Underspecified ^^  #$SpatialThing-NonSituational ->  #$SpatialThing ^^  #$Individual ^^  #$Location-Underspecified ^^
  • 20. Conversion to Cogitant Support  Convert Cyc hierarcy to Cogitant support format  Concept Types  Relation Types  Individuals  Rules  Convert assertions to Cogitant graph format  Apply forward rules
  • 22. Significance  Sentences like;  What are the intangible things in this situation?  Was Mr. Hyde there while eating the apples?  Does Mr. Hyde exist after eating the apples?  Do the apples exist after Mr. Hyde ate them?  Deep Natural Language Understanding  State of the art tools  Open domain question answering
  • 23. Difficulties  Open Cyc API is broken  Does not work in Turkish locale (fixes are sent to maintainers)  Still, provided API sends one IP packet per character, way too slow over network  Custom socket API is developed and used over TCP  Custom Lisp functions for generalization hierarchy and concept mapping  Cogitant problematic  Java API is very limited (compared to C++)  Only works over XML files
  • 24. Conclusion  Central Integrated Common Sense QAS  CCG for Natural Language Processing  Conceptual Graphs for KRR  Cyc for Common Sense
  • 25. Future Work  Implement Rule Induction  Backward Chaining (Resolution)  Improve NLP module and Common Sense mapping  Probabilistic Reasoning  Question Answering System (QAS) to be used in;  Education (Learning Management Systems)  Semantic Search (Content Management Systems)  Intelligent Help