SlideShare una empresa de Scribd logo
1 de 7
Descargar para leer sin conexión
Module
          13
Natural Language
      Processing
        Version 2 CSE IIT, Kharagpur
13.1 Instructional Objective
•   The students should understand the necessity of natural language processing in
    building an intelligent system
•   Students should understand the difference between natural and formal language and
    the difficulty in processing the former
•   Students should understand the ambiguities that arise in natural language processing
•   Students should understand the language information required like like
        o Phonology
        o Morphology
        o Syntax
        o Semantic
        o Discourse
        o World knowledge
•   Students should understand the steps involved in natural language understanding and
    generation
•   The student should be familiar with basic language processing operations like
        o Morphological analysis
        o Parts-of-Speech tagging
        o Lexical processing
        o Semantic processing
        o Knowledge representation

At the end of this lesson the student should be able to do the following:
    • Design the processing steps required for a NLP task
    • Implement the processing techniques.




                                                            Version 2 CSE IIT, Kharagpur
Lesson
        40
Issues in NLP
    Version 2 CSE IIT, Kharagpur
13.1 Natural Language Processing
Natural Language Processing (NLP) is the process of computer analysis of input provided
in a human language (natural language), and conversion of this input into a useful form of
representation.

The field of NLP is primarily concerned with getting computers to perform useful and
interesting tasks with human languages. The field of NLP is secondarily concerned with
helping us come to a better understanding of human language.

   •   The input/output of a NLP system can be:
           – written text
           – speech
   •   We will mostly concerned with written text (not speech).
   •   To process written text, we need:
           – lexical, syntactic, semantic knowledge about the language
           – discourse information, real world knowledge
   •   To process spoken language, we need everything required to process written text,
       plus the challenges of speech recognition and speech synthesis.

There are two components of NLP.

   •   Natural Language Understanding
          – Mapping the given input in the natural language into a useful
             representation.
          – Different level of analysis required:
             morphological analysis,
             syntactic analysis,
             semantic analysis,
             discourse analysis, …
   •   Natural Language Generation
          – Producing output in the natural language from some internal
             representation.
          – Different level of synthesis required:
             deep planning (what to say),
             syntactic generation
   •   NL Understanding is much harder than NL Generation. But, still both of them are
       hard.

The difficulty in NL understanding arises from the following facts:

   •   Natural language is extremely rich in form and structure, and very ambiguous.
          – How to represent meaning,
          – Which structures map to which meaning structures.
   •   One input can mean many different things. Ambiguity can be at different levels.

                                                           Version 2 CSE IIT, Kharagpur
–    Lexical (word level) ambiguity -- different meanings of words
          –    Syntactic ambiguity -- different ways to parse the sentence
          –    Interpreting partial information -- how to interpret pronouns
          –    Contextual information -- context of the sentence may affect the meaning
               of that sentence.
   •   Many input can mean the same thing.
   •   Interaction among components of the input is not clear.

The following language related information are useful in NLP:

   •   Phonology – concerns how words are related to the sounds that realize them.

   •   Morphology – concerns how words are constructed from more        basic meaning
       units called morphemes. A morpheme is the primitive unit of meaning in a
       language.

   •   Syntax – concerns how can be put together to form correct sentences and
       determines what structural role each word plays in the sentence and what phrases
       are subparts of other phrases.

   •   Semantics – concerns what words mean and how these meaning combine in
       sentences to form sentence meaning. The study of context-independent meaning.

   •   Pragmatics – concerns how sentences are used in different situations and how
       use affects the interpretation of the sentence.

   •   Discourse – concerns how the immediately preceding sentences affect the
       interpretation of the next sentence. For example, interpreting pronouns and
       interpreting the temporal aspects of the information.

   •   World Knowledge – includes general knowledge about the world. What each
       language user must know about the other’s beliefs and goals.


13.1.1 Ambiguity

I made her duck.

   •   How many different interpretations does this sentence have?
   •   What are the reasons for the ambiguity?
   •   The categories of knowledge of language can be thought of as ambiguity
       resolving components.
   •   How can each ambiguous piece be resolved?
   •   Does speech input make the sentence even more ambiguous?
           – Yes – deciding word boundaries
   •   Some interpretations of : I made her duck.

                                                         Version 2 CSE IIT, Kharagpur
1. I cooked duck for her.
           2. I cooked duck belonging to her.
           3. I created a toy duck which she owns.
           4. I caused her to quickly lower her head or body.
           5. I used magic and turned her into a duck.
   •   duck – morphologically and syntactically ambiguous:
               noun or verb.
   •   her – syntactically ambiguous: dative or possessive.
   •   make – semantically ambiguous: cook or create.
   •   make – syntactically ambiguous:
           – Transitive – takes a direct object. => 2
           – Di-transitive – takes two objects. => 5
           – Takes a direct object and a verb. => 4

Ambiguities are resolved using the following methods.

   •   models and algorithms are introduced to resolve ambiguities at different levels.
   •   part-of-speech tagging -- Deciding whether duck is verb or noun.
   •   word-sense disambiguation -- Deciding whether make is create or cook.
   •   lexical disambiguation -- Resolution of part-of-speech and        word-sense
       ambiguities are two important kinds of lexical disambiguation.
   •   syntactic ambiguity -- her duck is an example of syntactic ambiguity, and can be
       addressed by probabilistic parsing.

13.1.2 Models to represent Linguistic Knowledge

   •   We will use certain formalisms (models) to represent the required linguistic
       knowledge.
   •   State Machines -- FSAs, FSTs, HMMs, ATNs, RTNs
   •   Formal Rule Systems -- Context Free Grammars, Unification Grammars,
       Probabilistic CFGs.
   •   Logic-based Formalisms -- first order predicate logic, some higher order logic.
   •   Models of Uncertainty -- Bayesian probability theory.

13.1.3 Algorithms to Manipulate Linguistic Knowledge

   •   We will use algorithms to manipulate the models of linguistic knowledge to
       produce the desired behavior.
   •   Most of the algorithms we will study are transducers and parsers.
           – These algorithms construct some structure based on their input.
   •   Since the language is ambiguous at all levels,
       these algorithms are never simple processes.
   •   Categories of most algorithms that will be used can fall into following categories.
           – state space search
           – dynamic programming


                                                           Version 2 CSE IIT, Kharagpur
13.2 Natural Language Understanding
The steps in natural language understanding are as follows:

           Words

Morphological Analysis

           Morphologically analyzed words (another step: POS tagging)

Syntactic Analysis

           Syntactic Structure

Semantic Analysis

           Context-independent meaning representation

Discourse Processing

            Final meaning representation




                                                          Version 2 CSE IIT, Kharagpur

Más contenido relacionado

La actualidad más candente

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Yasir Khan
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
Mariana Soffer
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
Kuppusamy P
 
Natural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative CommunicationNatural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative Communication
Divya Sugumar
 

La actualidad más candente (20)

Natural language processing
Natural language processing Natural language processing
Natural language processing
 
Nlp
NlpNlp
Nlp
 
NLP
NLPNLP
NLP
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
NLP_KASHK: Introduction
NLP_KASHK: Introduction NLP_KASHK: Introduction
NLP_KASHK: Introduction
 
Natural language processing (NLP)
Natural language processing (NLP) Natural language processing (NLP)
Natural language processing (NLP)
 
Natural Language Processing glossary for Coders
Natural Language Processing glossary for CodersNatural Language Processing glossary for Coders
Natural Language Processing glossary for Coders
 
Natural Language Processing
Natural Language Processing Natural Language Processing
Natural Language Processing
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative CommunicationNatural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative Communication
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 

Similar a AI Lesson 40

Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptx
SHIBDASDUTTA
 
1 Introduction.ppt
1 Introduction.ppt1 Introduction.ppt
1 Introduction.ppt
tanishamahajan11
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Abdullah al Mamun
 
Eng19 week 6 (aural comprehension instruction2)
Eng19 week 6 (aural comprehension instruction2)Eng19 week 6 (aural comprehension instruction2)
Eng19 week 6 (aural comprehension instruction2)
leolita
 

Similar a AI Lesson 40 (20)

L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
CNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningCNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learning
 
Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptx
 
AI Lesson 41
AI Lesson 41AI Lesson 41
AI Lesson 41
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
AI - natural language processing
AI - natural language processingAI - natural language processing
AI - natural language processing
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
1 Introduction.ppt
1 Introduction.ppt1 Introduction.ppt
1 Introduction.ppt
 
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffffnlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
nlp-01.pptxvvvffffffvvvvvfeddeeddffffffffff
 
Lesson 41.pdf
Lesson 41.pdfLesson 41.pdf
Lesson 41.pdf
 
English for Specific Purposes
English for Specific PurposesEnglish for Specific Purposes
English for Specific Purposes
 
Ch9
Ch9Ch9
Ch9
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.pptLARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
LARG-20010118-Natasha e wejkwrlkwr klwrlknrklnr k.ppt
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
Teaching reading
Teaching readingTeaching reading
Teaching reading
 
Sla glossary
Sla glossarySla glossary
Sla glossary
 
Eng19 week 6 (aural comprehension instruction2)
Eng19 week 6 (aural comprehension instruction2)Eng19 week 6 (aural comprehension instruction2)
Eng19 week 6 (aural comprehension instruction2)
 
Transformational grammar
Transformational grammarTransformational grammar
Transformational grammar
 

Más de Assistant Professor (20)

AI Lesson 39
AI Lesson 39AI Lesson 39
AI Lesson 39
 
AI Lesson 38
AI Lesson 38AI Lesson 38
AI Lesson 38
 
AI Lesson 37
AI Lesson 37AI Lesson 37
AI Lesson 37
 
AI Lesson 36
AI Lesson 36AI Lesson 36
AI Lesson 36
 
AI Lesson 35
AI Lesson 35AI Lesson 35
AI Lesson 35
 
AI Lesson 34
AI Lesson 34AI Lesson 34
AI Lesson 34
 
AI Lesson 33
AI Lesson 33AI Lesson 33
AI Lesson 33
 
AI Lesson 32
AI Lesson 32AI Lesson 32
AI Lesson 32
 
AI Lesson 31
AI Lesson 31AI Lesson 31
AI Lesson 31
 
AI Lesson 30
AI Lesson 30AI Lesson 30
AI Lesson 30
 
AI Lesson 29
AI Lesson 29AI Lesson 29
AI Lesson 29
 
AI Lesson 28
AI Lesson 28AI Lesson 28
AI Lesson 28
 
AI Lesson 27
AI Lesson 27AI Lesson 27
AI Lesson 27
 
AI Lesson 26
AI Lesson 26AI Lesson 26
AI Lesson 26
 
AI Lesson 25
AI Lesson 25AI Lesson 25
AI Lesson 25
 
AI Lesson 24
AI Lesson 24AI Lesson 24
AI Lesson 24
 
AI Lesson 23
AI Lesson 23AI Lesson 23
AI Lesson 23
 
AI Lesson 22
AI Lesson 22AI Lesson 22
AI Lesson 22
 
AI Lesson 21
AI Lesson 21AI Lesson 21
AI Lesson 21
 
Lesson 20
Lesson 20Lesson 20
Lesson 20
 

Último

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Último (20)

Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 

AI Lesson 40

  • 1. Module 13 Natural Language Processing Version 2 CSE IIT, Kharagpur
  • 2. 13.1 Instructional Objective • The students should understand the necessity of natural language processing in building an intelligent system • Students should understand the difference between natural and formal language and the difficulty in processing the former • Students should understand the ambiguities that arise in natural language processing • Students should understand the language information required like like o Phonology o Morphology o Syntax o Semantic o Discourse o World knowledge • Students should understand the steps involved in natural language understanding and generation • The student should be familiar with basic language processing operations like o Morphological analysis o Parts-of-Speech tagging o Lexical processing o Semantic processing o Knowledge representation At the end of this lesson the student should be able to do the following: • Design the processing steps required for a NLP task • Implement the processing techniques. Version 2 CSE IIT, Kharagpur
  • 3. Lesson 40 Issues in NLP Version 2 CSE IIT, Kharagpur
  • 4. 13.1 Natural Language Processing Natural Language Processing (NLP) is the process of computer analysis of input provided in a human language (natural language), and conversion of this input into a useful form of representation. The field of NLP is primarily concerned with getting computers to perform useful and interesting tasks with human languages. The field of NLP is secondarily concerned with helping us come to a better understanding of human language. • The input/output of a NLP system can be: – written text – speech • We will mostly concerned with written text (not speech). • To process written text, we need: – lexical, syntactic, semantic knowledge about the language – discourse information, real world knowledge • To process spoken language, we need everything required to process written text, plus the challenges of speech recognition and speech synthesis. There are two components of NLP. • Natural Language Understanding – Mapping the given input in the natural language into a useful representation. – Different level of analysis required: morphological analysis, syntactic analysis, semantic analysis, discourse analysis, … • Natural Language Generation – Producing output in the natural language from some internal representation. – Different level of synthesis required: deep planning (what to say), syntactic generation • NL Understanding is much harder than NL Generation. But, still both of them are hard. The difficulty in NL understanding arises from the following facts: • Natural language is extremely rich in form and structure, and very ambiguous. – How to represent meaning, – Which structures map to which meaning structures. • One input can mean many different things. Ambiguity can be at different levels. Version 2 CSE IIT, Kharagpur
  • 5. Lexical (word level) ambiguity -- different meanings of words – Syntactic ambiguity -- different ways to parse the sentence – Interpreting partial information -- how to interpret pronouns – Contextual information -- context of the sentence may affect the meaning of that sentence. • Many input can mean the same thing. • Interaction among components of the input is not clear. The following language related information are useful in NLP: • Phonology – concerns how words are related to the sounds that realize them. • Morphology – concerns how words are constructed from more basic meaning units called morphemes. A morpheme is the primitive unit of meaning in a language. • Syntax – concerns how can be put together to form correct sentences and determines what structural role each word plays in the sentence and what phrases are subparts of other phrases. • Semantics – concerns what words mean and how these meaning combine in sentences to form sentence meaning. The study of context-independent meaning. • Pragmatics – concerns how sentences are used in different situations and how use affects the interpretation of the sentence. • Discourse – concerns how the immediately preceding sentences affect the interpretation of the next sentence. For example, interpreting pronouns and interpreting the temporal aspects of the information. • World Knowledge – includes general knowledge about the world. What each language user must know about the other’s beliefs and goals. 13.1.1 Ambiguity I made her duck. • How many different interpretations does this sentence have? • What are the reasons for the ambiguity? • The categories of knowledge of language can be thought of as ambiguity resolving components. • How can each ambiguous piece be resolved? • Does speech input make the sentence even more ambiguous? – Yes – deciding word boundaries • Some interpretations of : I made her duck. Version 2 CSE IIT, Kharagpur
  • 6. 1. I cooked duck for her. 2. I cooked duck belonging to her. 3. I created a toy duck which she owns. 4. I caused her to quickly lower her head or body. 5. I used magic and turned her into a duck. • duck – morphologically and syntactically ambiguous: noun or verb. • her – syntactically ambiguous: dative or possessive. • make – semantically ambiguous: cook or create. • make – syntactically ambiguous: – Transitive – takes a direct object. => 2 – Di-transitive – takes two objects. => 5 – Takes a direct object and a verb. => 4 Ambiguities are resolved using the following methods. • models and algorithms are introduced to resolve ambiguities at different levels. • part-of-speech tagging -- Deciding whether duck is verb or noun. • word-sense disambiguation -- Deciding whether make is create or cook. • lexical disambiguation -- Resolution of part-of-speech and word-sense ambiguities are two important kinds of lexical disambiguation. • syntactic ambiguity -- her duck is an example of syntactic ambiguity, and can be addressed by probabilistic parsing. 13.1.2 Models to represent Linguistic Knowledge • We will use certain formalisms (models) to represent the required linguistic knowledge. • State Machines -- FSAs, FSTs, HMMs, ATNs, RTNs • Formal Rule Systems -- Context Free Grammars, Unification Grammars, Probabilistic CFGs. • Logic-based Formalisms -- first order predicate logic, some higher order logic. • Models of Uncertainty -- Bayesian probability theory. 13.1.3 Algorithms to Manipulate Linguistic Knowledge • We will use algorithms to manipulate the models of linguistic knowledge to produce the desired behavior. • Most of the algorithms we will study are transducers and parsers. – These algorithms construct some structure based on their input. • Since the language is ambiguous at all levels, these algorithms are never simple processes. • Categories of most algorithms that will be used can fall into following categories. – state space search – dynamic programming Version 2 CSE IIT, Kharagpur
  • 7. 13.2 Natural Language Understanding The steps in natural language understanding are as follows: Words Morphological Analysis Morphologically analyzed words (another step: POS tagging) Syntactic Analysis Syntactic Structure Semantic Analysis Context-independent meaning representation Discourse Processing Final meaning representation Version 2 CSE IIT, Kharagpur