SlideShare una empresa de Scribd logo
1 de 26
Language processing (HUL455)
MORPHOLOGICAL
ANALYSIS
-JINIA RAO & ASHISH KASHYAP
CONTENTS
• Morphology & its types.
• Approaches to Morphology
• Morpheme based morphology
• Morphological Analysis and its need.
• Morphological Generation and Analysis using
Paradigms
• Problems in Morphological Analysis.
• Bibliography.
MORPHOLOGY
• The study of word formation – how words are
built up from smaller pieces.
• Identification, analysis, and description of the
structure of a given language's MORPHEMES
and other linguistic units, such as root
words, affixes, parts of
speech, intonations and stresses, or
implied context.
Examples
• Washing= wash + ing
• Browser= browse + er
• Rats= rat + s
Types of Morphology
• Inflectional morphology:-modification of a
word to express different grammatical
categories. Examples- cats, men etc.
• Derivational Morphology:- creation of a new
word from existing word by changing
grammatical category. Examples- happiness,
brotherhood etc.
APPROACHES TO MORPHOLOGY
There are three principal approaches to
morphology
• Morpheme based morphology
• Lexeme based morphology
• Word based morphology
Morpheme-based morphology
• Word forms are analyzed as arrangements
of morphemes.
• Morphemes- smallest linguistic unit with a
grammatical function.
Lexeme based Morphology
• Lexeme-based morphology usually takes what
is called an "item-and-process" approach.
• Instead of analyzing a word form as a set of
morphemes arranged in sequence, a word
form is said to be the result of applying rules
that alter a word-form or stem in order to
produce a new one
Word based Morphology
• Word-based morphology is (usually) a word-
and-paradigm approach.
• Instead of stating rules to combine
morphemes into word forms, or to generate
word forms from stems, word-based
morphology states generalizations that hold
between the forms of inflectional paradigms
MORPHOLOGICAL ANALYSIS
• Analyzing words into their linguistic
components (morphemes).
• Ambiguity: More than one alternatives
flies fly VERB + PROG
fly NOUN + PLU
Expected Output
Input Morphologically analyzed output
Cats Cat+ N+ PL
Cat Cat + N + SG
Cities City + N + PL
Geese Goose + N + PL
Goose Goose + N + SG OR Goose + V
Gooses Goose + V + 3SG
Merging Merge + V + PresPart
Caught Catch + V + PastPart
Caught Catch + V + Past
NEED FOR MORPHOLOGICAL ANALYSIS
• Wastage of memory in exhaustive lexicon.
• Failure to depict linguistic generalization-
necessary to understand an unknown word.
• Morphologically rich and productive
languages might be problematic.
MORPHOLOGICAL ANALYSIS
USING PARADIGMS
• Most NLP systems use simple linguistic
theories for morphological analysis.
• Most NLP systems widely use this approach.
• Words are related to each other by analogical
rules.
• Words can be categorized based on the
pattern they fit into.
• Applicable both to existing words and to new
ones.
• Application of a pattern different from the
one that has been used - give rise to a new
word
• Examples:-older replacing elder .
Procedure and Algorithm
• A language expert provides different tables of
word forms covering the words in the entire
language.
• The roots follow the pattern( or paradigm )
implicit in the table for generating their word
forms.
• Examples
Continued..
EACH ENTRY IN THE TABLE SHOWS THE NUMBER OF
CHARACTERS TO BE DELETED FROM
CASE
Number Direct Oblique
Singular LADKAA LADAKE
Plural LADAKE LADAKON
CASE
Number Direct Oblique
Singular (0,ø) (1,e)
Plural (1,e) (1,ON)
Continued…
The table can be expressed in terms of an algorithm, which is as
follows:-
ALGORITHM 1: Forming paradigm table
PURPOSE: To form paradigm table from word forms table for a
root
INPUT: Root r, Words forms table WFT (with labels for rows and
columns)
OUTPUT: Paradigm table PT
ALGORITHM:
1. Create an empty table PT of the same dimensionality, size and
labels as the word forms table WFT
Continued…
2. For every entry w in WTF, do
If w=r
then store “(0, Ø)” in the corresponding
position in PT
else begin
let i be the position of the first
characters in w and r which are
different
store (size(r)-i+1,suffix(i,w)) at the
corresponding position in PT
3. Return PT
Generation of a Word Form
ALGORITHM 2: Generating a word form
PURPOSE: To generate a word form given a root and
desired feature values.
INPUT: Root r, Feature values FV
USES: Paradigm tables, Dictionary of roots DR,
dictionary of indeclinable words DI
OUTPUT: Word w
ALGORITHM:
1. If root r belongs to DI then return( words
stored in DI for r irrespective of FV)
Continued…
2. let p = paradigm type of r as obtained from
DR
3. let PT = paradigm table for p.
4. let (n,s) = entry in PT for feature values
FV
5. w := r minus n characters at the end
6. w := w plus suffix s
END ALGORITHM
PROBLEMS IN MORPHOLOGICAL
ANALYSIS
• False Analysis
• Productivity
• Bound base morphemes
False analysis
Words such as hospitable, sizeable.
• They don’t have the meaning “to be able”
• They can not take the suffix -ity to form a
noun
• Analyzing them as the words containing suffix
-able leads to false analysis
PRODUCTIVITY
• Property of a morphological process to give rise
to new formations on a systematic basis.
Exceptions to the above rule.
• Peaceable
• Actionable
• Companionable
Bound Base Morphemes
• Occur only in a particular complex word.
• Do not have independent existence.
• Words such as feasible, malleable
• -able has the regular meaning “be able”
• -ity form is possible
• Base words don’t exit independently
base
(nonexistent)
morpheme
(known)
Compound
REFERENCES
• “Linguistics, An Introduction to Language and
Communication” by Adrian Akmajian, Richard A.
Demers, Ann K. Farmer and Robert M. Harnish (5th
Edition)
• SPEECH and LANGUAGE PROCESSING, An Introduction
to Natural Language Processing,
Computational Linguistics, and Speech Recognition by
Daniel Jurafsky and James H. Martin (Second Edition)
• “Natural Language Processing- a Paninian perspective”
by Akshar Bharati, Vineet Chaitanya, Rajeev Sangal.
THANKYOU!!!

Más contenido relacionado

La actualidad más candente

Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguisticsIrum Malik
 
Syntactic analysis in NLP
Syntactic analysis in NLPSyntactic analysis in NLP
Syntactic analysis in NLPkartikaVashisht
 
Subfields of linguistics
Subfields of linguisticsSubfields of linguistics
Subfields of linguisticswajiha khan
 
Assignment of pakistani literature
Assignment  of  pakistani literatureAssignment  of  pakistani literature
Assignment of pakistani literatureRuby Rajpoot
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics1101989
 
The innateness theory and theories of language acquisition
The innateness theory and theories of language acquisitionThe innateness theory and theories of language acquisition
The innateness theory and theories of language acquisitionJEZS88
 
Lecture 1 introduction to syntax
Lecture 1 introduction to syntaxLecture 1 introduction to syntax
Lecture 1 introduction to syntaxssuser1f22f9
 
Stylistics and it’s relation with linguistics and literature
Stylistics and it’s relation with linguistics and literatureStylistics and it’s relation with linguistics and literature
Stylistics and it’s relation with linguistics and literatureMuhammad Adnan Ejaz
 
Types of corpus linguistics Parallel ,aligned...
 Types of corpus linguistics Parallel ,aligned... Types of corpus linguistics Parallel ,aligned...
Types of corpus linguistics Parallel ,aligned...RajpootBhatti5
 
Discourse and the sentence
Discourse and the sentenceDiscourse and the sentence
Discourse and the sentenceStudent
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Taggingtheyaseen51
 

La actualidad más candente (20)

Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguistics
 
Syntactic analysis in NLP
Syntactic analysis in NLPSyntactic analysis in NLP
Syntactic analysis in NLP
 
Subfields of linguistics
Subfields of linguisticsSubfields of linguistics
Subfields of linguistics
 
Syntax
SyntaxSyntax
Syntax
 
Assignment of pakistani literature
Assignment  of  pakistani literatureAssignment  of  pakistani literature
Assignment of pakistani literature
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
The innateness theory and theories of language acquisition
The innateness theory and theories of language acquisitionThe innateness theory and theories of language acquisition
The innateness theory and theories of language acquisition
 
Semantics analysis ppt
Semantics analysis pptSemantics analysis ppt
Semantics analysis ppt
 
Machine translation
Machine translationMachine translation
Machine translation
 
Lecture 1 introduction to syntax
Lecture 1 introduction to syntaxLecture 1 introduction to syntax
Lecture 1 introduction to syntax
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
"Linguistics is a Science"
"Linguistics is a Science""Linguistics is a Science"
"Linguistics is a Science"
 
Stylistics and it’s relation with linguistics and literature
Stylistics and it’s relation with linguistics and literatureStylistics and it’s relation with linguistics and literature
Stylistics and it’s relation with linguistics and literature
 
Types of corpus linguistics Parallel ,aligned...
 Types of corpus linguistics Parallel ,aligned... Types of corpus linguistics Parallel ,aligned...
Types of corpus linguistics Parallel ,aligned...
 
Discourse and the sentence
Discourse and the sentenceDiscourse and the sentence
Discourse and the sentence
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
Phrase structure grammar
Phrase structure grammarPhrase structure grammar
Phrase structure grammar
 
An introduction to semantics
An introduction to semanticsAn introduction to semantics
An introduction to semantics
 
Transformational Generative Grammar
Transformational Generative GrammarTransformational Generative Grammar
Transformational Generative Grammar
 
Structuralism
Structuralism Structuralism
Structuralism
 

Similar a Morphological Analysis

cs626-449-lect20-morphology-2009-9-29.ppt
cs626-449-lect20-morphology-2009-9-29.pptcs626-449-lect20-morphology-2009-9-29.ppt
cs626-449-lect20-morphology-2009-9-29.pptRamandeepKaur724335
 
MorphologyAndFST.pdf
MorphologyAndFST.pdfMorphologyAndFST.pdf
MorphologyAndFST.pdfssuser97943d
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaningfawzia
 
English Plus Lesson 1.pptx
English Plus Lesson 1.pptxEnglish Plus Lesson 1.pptx
English Plus Lesson 1.pptxRussel Carilla
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
語言學概論Morphology 2
語言學概論Morphology 2語言學概論Morphology 2
語言學概論Morphology 2Ja-Jun Liao
 
Morphemes & Types of morphemes
Morphemes & Types of morphemesMorphemes & Types of morphemes
Morphemes & Types of morphemesMahrukhShehzadi1
 
Principles of parameters
Principles of parametersPrinciples of parameters
Principles of parametersVelnar
 
05 linguistic theory meets lexicography
05 linguistic theory meets lexicography05 linguistic theory meets lexicography
05 linguistic theory meets lexicographyDuygu Aşıklar
 
Morphology background and history
Morphology background and historyMorphology background and history
Morphology background and historyMUHAMMADAFZAL378
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Saurabh Kaushik
 

Similar a Morphological Analysis (20)

cs626-449-lect20-morphology-2009-9-29.ppt
cs626-449-lect20-morphology-2009-9-29.pptcs626-449-lect20-morphology-2009-9-29.ppt
cs626-449-lect20-morphology-2009-9-29.ppt
 
morphemes.pdf
morphemes.pdfmorphemes.pdf
morphemes.pdf
 
Syntax.ppt
Syntax.pptSyntax.ppt
Syntax.ppt
 
MorphologyAndFST.pdf
MorphologyAndFST.pdfMorphologyAndFST.pdf
MorphologyAndFST.pdf
 
Visual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to MeaningVisual Word Recognition. The Journey from Features to Meaning
Visual Word Recognition. The Journey from Features to Meaning
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
English Plus Lesson 1.pptx
English Plus Lesson 1.pptxEnglish Plus Lesson 1.pptx
English Plus Lesson 1.pptx
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Morpheme
MorphemeMorpheme
Morpheme
 
語言學概論Morphology 2
語言學概論Morphology 2語言學概論Morphology 2
語言學概論Morphology 2
 
Linguistic morp
Linguistic morpLinguistic morp
Linguistic morp
 
Morphemes & Types of morphemes
Morphemes & Types of morphemesMorphemes & Types of morphemes
Morphemes & Types of morphemes
 
Morphology
Morphology Morphology
Morphology
 
Principles of parameters
Principles of parametersPrinciples of parameters
Principles of parameters
 
05 linguistic theory meets lexicography
05 linguistic theory meets lexicography05 linguistic theory meets lexicography
05 linguistic theory meets lexicography
 
COMPONENTS-OF-GRAMMAR.pptx
COMPONENTS-OF-GRAMMAR.pptxCOMPONENTS-OF-GRAMMAR.pptx
COMPONENTS-OF-GRAMMAR.pptx
 
Mental grammar
Mental grammarMental grammar
Mental grammar
 
Morphology background and history
Morphology background and historyMorphology background and history
Morphology background and history
 
Morphology
MorphologyMorphology
Morphology
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
 

Último

22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf203318pmpc
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxJuliansyahHarahap1
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.Kamal Acharya
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756dollysharma2066
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086anil_gaur
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 

Último (20)

22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 

Morphological Analysis

  • 2. CONTENTS • Morphology & its types. • Approaches to Morphology • Morpheme based morphology • Morphological Analysis and its need. • Morphological Generation and Analysis using Paradigms • Problems in Morphological Analysis. • Bibliography.
  • 3. MORPHOLOGY • The study of word formation – how words are built up from smaller pieces. • Identification, analysis, and description of the structure of a given language's MORPHEMES and other linguistic units, such as root words, affixes, parts of speech, intonations and stresses, or implied context.
  • 4. Examples • Washing= wash + ing • Browser= browse + er • Rats= rat + s
  • 5. Types of Morphology • Inflectional morphology:-modification of a word to express different grammatical categories. Examples- cats, men etc. • Derivational Morphology:- creation of a new word from existing word by changing grammatical category. Examples- happiness, brotherhood etc.
  • 6. APPROACHES TO MORPHOLOGY There are three principal approaches to morphology • Morpheme based morphology • Lexeme based morphology • Word based morphology
  • 7. Morpheme-based morphology • Word forms are analyzed as arrangements of morphemes. • Morphemes- smallest linguistic unit with a grammatical function.
  • 8. Lexeme based Morphology • Lexeme-based morphology usually takes what is called an "item-and-process" approach. • Instead of analyzing a word form as a set of morphemes arranged in sequence, a word form is said to be the result of applying rules that alter a word-form or stem in order to produce a new one
  • 9. Word based Morphology • Word-based morphology is (usually) a word- and-paradigm approach. • Instead of stating rules to combine morphemes into word forms, or to generate word forms from stems, word-based morphology states generalizations that hold between the forms of inflectional paradigms
  • 10. MORPHOLOGICAL ANALYSIS • Analyzing words into their linguistic components (morphemes). • Ambiguity: More than one alternatives flies fly VERB + PROG fly NOUN + PLU
  • 11. Expected Output Input Morphologically analyzed output Cats Cat+ N+ PL Cat Cat + N + SG Cities City + N + PL Geese Goose + N + PL Goose Goose + N + SG OR Goose + V Gooses Goose + V + 3SG Merging Merge + V + PresPart Caught Catch + V + PastPart Caught Catch + V + Past
  • 12. NEED FOR MORPHOLOGICAL ANALYSIS • Wastage of memory in exhaustive lexicon. • Failure to depict linguistic generalization- necessary to understand an unknown word. • Morphologically rich and productive languages might be problematic.
  • 13. MORPHOLOGICAL ANALYSIS USING PARADIGMS • Most NLP systems use simple linguistic theories for morphological analysis. • Most NLP systems widely use this approach.
  • 14. • Words are related to each other by analogical rules. • Words can be categorized based on the pattern they fit into. • Applicable both to existing words and to new ones. • Application of a pattern different from the one that has been used - give rise to a new word • Examples:-older replacing elder .
  • 15. Procedure and Algorithm • A language expert provides different tables of word forms covering the words in the entire language. • The roots follow the pattern( or paradigm ) implicit in the table for generating their word forms. • Examples
  • 16. Continued.. EACH ENTRY IN THE TABLE SHOWS THE NUMBER OF CHARACTERS TO BE DELETED FROM CASE Number Direct Oblique Singular LADKAA LADAKE Plural LADAKE LADAKON CASE Number Direct Oblique Singular (0,ø) (1,e) Plural (1,e) (1,ON)
  • 17. Continued… The table can be expressed in terms of an algorithm, which is as follows:- ALGORITHM 1: Forming paradigm table PURPOSE: To form paradigm table from word forms table for a root INPUT: Root r, Words forms table WFT (with labels for rows and columns) OUTPUT: Paradigm table PT ALGORITHM: 1. Create an empty table PT of the same dimensionality, size and labels as the word forms table WFT
  • 18. Continued… 2. For every entry w in WTF, do If w=r then store “(0, Ø)” in the corresponding position in PT else begin let i be the position of the first characters in w and r which are different store (size(r)-i+1,suffix(i,w)) at the corresponding position in PT 3. Return PT
  • 19. Generation of a Word Form ALGORITHM 2: Generating a word form PURPOSE: To generate a word form given a root and desired feature values. INPUT: Root r, Feature values FV USES: Paradigm tables, Dictionary of roots DR, dictionary of indeclinable words DI OUTPUT: Word w ALGORITHM: 1. If root r belongs to DI then return( words stored in DI for r irrespective of FV)
  • 20. Continued… 2. let p = paradigm type of r as obtained from DR 3. let PT = paradigm table for p. 4. let (n,s) = entry in PT for feature values FV 5. w := r minus n characters at the end 6. w := w plus suffix s END ALGORITHM
  • 21. PROBLEMS IN MORPHOLOGICAL ANALYSIS • False Analysis • Productivity • Bound base morphemes
  • 22. False analysis Words such as hospitable, sizeable. • They don’t have the meaning “to be able” • They can not take the suffix -ity to form a noun • Analyzing them as the words containing suffix -able leads to false analysis
  • 23. PRODUCTIVITY • Property of a morphological process to give rise to new formations on a systematic basis. Exceptions to the above rule. • Peaceable • Actionable • Companionable
  • 24. Bound Base Morphemes • Occur only in a particular complex word. • Do not have independent existence. • Words such as feasible, malleable • -able has the regular meaning “be able” • -ity form is possible • Base words don’t exit independently base (nonexistent) morpheme (known) Compound
  • 25. REFERENCES • “Linguistics, An Introduction to Language and Communication” by Adrian Akmajian, Richard A. Demers, Ann K. Farmer and Robert M. Harnish (5th Edition) • SPEECH and LANGUAGE PROCESSING, An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition by Daniel Jurafsky and James H. Martin (Second Edition) • “Natural Language Processing- a Paninian perspective” by Akshar Bharati, Vineet Chaitanya, Rajeev Sangal.