SlideShare a Scribd company logo
1 of 39
Part of Speech Tagging
Perpectivising NLP: Areas of AI and 
their inter-dependencies 
Knowledge 
Search Logic Representation 
Machine 
Learning 
Planning 
Expert 
NLP Vision Robotics Systems
Two pictures 
Problem 
NLP 
Semantics NLP 
nity 
Parsing 
Vision Speech Morph 
Analysis 
HM 
M Statistics and Probability Hindi English 
Language 
CRF + 
Knowledge Based 
MEM 
M 
Algorithm 
N 
Tri 
Part of Speech 
Tagging 
Marathi French
What it is 
POS Tagging is a process that attaches 
each word in a sentence with a suitable 
tag from a given set of tags. 
The set of tags is called the Tag-set. 
Standard Tag-set : Penn Treebank (for 
English).
Definition 
Tagging is the assignment of a 
singlepart-of-speech tag to each word 
(and punctuation marker) in a corpus. 
“_“ The_DT guys_NNS that_WDT 
make_VBP traditional_JJ hardware_NN 
are_VBP really_RB being_VBG 
obsoleted_VBN by_IN microprocessor-based_ 
JJ machines_NNS ,_, ”_” said_VBD 
Mr._NNP Benton_NNP ._.
POS Tags 
NN – Noun; e.g. 
VM – Main Verb; 
Dog_NN 
e.g. Run_VM 
VAUX – AuxiliaryVerb; e.g. Is_VAUX 
JJ – Adjective; e.g. Red_JJ 
PRP – Pronoun; e.g. You_PRP 
NNP– Proper Noun; e.g. John_NNP 
etc.
POS Tag Ambiguity 
In English : I bank1 on the bank2 on the 
river bank3 for 
Bank1 is verb, 
my transactions. 
the other two banks are 
noun 
In Hindi : 
”Khaanaa” : can be noun (food) or 
eat) 
verb (to
For Hindi 
Rama achhaa gaata hai. (hai is VAUX : 
Auxiliary verb); Ram sings well 
Rama achha ladakaa hai. (hai is VCOP : 
Copula verb); Ram is a good boy
Process 
List all possible tag for each word in 
sentence. 
Choose best suitable tag sequence.
Example 
”People jump high”. 
People : Noun/Verb 
jump : Noun/Verb 
high : Noun/Verb/Adjective 
We can start with probabilities.
Importance of POS tagging 
Ack: presentation by Claire 
Gardent on POS tagging by NLTK
What is Part of Speech (POS) 
Words can be divided into classes 
behave similarly. 
Traditionally eight parts of speech 
that 
in 
English: noun, verb, pronoun, 
preposition, adverb, 
adjective and article 
More recently larger 
conjunction, 
sets have been 
used: e.g. Penn Treebank (45 tags), 
Susanne (353 tags).
Why POS 
POS tell us a lot about a word (and the 
words near it). 
E.g, adjectives often followed by nouns 
personal pronouns often followed by verbs 
possessive pronouns by nouns 
Pronunciations depends on POS, e.g. 
object (first syllable NN, second syllable 
VM), content, discount 
First step in many NLP applications
Categories of POS 
Open and closed classes 
Closed classes have a fixed membership of 
words: determiners, pronouns, prepositions 
Closed class words are usually function 
word: frequently occurring, 
grammatically important, often short 
(e.g. of, it, the, in) 
Open classes: nouns, verbs, adjectives 
and adverbs(allow new addition of word)
Open Class (1/2) 
Nouns: 
Proper nouns (Scotland, BBC), 
common nouns 
count nouns (goat, glass) 
mass nouns (snow, pacifism) 
Verbs: 
actions and processes (run, hope) 
also auxiliary verbs (is, are, am, will, can)
Open Class (2/2) 
Adjectives: 
properties and qualities 
value) 
Adverbs: 
(age, colour, 
modify verbs, or verb phrases, or other 
adverbs- Unfortunately John walked home 
extremely slowly yesterday 
Sentential adverb: unfortunately 
Manner adverb: extremely, slowly 
Time adverb: yesterday
Closed class 
Prepositions: on, under, over, to, with, 
by 
Determiners: the, a, an, some 
Pronouns: she, you, I, who 
Conjunctions: and, but, or, as, when, if 
Auxiliary verbs: can, may, are
Penn tagset (1/2)
Penn tagset (2/2)
Indian 
Noun 
Language Tagset:
Indian Language Tagset: 
Pronoun
Indian Language Tagset: 
Quantifier
Indian Language Tagset: 
Demonstrative 
3 Demonstrative DM DM Vaha, jo, 
yaha, 
3.1 Deictic DMD DM DMD Vaha, yaha 
3.2 Relative DMR DM DMR jo, jis 
3.3 Wh-word DMQ DM DMQ kis, kaun 
Indefinite DMI DM DMI KoI, kis
Indian Language Tagset: 
Verb, Adjective, Adverb
Indian Language Tagset: 
Postposition, conjunction
Indian Language Tagset: 
Particle
Indian Language Tagset: 
Residuals
Bigram 
Best tag sequence 
Assumption 
= 
= 
= 
T* 
argmax P(T|W) 
argmax P(T)P(W|T) (by Baye’s Theorem) 
P(T) = P(t0=^ t1t2 … tn+1=.) 
= P(t0)P(t1|t0)P(t2|t1t0)P(t3|t2t1t0) … 
P(tn|tn-1tn-2…t0)P(tn+1|tntn-1…t0) 
= P(t0)P(t1|t0)P(t2|t1) … P(tn|tn-1)P(tn+1|tn) 
N+1 
Π 
i = 0 
= P(ti|ti-1) Bigram Assumption
Lexical Probability Assumption 
P(W|T) = P(w0|t0-tn+1)P(w1|w0t0-tn+1)P(w2|w1w0t0-tn+1) … 
P(wn|w0-wn-1t0-tn+1)P(wn+1|w0-wnt0-tn+1) 
Assumption: A word is determined completely by 
inspired by speech recognition 
its tag. This is 
= P(wo|to)P(w1|t1) … P(wn+1|tn+1) 
n+1 
=Π P(wi|ti) 
i = 0 
n+1 
= Π P(wi|ti) 
i = 1 
(Lexical Probability Assumption)
Generative Model 
^_^ People_N Jump_V High_R ._ 
. 
Lexical 
Probabilit 
ies 
^ N V A . 
V N N Bigram 
Probabilit 
ies 
A A N 
This model is called Generative model. 
Here words are observed from tags as states. 
This is similar to HMM.
Bigram probabilities 
N V A 
N 0.2 0.7 0.1 
V 0.6 0.2 0.2 
A 0.5 0.2 0.3
Lexical Probability 
People jump high 
10 
10 
10 
N - 5 
10 
- 3 
0.4x10 
-7 
V - 7 10 -2 10 -7 
A 0 0 -1 
values in cell are P(col-heading/row-heading)
Calculation from 
Corpus 
actual data 
^ Ram got many NLP books. He found them 
all very interesting. 
Pos Tagged 
^N V A N N . N V N A R A .
Recording numbers 
^ N V A R . 
^ 0 2 0 0 0 0 
N 0 1 2 1 0 1 
V 0 1 0 1 0 0 
A 0 1 0 0 1 1 
R 0 0 0 1 0 0 
. 1 0 0 0 0 0
Probabilities 
^ N V A R . 
^ 0 1 0 0 0 0 
N 0 1/5 2/5 1/5 0 1/5 
V 0 1/2 0 1/2 0 0 
A 0 1/3 0 0 1/3 1/3 
R 0 0 0 1 0 0 
. 1 0 0 0 0 0
To find 
T* = argmax (P(T) P(W/T)) 
P(T).P(W/T) = Π P( ti / ti+1 ).P(wi /ti) 
i=1 n 
P( ti / ti+1 ) : Bigram probability 
P(wi /ti): Lexical probability
Bigram probabilities 
N V A R 
N 0.15 0.7 0.05 0.1 
V 0.6 0.2 0.1 0.1 
A 0.5 0.2 0.3 0 
R 0.1 0.3 0.5 0.1
Lexical Probability 
People jump high 
10 
10 
10 
N 
-5 
10 
-3 
0.4x10 -7 
V 
-7 
10 
-2 
10 -7 
A 0 0 -1 
R 0 0 0 
values in cell are P(col-heading/row-heading)

More Related Content

What's hot

Introduction to Named Entity Recognition
Introduction to Named Entity RecognitionIntroduction to Named Entity Recognition
Introduction to Named Entity RecognitionTomer Lieber
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Natural language processing
Natural language processingNatural language processing
Natural language processingKarenVacca
 
Natural language processing and transformer models
Natural language processing and transformer modelsNatural language processing and transformer models
Natural language processing and transformer modelsDing Li
 
Natural Language Processing seminar review
Natural Language Processing seminar review Natural Language Processing seminar review
Natural Language Processing seminar review Jayneel Vora
 
Natural language processing
Natural language processingNatural language processing
Natural language processingYogendra Tamang
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingHemantha Kulathilake
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processingrohitnayak
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processingMinh Pham
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
word level analysis
word level analysis word level analysis
word level analysis tjs1
 

What's hot (20)

Introduction to Named Entity Recognition
Introduction to Named Entity RecognitionIntroduction to Named Entity Recognition
Introduction to Named Entity Recognition
 
Text summarization
Text summarizationText summarization
Text summarization
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing and transformer models
Natural language processing and transformer modelsNatural language processing and transformer models
Natural language processing and transformer models
 
Natural Language Processing seminar review
Natural Language Processing seminar review Natural Language Processing seminar review
Natural Language Processing seminar review
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
NLP_KASHK:Minimum Edit Distance
NLP_KASHK:Minimum Edit DistanceNLP_KASHK:Minimum Edit Distance
NLP_KASHK:Minimum Edit Distance
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
NLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological ParsingNLP_KASHK:Finite-State Morphological Parsing
NLP_KASHK:Finite-State Morphological Parsing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
8 issues in pos tagging
8 issues in pos tagging8 issues in pos tagging
8 issues in pos tagging
 
word level analysis
word level analysis word level analysis
word level analysis
 
Word2Vec
Word2VecWord2Vec
Word2Vec
 

Viewers also liked

Current state of the art pos tagging for indian languages – a study
Current state of the art pos tagging for indian languages – a studyCurrent state of the art pos tagging for indian languages – a study
Current state of the art pos tagging for indian languages – a studyiaemedu
 
A Connectionist approach to Part-Of-Speech Tagging
A Connectionist approach to Part-Of-Speech TaggingA Connectionist approach to Part-Of-Speech Tagging
A Connectionist approach to Part-Of-Speech TaggingFrancisco Zamora-Martinez
 
Part of speech tagger
Part of speech taggerPart of speech tagger
Part of speech taggerarteimi
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for SecurityTao Xie
 
Crash-course in Natural Language Processing
Crash-course in Natural Language ProcessingCrash-course in Natural Language Processing
Crash-course in Natural Language ProcessingVsevolod Dyomkin
 
CS571:: Part of-Speech Tagging
CS571:: Part of-Speech TaggingCS571:: Part of-Speech Tagging
CS571:: Part of-Speech TaggingJinho Choi
 
Natural Language Processing: L02 words
Natural Language Processing: L02 wordsNatural Language Processing: L02 words
Natural Language Processing: L02 wordsananth
 
L05 language model_part2
L05 language model_part2L05 language model_part2
L05 language model_part2ananth
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
 
English : Part of speech
English : Part of speech English : Part of speech
English : Part of speech Sol Sid
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 

Viewers also liked (19)

Current state of the art pos tagging for indian languages – a study
Current state of the art pos tagging for indian languages – a studyCurrent state of the art pos tagging for indian languages – a study
Current state of the art pos tagging for indian languages – a study
 
A Connectionist approach to Part-Of-Speech Tagging
A Connectionist approach to Part-Of-Speech TaggingA Connectionist approach to Part-Of-Speech Tagging
A Connectionist approach to Part-Of-Speech Tagging
 
TDC 1 - Class 2
TDC 1 - Class 2TDC 1 - Class 2
TDC 1 - Class 2
 
Part of speech tagger
Part of speech taggerPart of speech tagger
Part of speech tagger
 
TDC 1 - Class 2
TDC 1 - Class 2TDC 1 - Class 2
TDC 1 - Class 2
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
L3 v2
L3 v2L3 v2
L3 v2
 
Ignat vita artur
Ignat vita arturIgnat vita artur
Ignat vita artur
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for Security
 
Ngrams smoothing
Ngrams smoothingNgrams smoothing
Ngrams smoothing
 
Crash-course in Natural Language Processing
Crash-course in Natural Language ProcessingCrash-course in Natural Language Processing
Crash-course in Natural Language Processing
 
CS571:: Part of-Speech Tagging
CS571:: Part of-Speech TaggingCS571:: Part of-Speech Tagging
CS571:: Part of-Speech Tagging
 
Natural Language Processing: L02 words
Natural Language Processing: L02 wordsNatural Language Processing: L02 words
Natural Language Processing: L02 words
 
OpenNLP demo
OpenNLP demoOpenNLP demo
OpenNLP demo
 
L05 language model_part2
L05 language model_part2L05 language model_part2
L05 language model_part2
 
NLP
NLPNLP
NLP
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
 
English : Part of speech
English : Part of speech English : Part of speech
English : Part of speech
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 

Similar to Natural Language processing Parts of speech tagging, its classes, and how to process it

Natural Language parsing.pptx
Natural Language parsing.pptxNatural Language parsing.pptx
Natural Language parsing.pptxsiddhantroy13
 
Segmenting dna sequence into words
Segmenting dna sequence into wordsSegmenting dna sequence into words
Segmenting dna sequence into wordsLiang Wang
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflowseungwoo kim
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Taggingtheyaseen51
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationssChandan Deb
 
Lecture 7- Text Statistics and Document Parsing
Lecture 7- Text Statistics and Document ParsingLecture 7- Text Statistics and Document Parsing
Lecture 7- Text Statistics and Document ParsingSean Golliher
 
Natural Language Processing made easy
Natural Language Processing made easyNatural Language Processing made easy
Natural Language Processing made easyGopi Krishnan Nambiar
 
natural language processing
natural language processing natural language processing
natural language processing sunanthakrishnan
 
NLTK: Natural Language Processing made easy
NLTK: Natural Language Processing made easyNLTK: Natural Language Processing made easy
NLTK: Natural Language Processing made easyoutsider2
 
Data Analytics using R with Yelp Dataset
Data Analytics using R with Yelp DatasetData Analytics using R with Yelp Dataset
Data Analytics using R with Yelp DatasetCédric Poottaren
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learningtelss09
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
MEBI 591C/598 – Data and Text Mining in Biomedical Informatics
MEBI 591C/598 – Data and Text Mining in Biomedical InformaticsMEBI 591C/598 – Data and Text Mining in Biomedical Informatics
MEBI 591C/598 – Data and Text Mining in Biomedical Informaticsbutest
 
Recommender systems
Recommender systemsRecommender systems
Recommender systemsVenkat Raman
 
ToC_M1L3_Grammar and Derivation.pdf
ToC_M1L3_Grammar and Derivation.pdfToC_M1L3_Grammar and Derivation.pdf
ToC_M1L3_Grammar and Derivation.pdfjaishreemane73
 
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Chunyang Chen
 
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...Lifeng (Aaron) Han
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.pptbutest
 

Similar to Natural Language processing Parts of speech tagging, its classes, and how to process it (20)

Text Mining Analytics 101
Text Mining Analytics 101Text Mining Analytics 101
Text Mining Analytics 101
 
Natural Language parsing.pptx
Natural Language parsing.pptxNatural Language parsing.pptx
Natural Language parsing.pptx
 
Segmenting dna sequence into words
Segmenting dna sequence into wordsSegmenting dna sequence into words
Segmenting dna sequence into words
 
NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflow
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationss
 
Lecture 7- Text Statistics and Document Parsing
Lecture 7- Text Statistics and Document ParsingLecture 7- Text Statistics and Document Parsing
Lecture 7- Text Statistics and Document Parsing
 
Natural Language Processing made easy
Natural Language Processing made easyNatural Language Processing made easy
Natural Language Processing made easy
 
natural language processing
natural language processing natural language processing
natural language processing
 
NLTK: Natural Language Processing made easy
NLTK: Natural Language Processing made easyNLTK: Natural Language Processing made easy
NLTK: Natural Language Processing made easy
 
Data Analytics using R with Yelp Dataset
Data Analytics using R with Yelp DatasetData Analytics using R with Yelp Dataset
Data Analytics using R with Yelp Dataset
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learning
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
MEBI 591C/598 – Data and Text Mining in Biomedical Informatics
MEBI 591C/598 – Data and Text Mining in Biomedical InformaticsMEBI 591C/598 – Data and Text Mining in Biomedical Informatics
MEBI 591C/598 – Data and Text Mining in Biomedical Informatics
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
ToC_M1L3_Grammar and Derivation.pdf
ToC_M1L3_Grammar and Derivation.pdfToC_M1L3_Grammar and Derivation.pdf
ToC_M1L3_Grammar and Derivation.pdf
 
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
 
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...
GSCL2013.Phrase Tagset Mapping for French and English Treebanks and Its Appli...
 
Nltk
NltkNltk
Nltk
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 

Recently uploaded

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSrknatarajan
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 

Recently uploaded (20)

Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 

Natural Language processing Parts of speech tagging, its classes, and how to process it

  • 1. Part of Speech Tagging
  • 2. Perpectivising NLP: Areas of AI and their inter-dependencies Knowledge Search Logic Representation Machine Learning Planning Expert NLP Vision Robotics Systems
  • 3. Two pictures Problem NLP Semantics NLP nity Parsing Vision Speech Morph Analysis HM M Statistics and Probability Hindi English Language CRF + Knowledge Based MEM M Algorithm N Tri Part of Speech Tagging Marathi French
  • 4. What it is POS Tagging is a process that attaches each word in a sentence with a suitable tag from a given set of tags. The set of tags is called the Tag-set. Standard Tag-set : Penn Treebank (for English).
  • 5. Definition Tagging is the assignment of a singlepart-of-speech tag to each word (and punctuation marker) in a corpus. “_“ The_DT guys_NNS that_WDT make_VBP traditional_JJ hardware_NN are_VBP really_RB being_VBG obsoleted_VBN by_IN microprocessor-based_ JJ machines_NNS ,_, ”_” said_VBD Mr._NNP Benton_NNP ._.
  • 6. POS Tags NN – Noun; e.g. VM – Main Verb; Dog_NN e.g. Run_VM VAUX – AuxiliaryVerb; e.g. Is_VAUX JJ – Adjective; e.g. Red_JJ PRP – Pronoun; e.g. You_PRP NNP– Proper Noun; e.g. John_NNP etc.
  • 7. POS Tag Ambiguity In English : I bank1 on the bank2 on the river bank3 for Bank1 is verb, my transactions. the other two banks are noun In Hindi : ”Khaanaa” : can be noun (food) or eat) verb (to
  • 8. For Hindi Rama achhaa gaata hai. (hai is VAUX : Auxiliary verb); Ram sings well Rama achha ladakaa hai. (hai is VCOP : Copula verb); Ram is a good boy
  • 9. Process List all possible tag for each word in sentence. Choose best suitable tag sequence.
  • 10. Example ”People jump high”. People : Noun/Verb jump : Noun/Verb high : Noun/Verb/Adjective We can start with probabilities.
  • 11.
  • 12. Importance of POS tagging Ack: presentation by Claire Gardent on POS tagging by NLTK
  • 13. What is Part of Speech (POS) Words can be divided into classes behave similarly. Traditionally eight parts of speech that in English: noun, verb, pronoun, preposition, adverb, adjective and article More recently larger conjunction, sets have been used: e.g. Penn Treebank (45 tags), Susanne (353 tags).
  • 14. Why POS POS tell us a lot about a word (and the words near it). E.g, adjectives often followed by nouns personal pronouns often followed by verbs possessive pronouns by nouns Pronunciations depends on POS, e.g. object (first syllable NN, second syllable VM), content, discount First step in many NLP applications
  • 15. Categories of POS Open and closed classes Closed classes have a fixed membership of words: determiners, pronouns, prepositions Closed class words are usually function word: frequently occurring, grammatically important, often short (e.g. of, it, the, in) Open classes: nouns, verbs, adjectives and adverbs(allow new addition of word)
  • 16. Open Class (1/2) Nouns: Proper nouns (Scotland, BBC), common nouns count nouns (goat, glass) mass nouns (snow, pacifism) Verbs: actions and processes (run, hope) also auxiliary verbs (is, are, am, will, can)
  • 17. Open Class (2/2) Adjectives: properties and qualities value) Adverbs: (age, colour, modify verbs, or verb phrases, or other adverbs- Unfortunately John walked home extremely slowly yesterday Sentential adverb: unfortunately Manner adverb: extremely, slowly Time adverb: yesterday
  • 18. Closed class Prepositions: on, under, over, to, with, by Determiners: the, a, an, some Pronouns: she, you, I, who Conjunctions: and, but, or, as, when, if Auxiliary verbs: can, may, are
  • 24. Indian Language Tagset: Demonstrative 3 Demonstrative DM DM Vaha, jo, yaha, 3.1 Deictic DMD DM DMD Vaha, yaha 3.2 Relative DMR DM DMR jo, jis 3.3 Wh-word DMQ DM DMQ kis, kaun Indefinite DMI DM DMI KoI, kis
  • 25. Indian Language Tagset: Verb, Adjective, Adverb
  • 26. Indian Language Tagset: Postposition, conjunction
  • 29. Bigram Best tag sequence Assumption = = = T* argmax P(T|W) argmax P(T)P(W|T) (by Baye’s Theorem) P(T) = P(t0=^ t1t2 … tn+1=.) = P(t0)P(t1|t0)P(t2|t1t0)P(t3|t2t1t0) … P(tn|tn-1tn-2…t0)P(tn+1|tntn-1…t0) = P(t0)P(t1|t0)P(t2|t1) … P(tn|tn-1)P(tn+1|tn) N+1 Π i = 0 = P(ti|ti-1) Bigram Assumption
  • 30. Lexical Probability Assumption P(W|T) = P(w0|t0-tn+1)P(w1|w0t0-tn+1)P(w2|w1w0t0-tn+1) … P(wn|w0-wn-1t0-tn+1)P(wn+1|w0-wnt0-tn+1) Assumption: A word is determined completely by inspired by speech recognition its tag. This is = P(wo|to)P(w1|t1) … P(wn+1|tn+1) n+1 =Π P(wi|ti) i = 0 n+1 = Π P(wi|ti) i = 1 (Lexical Probability Assumption)
  • 31. Generative Model ^_^ People_N Jump_V High_R ._ . Lexical Probabilit ies ^ N V A . V N N Bigram Probabilit ies A A N This model is called Generative model. Here words are observed from tags as states. This is similar to HMM.
  • 32. Bigram probabilities N V A N 0.2 0.7 0.1 V 0.6 0.2 0.2 A 0.5 0.2 0.3
  • 33. Lexical Probability People jump high 10 10 10 N - 5 10 - 3 0.4x10 -7 V - 7 10 -2 10 -7 A 0 0 -1 values in cell are P(col-heading/row-heading)
  • 34. Calculation from Corpus actual data ^ Ram got many NLP books. He found them all very interesting. Pos Tagged ^N V A N N . N V N A R A .
  • 35. Recording numbers ^ N V A R . ^ 0 2 0 0 0 0 N 0 1 2 1 0 1 V 0 1 0 1 0 0 A 0 1 0 0 1 1 R 0 0 0 1 0 0 . 1 0 0 0 0 0
  • 36. Probabilities ^ N V A R . ^ 0 1 0 0 0 0 N 0 1/5 2/5 1/5 0 1/5 V 0 1/2 0 1/2 0 0 A 0 1/3 0 0 1/3 1/3 R 0 0 0 1 0 0 . 1 0 0 0 0 0
  • 37. To find T* = argmax (P(T) P(W/T)) P(T).P(W/T) = Π P( ti / ti+1 ).P(wi /ti) i=1 n P( ti / ti+1 ) : Bigram probability P(wi /ti): Lexical probability
  • 38. Bigram probabilities N V A R N 0.15 0.7 0.05 0.1 V 0.6 0.2 0.1 0.1 A 0.5 0.2 0.3 0 R 0.1 0.3 0.5 0.1
  • 39. Lexical Probability People jump high 10 10 10 N -5 10 -3 0.4x10 -7 V -7 10 -2 10 -7 A 0 0 -1 R 0 0 0 values in cell are P(col-heading/row-heading)