SlideShare una empresa de Scribd logo
1 de 59
z
Natural Language
Processing
Overview of
z
Syllabus
 Natural Language Processing
 Natural Language Models
 Syntactic Analysis
 Augmented Grammar
 Semantic Interpretation
 Machine Translation
 Ambiguity and Disambiguation
 Discourse understanding
 Grammar Induction
z
Natural Language processing (NLP)
 NLP allows computer interaction with humans
 A field in Artificial Intelligence
Computer LinguisticsNLP
Fig 1: NLP
Fig 2: Research Area in NLP
AI
ML
z
Trending Topics in Natural Language Processing
 Natural Language Understanding
 Natural Language Generation
 Text Extraction
 Language Translation
 Parsing
 Parts of Speech Tagging
Fig 3 : Projection of NLP projects
z
Components of NLP
z
Language Translation
Text Extraction & Parsing and
tagging parts of speech
z
NLPApplications
 Sentimental / opinion analysis
 Chatbots
 Speech Recognition
 Machine Translation
 Spell checking
 Keyword searching
 Information retrieval
 Advertisement Matching / Trending
/Linking connects
z
Research Topics in Natural Language Processing
 Named Entity Recognition
 Hamming Problem
 Neural networkbased Transition & parsing
 Ontology
 Dependency parsing
 Query entity recognising & Disambiguity
 Sentiment analysis & mining
 Text Categorization and Summarization
 Online Browsing
 Text Mining
 Plagiarism Detection
 Information retrieval
 Machine translation
 Speech recognition
 Deep learning in NLP
 Opinion analysis & mining
 Text to 3D scene Generation
 Sentence completion
z
Applications of Natural Language Processing
 Biomedical
 Forensic Science
 Advertisement
 Education
 Politics
 E-governance
 Business Development
 Marketing
zTools & Software : Purpose
 Stanford NLP : provide models files for analysis of English, written in java
 Apache Opennlp: provide support for common NLP task such as tokenization, sentence Segmentation.
 Jig LDA nlp: used for parameter estimation & inference implemented in java
 Scala NLP: Umbrella project for several libraries, including Breeze ,Epic
 Apache Lucene Core: full-featured text Search engine library implemented in java
 GateNLP: Java suits of tools which include information extraction support system to support various Lang
 NLTK: build a python program to work with human language
z NLP - Stages
Editors
Jupyter NB
Google Collab
Pychram
Software Libraries
NLTK
TensorFlow
Keras
Pytorch Pragmatic Analysis
Disclosure Integration
Semantic Analysis
Syntax Analysis
Lexical Analysis
z
z
Detail Insight
z
NL Models
z
Where do we need ML
z
z
z
z
z
z
z
z
z
z
z
z
z
z
z
Search engine / Document classification / Feedback analysis
z
z
z
How to check the syntax of the sentence ?
z
Parsing & Approaches – Syntax Analysis
 Parsing - taking input text and giving structural representation to it after
checking the syntax as per formal grammar ( rule ).
 Top-down Parsing
The parser starts constructing the parse tree from the start
symbol and then tries to transform the start symbol to the
input.
 Bottom-up Parsing
The parser starts with the input symbol and tries to
construct the parser tree up to the start symbol.
z
Top Down & Bottom Up Approaches
z
How to get /Solve this structure / Grammar of POS?
Parts of Speech (POS) ?
Subject , Object , Predicate !
Grammar ?
z
Grammar – Example
 S->NP VP
 S-> VP NP
 S->NP VP NP
 NP -> Det Noun | NP->Noun |Nominal
 VP-> Verb NP | V | Verb NP PP | V PP
 V->Verb
 Det-> Det | Article |Aux
L
H
S
R
H
S
Terminal
Non
Terminal
z
Context Free Grammar / Backus Norm
Form / Phrase Structure Gramme r
 CFG has 4 components.
G={ V,T,P,S}
Set of Non-Terminals: ( It is denoted by V). The non-terminals are syntactic variables that
denote the sets of strings, such as Verb Phrase or noun phrase.( LHS of a Grammar )
Set of Terminals: ( It is denoted by T). Strings are further cannot be sub divided like Noun,
Verb, determinant, article, auxiliary ( RHS of a Grammar )
Set of Production Rule : (It is denoted by P). The rule to defines how the terminals and
non-terminals can be combined. Every production(P) consists of non-terminals, an arrow,
and terminals
Start Symbol: (( It is denoted by S)The production begins from the start symbol. Non-
terminal symbol is always designated as start symbol.
z
CFG - Construct the parsing
To solve :- The flight includes a meal
 S -> NP VP NP
 S -> NP VP
 S -> VP NP
 VP -> V NP
 NP - > Det N
 V -> Verb
 Verb -> includes
 Det -> the
 Det ->a
 N -> Flight
 N -> Meal
S
NP VP
Det
Flight
NP NP
Det N
the Meala
N
includes
V
S
NP VP
Det
Flight
NP NP
Det
the a
N
includes
V
Meal
N
N
z
Some Difficult Examples
 From the newspapers:
 Squad helps dog bite victim.
 Helicopter powered by human flies.
 Levy won’t hurt the poor.
 Once-sagging cloth diaper industry saved by full
dumps.
 Ambiguities:
 Lexical: meanings of ‘hot’, ‘back’.
 Syntactic: I heard the music in my room.
 Referential: The cat ate the mouse. It was ugly.
z
CKY uses C Norm Form
 When Sentence contain
 Ambiguity
 Recursive / Repeated Sub Structure ,
 How to resolve ?
 CNF
Allowed Rules in CNF
S -> B ( single terminal) NP -> the N (incorrect ), so we introduce dummy variable
S->B C ( 2 non terminal ) NP -> Det N
NP -> N pp Det -> the ( Dummy Variable)
z
Grammar to CNF conversion
z
CKY – checking / construct parsing Structure
 0 The 1 Flight 2 includes 3 the 4 Meal 5
 Representation chart
1 2 3 4 5
0 Det
1 N
2 V
3 Det
4 N
z
CKY with Probability
z
Augmented Grammar :
 On the fly if the given sentence
generates a new grammar , add
that rule to the rule table , this is
referred as AG.
 A -> B, B ->C, A -> C
 e.g. Students like coffee.
 Todd likes coffee.
 Todd like coffee.
Examples
S -> NP[number] VP[number]
NP[number] -> N[number]
N[number=singular] -> “Todd”
N[number=plural] -> “students”
VP[number] -> V[number] NP
V[number=singular] -> “likes”
V[number=plural] -> “like”
z
z
Word Net
Contain DB of Nouns, Verbs, Adjectives & Adverbs)
 Ambiguity : single word having multiple meaning (e.g. bank)
 Synonyms: similar words (e.g. big, large)(fare/price)- oddity
 Antonyms: (big, small , good bad, fast slow )
 Complementary pair: ( male female, alive dead, present dead)
 Relation pair: ( married- not single , single – not married )
 Hyponymy: ( vehicle(car))
 Meronymy : ( part of a whole (apple <- apple tree))
 Homonymy: ( whole to a part (apple tree ->apple))
z
Semantic Analysis- Building blocks
 Entities − It represents the individual such as a particular person, location etc.
For example, Haryana. India, Ram all are entities.
 Concepts − It represents the general category of the individuals such as a
person, city, etc.
 Relations − It represents the relationship between entities and concept. For
example, Ram is a person.
 Predicates − It represents the verb structures. For example, semantic roles and
case grammar are the examples of predicates.
z
Semantic Analysis
 Two approaches FOL / WordNet
 First Order Logic
Flight 707 serve lunch S -> Np Vp ( DCL( Np Vp ))
Server lunch S -> IMP( VP NP )
Does Flight 207 server lunch S -> Aux NP VP ( YNQ( NP VP ))
Which flight server lunch S -> (WHQ (NP VP))
Atlanta’s airport S -> N VIP (GN ( N ))
I told harry to go the queen ( infinite verb phrase) S -> NP VP NP ( S-> NP λ VP NP))
z
z
Discourse Integration
Let S0 and S1 to represent the meaning of the two related sentences i.e. Text Coherence
 Results: It infers that the state asserted by term S0 could cause the state asserted by S1.
e.g. Ram was caught in the fire. His skin burned.
 Explanation: It infers that the state asserted by S1 could cause the state asserted by S0.
e.g. Ram fought with Shyam’s friend. He was drunk.
 Parallel: It infers p(a1,a2,…) of S0 & p(b1,b2,…) from S1. Here ai and bi are similar for all i.
e.g. Ram wanted car. Shyam wanted money.
 Elaboration : It infers the same proposition P from both the assertions − S0 and S1 for
e,g, Ram was from Chandigarh. Shyam was from Kerala.
 Occasion: It happens when a change of state can be inferred from the assertion of S0, final state of
which can be inferred from S1 and vice-versa.
e.g. Ram picked up the book. He gave it to Shyam.
z
Example of Discourse Integration
S1 − Ram went to the bank to deposit money.
S2 − He then took a train to Shyam’s cloth shop.
S3 − He wanted to buy some clothes.
S4 − He do not have new clothes for party.
S5 − He also wanted to talk to Shyam regarding
his health
z
Grammar Induction :
 Unsupervised learning of a language’s syntax from a corpus of observed
sentences
 – Ability to uncover an underlying grammar
 – Ability to parse
 – Ability to judge grammaticality
 solve using parsing 1. CFG 2. CKY form and generate a parsed tree or by
Language models like Markov Chain Model
 Demonstration of sentence completion using Grammar Induction using NN
z
NLP - Applications
Pragmatic
Analysis
Disclosure
Integration
Semantic
Analysis
Syntactic
Analysis
Lexical
Analysis
Search engine,
Recommendation
system
Information retrieval,
Text summarization
and Information
extraction, sentence
completion
Grammar checking
Sentimental / Sarcasm
/ Opinion, Machine
Translations
Grammar checking
Word Dictionary,
Page Ranking
z
54
Speech Recognition
 Human languages are limited to a set of about
40 to 50 distinct sounds called phones: e.g.,
 [ey] bet
 [ah] but
 [oy] boy
 [em] bottom
 [en] button
 These phones are characterized in terms of acoustic features, e.g.,
frequency and amplitude, that can be extracted from the sound
waves
z
55
Difficulties
 Why isn't this easy?
 just develop a dictionary of pronunciation
e.g., coat = [k] + [ow] + [t] = [kowt]
 but: recognize speech  wreck a nice beach
 Problems:
 homophones: different fragments sound the same
 e.g., rec and wreck
 segmentation: determining breaks between words
 e.g., nize speech and nice beach
 signal processing problems
56
Speech Recognition Architecture
• Large vocabulary, continuous speech (words not separated), speaker-
independent
Speech
Waveform
Spectral
Feature
Vectors
Phone
Likelihoods
P(o|q)
Words
Feature Extraction
(Signal Processing)
Phone Likelihood
Estimation (Gaussians
or Neural Networks)
Decoding (Viterbi
or Stack Decoder)
Neural Net
N-gram Grammar
HMM Lexicon
z
57
Signal Processing
 Sound is an analog energy source resulting from pressure waves
striking an eardrum or microphone
 A device called an analog-to-digital converter can be used to record
the speech sounds
 sampling rate: the number of times per second that the sound level is
measured
 quantization factor: the maximum number of bits of precision for the
sound level measurements
 e.g., telephone: 3 KHz (3000 times per second)
 e.g., speech recognizer: 8 KHz with 8 bit samples
so that 1 minute takes about 500K bytes
z
References
 https://www.youtube.com/watch?v=GiyMGBuu45w&t=3s
 https://www.youtube.com/watch?v=iGmHnICXDss&t=12s
 https://www.slideshare.net/HansiThenuwara/natural-language-processing-64271235
 https://www.tutorialspoint.com/natural_language_processing/natural_language_processing_syntactic_analysis.
htm
 https://www.tutorialspoint.com/natural_language_processing/natural_language_processing_syntactic_analysis.
htm
 https://www.youtube.com/watch?v=DPi5rfTX6mw&t=1050s
 https://www.youtube.com/watch?v=SFQ-owZaU_s
 https://personal.utdallas.edu/~sanda/courses/NLP/Lecture10.pdf
 https://nlp.stanford.edu/projects/project-induction.shtml
 https://nlp.stanford.edu/IR-book/html/htmledition/types-of-language-models-1.html
z
Thank You

Más contenido relacionado

La actualidad más candente

Artificial Intelligence Searching Techniques
Artificial Intelligence Searching TechniquesArtificial Intelligence Searching Techniques
Artificial Intelligence Searching TechniquesDr. C.V. Suresh Babu
 
Knowledge representation in AI
Knowledge representation in AIKnowledge representation in AI
Knowledge representation in AIVishal Singh
 
Control Strategies in AI
Control Strategies in AIControl Strategies in AI
Control Strategies in AIAmey Kerkar
 
Relationship Among Token, Lexeme & Pattern
Relationship Among Token, Lexeme & PatternRelationship Among Token, Lexeme & Pattern
Relationship Among Token, Lexeme & PatternBharat Rathore
 
Genetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsGenetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsDr. C.V. Suresh Babu
 
Propositional logic
Propositional logicPropositional logic
Propositional logicRushdi Shams
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
Agents in Artificial intelligence
Agents in Artificial intelligence Agents in Artificial intelligence
Agents in Artificial intelligence Lalit Birla
 
Natural language processing
Natural language processingNatural language processing
Natural language processingHansi Thenuwara
 
ProLog (Artificial Intelligence) Introduction
ProLog (Artificial Intelligence) IntroductionProLog (Artificial Intelligence) Introduction
ProLog (Artificial Intelligence) Introductionwahab khan
 
Informed and Uninformed search Strategies
Informed and Uninformed search StrategiesInformed and Uninformed search Strategies
Informed and Uninformed search StrategiesAmey Kerkar
 
Three Address code
Three Address code Three Address code
Three Address code Pooja Dixit
 
Forward and Backward chaining in AI
Forward and Backward chaining in AIForward and Backward chaining in AI
Forward and Backward chaining in AIMegha Sharma
 
Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)swapnac12
 

La actualidad más candente (20)

Artificial Intelligence Searching Techniques
Artificial Intelligence Searching TechniquesArtificial Intelligence Searching Techniques
Artificial Intelligence Searching Techniques
 
Adversarial search
Adversarial search Adversarial search
Adversarial search
 
Recognition-of-tokens
Recognition-of-tokensRecognition-of-tokens
Recognition-of-tokens
 
Knowledge representation in AI
Knowledge representation in AIKnowledge representation in AI
Knowledge representation in AI
 
search strategies in artificial intelligence
search strategies in artificial intelligencesearch strategies in artificial intelligence
search strategies in artificial intelligence
 
Parsing
ParsingParsing
Parsing
 
Control Strategies in AI
Control Strategies in AIControl Strategies in AI
Control Strategies in AI
 
Asymptotic Notation
Asymptotic NotationAsymptotic Notation
Asymptotic Notation
 
Relationship Among Token, Lexeme & Pattern
Relationship Among Token, Lexeme & PatternRelationship Among Token, Lexeme & Pattern
Relationship Among Token, Lexeme & Pattern
 
Genetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsGenetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithms
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Agents in Artificial intelligence
Agents in Artificial intelligence Agents in Artificial intelligence
Agents in Artificial intelligence
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
ProLog (Artificial Intelligence) Introduction
ProLog (Artificial Intelligence) IntroductionProLog (Artificial Intelligence) Introduction
ProLog (Artificial Intelligence) Introduction
 
Informed and Uninformed search Strategies
Informed and Uninformed search StrategiesInformed and Uninformed search Strategies
Informed and Uninformed search Strategies
 
Three Address code
Three Address code Three Address code
Three Address code
 
Forward and Backward chaining in AI
Forward and Backward chaining in AIForward and Backward chaining in AI
Forward and Backward chaining in AI
 
5 csp
5 csp5 csp
5 csp
 
Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)Performance analysis(Time & Space Complexity)
Performance analysis(Time & Space Complexity)
 

Similar a natural language processing

Lecture 7: Definite Clause Grammars
Lecture 7: Definite Clause GrammarsLecture 7: Definite Clause Grammars
Lecture 7: Definite Clause GrammarsCS, NcState
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingKaterina Vylomova
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Rajnish Raj
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationssChandan Deb
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMassimo Schenone
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.pptbutest
 
Solutions advanced sb
Solutions advanced sbSolutions advanced sb
Solutions advanced sbmaicanhtinh
 
INFO-2950-Languages-and-Grammars.ppt
INFO-2950-Languages-and-Grammars.pptINFO-2950-Languages-and-Grammars.ppt
INFO-2950-Languages-and-Grammars.pptLamhotNaibaho3
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translationguest873a50
 
Computational model language and grammar bnf
Computational model language and grammar bnfComputational model language and grammar bnf
Computational model language and grammar bnfTaha Shakeel
 
Natural Language parsing.pptx
Natural Language parsing.pptxNatural Language parsing.pptx
Natural Language parsing.pptxsiddhantroy13
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmDhruvKushwaha12
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translationHrishikesh Nair
 

Similar a natural language processing (20)

NLP-my-lecture (3).ppt
NLP-my-lecture (3).pptNLP-my-lecture (3).ppt
NLP-my-lecture (3).ppt
 
Lecture 7: Definite Clause Grammars
Lecture 7: Definite Clause GrammarsLecture 7: Definite Clause Grammars
Lecture 7: Definite Clause Grammars
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
Open nlp presentationss
Open nlp presentationssOpen nlp presentationss
Open nlp presentationss
 
NLP
NLPNLP
NLP
 
NLP
NLPNLP
NLP
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSIS
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 
Text Mining Analytics 101
Text Mining Analytics 101Text Mining Analytics 101
Text Mining Analytics 101
 
Solutions advanced sb
Solutions advanced sbSolutions advanced sb
Solutions advanced sb
 
INFO-2950-Languages-and-Grammars.ppt
INFO-2950-Languages-and-Grammars.pptINFO-2950-Languages-and-Grammars.ppt
INFO-2950-Languages-and-Grammars.ppt
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
 
Computational model language and grammar bnf
Computational model language and grammar bnfComputational model language and grammar bnf
Computational model language and grammar bnf
 
Natural Language parsing.pptx
Natural Language parsing.pptxNatural Language parsing.pptx
Natural Language parsing.pptx
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translation
 
Inteligencia artificial
Inteligencia artificialInteligencia artificial
Inteligencia artificial
 
Nlp
NlpNlp
Nlp
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 

Último

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 

Último (20)

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 

natural language processing

  • 2. z Syllabus  Natural Language Processing  Natural Language Models  Syntactic Analysis  Augmented Grammar  Semantic Interpretation  Machine Translation  Ambiguity and Disambiguation  Discourse understanding  Grammar Induction
  • 3. z Natural Language processing (NLP)  NLP allows computer interaction with humans  A field in Artificial Intelligence Computer LinguisticsNLP Fig 1: NLP Fig 2: Research Area in NLP AI ML
  • 4. z Trending Topics in Natural Language Processing  Natural Language Understanding  Natural Language Generation  Text Extraction  Language Translation  Parsing  Parts of Speech Tagging Fig 3 : Projection of NLP projects
  • 6. z Language Translation Text Extraction & Parsing and tagging parts of speech
  • 7. z NLPApplications  Sentimental / opinion analysis  Chatbots  Speech Recognition  Machine Translation  Spell checking  Keyword searching  Information retrieval  Advertisement Matching / Trending /Linking connects
  • 8. z Research Topics in Natural Language Processing  Named Entity Recognition  Hamming Problem  Neural networkbased Transition & parsing  Ontology  Dependency parsing  Query entity recognising & Disambiguity  Sentiment analysis & mining  Text Categorization and Summarization  Online Browsing  Text Mining  Plagiarism Detection  Information retrieval  Machine translation  Speech recognition  Deep learning in NLP  Opinion analysis & mining  Text to 3D scene Generation  Sentence completion
  • 9. z Applications of Natural Language Processing  Biomedical  Forensic Science  Advertisement  Education  Politics  E-governance  Business Development  Marketing
  • 10. zTools & Software : Purpose  Stanford NLP : provide models files for analysis of English, written in java  Apache Opennlp: provide support for common NLP task such as tokenization, sentence Segmentation.  Jig LDA nlp: used for parameter estimation & inference implemented in java  Scala NLP: Umbrella project for several libraries, including Breeze ,Epic  Apache Lucene Core: full-featured text Search engine library implemented in java  GateNLP: Java suits of tools which include information extraction support system to support various Lang  NLTK: build a python program to work with human language
  • 11. z NLP - Stages Editors Jupyter NB Google Collab Pychram Software Libraries NLTK TensorFlow Keras Pytorch Pragmatic Analysis Disclosure Integration Semantic Analysis Syntax Analysis Lexical Analysis
  • 14. z Where do we need ML
  • 15. z
  • 16. z
  • 17. z
  • 18. z
  • 19. z
  • 20. z
  • 21. z
  • 22. z
  • 23. z
  • 24. z
  • 25. z
  • 26. z
  • 27. z
  • 28. z
  • 29. z Search engine / Document classification / Feedback analysis
  • 30. z
  • 31. z
  • 32. z How to check the syntax of the sentence ?
  • 33. z Parsing & Approaches – Syntax Analysis  Parsing - taking input text and giving structural representation to it after checking the syntax as per formal grammar ( rule ).  Top-down Parsing The parser starts constructing the parse tree from the start symbol and then tries to transform the start symbol to the input.  Bottom-up Parsing The parser starts with the input symbol and tries to construct the parser tree up to the start symbol.
  • 34. z Top Down & Bottom Up Approaches
  • 35. z How to get /Solve this structure / Grammar of POS? Parts of Speech (POS) ? Subject , Object , Predicate ! Grammar ?
  • 36. z Grammar – Example  S->NP VP  S-> VP NP  S->NP VP NP  NP -> Det Noun | NP->Noun |Nominal  VP-> Verb NP | V | Verb NP PP | V PP  V->Verb  Det-> Det | Article |Aux L H S R H S Terminal Non Terminal
  • 37. z Context Free Grammar / Backus Norm Form / Phrase Structure Gramme r  CFG has 4 components. G={ V,T,P,S} Set of Non-Terminals: ( It is denoted by V). The non-terminals are syntactic variables that denote the sets of strings, such as Verb Phrase or noun phrase.( LHS of a Grammar ) Set of Terminals: ( It is denoted by T). Strings are further cannot be sub divided like Noun, Verb, determinant, article, auxiliary ( RHS of a Grammar ) Set of Production Rule : (It is denoted by P). The rule to defines how the terminals and non-terminals can be combined. Every production(P) consists of non-terminals, an arrow, and terminals Start Symbol: (( It is denoted by S)The production begins from the start symbol. Non- terminal symbol is always designated as start symbol.
  • 38. z CFG - Construct the parsing To solve :- The flight includes a meal  S -> NP VP NP  S -> NP VP  S -> VP NP  VP -> V NP  NP - > Det N  V -> Verb  Verb -> includes  Det -> the  Det ->a  N -> Flight  N -> Meal S NP VP Det Flight NP NP Det N the Meala N includes V S NP VP Det Flight NP NP Det the a N includes V Meal N N
  • 39. z Some Difficult Examples  From the newspapers:  Squad helps dog bite victim.  Helicopter powered by human flies.  Levy won’t hurt the poor.  Once-sagging cloth diaper industry saved by full dumps.  Ambiguities:  Lexical: meanings of ‘hot’, ‘back’.  Syntactic: I heard the music in my room.  Referential: The cat ate the mouse. It was ugly.
  • 40. z CKY uses C Norm Form  When Sentence contain  Ambiguity  Recursive / Repeated Sub Structure ,  How to resolve ?  CNF Allowed Rules in CNF S -> B ( single terminal) NP -> the N (incorrect ), so we introduce dummy variable S->B C ( 2 non terminal ) NP -> Det N NP -> N pp Det -> the ( Dummy Variable)
  • 41. z Grammar to CNF conversion
  • 42. z CKY – checking / construct parsing Structure  0 The 1 Flight 2 includes 3 the 4 Meal 5  Representation chart 1 2 3 4 5 0 Det 1 N 2 V 3 Det 4 N
  • 44. z Augmented Grammar :  On the fly if the given sentence generates a new grammar , add that rule to the rule table , this is referred as AG.  A -> B, B ->C, A -> C  e.g. Students like coffee.  Todd likes coffee.  Todd like coffee. Examples S -> NP[number] VP[number] NP[number] -> N[number] N[number=singular] -> “Todd” N[number=plural] -> “students” VP[number] -> V[number] NP V[number=singular] -> “likes” V[number=plural] -> “like”
  • 45. z
  • 46. z Word Net Contain DB of Nouns, Verbs, Adjectives & Adverbs)  Ambiguity : single word having multiple meaning (e.g. bank)  Synonyms: similar words (e.g. big, large)(fare/price)- oddity  Antonyms: (big, small , good bad, fast slow )  Complementary pair: ( male female, alive dead, present dead)  Relation pair: ( married- not single , single – not married )  Hyponymy: ( vehicle(car))  Meronymy : ( part of a whole (apple <- apple tree))  Homonymy: ( whole to a part (apple tree ->apple))
  • 47. z Semantic Analysis- Building blocks  Entities − It represents the individual such as a particular person, location etc. For example, Haryana. India, Ram all are entities.  Concepts − It represents the general category of the individuals such as a person, city, etc.  Relations − It represents the relationship between entities and concept. For example, Ram is a person.  Predicates − It represents the verb structures. For example, semantic roles and case grammar are the examples of predicates.
  • 48. z Semantic Analysis  Two approaches FOL / WordNet  First Order Logic Flight 707 serve lunch S -> Np Vp ( DCL( Np Vp )) Server lunch S -> IMP( VP NP ) Does Flight 207 server lunch S -> Aux NP VP ( YNQ( NP VP )) Which flight server lunch S -> (WHQ (NP VP)) Atlanta’s airport S -> N VIP (GN ( N )) I told harry to go the queen ( infinite verb phrase) S -> NP VP NP ( S-> NP λ VP NP))
  • 49. z
  • 50. z Discourse Integration Let S0 and S1 to represent the meaning of the two related sentences i.e. Text Coherence  Results: It infers that the state asserted by term S0 could cause the state asserted by S1. e.g. Ram was caught in the fire. His skin burned.  Explanation: It infers that the state asserted by S1 could cause the state asserted by S0. e.g. Ram fought with Shyam’s friend. He was drunk.  Parallel: It infers p(a1,a2,…) of S0 & p(b1,b2,…) from S1. Here ai and bi are similar for all i. e.g. Ram wanted car. Shyam wanted money.  Elaboration : It infers the same proposition P from both the assertions − S0 and S1 for e,g, Ram was from Chandigarh. Shyam was from Kerala.  Occasion: It happens when a change of state can be inferred from the assertion of S0, final state of which can be inferred from S1 and vice-versa. e.g. Ram picked up the book. He gave it to Shyam.
  • 51. z Example of Discourse Integration S1 − Ram went to the bank to deposit money. S2 − He then took a train to Shyam’s cloth shop. S3 − He wanted to buy some clothes. S4 − He do not have new clothes for party. S5 − He also wanted to talk to Shyam regarding his health
  • 52. z Grammar Induction :  Unsupervised learning of a language’s syntax from a corpus of observed sentences  – Ability to uncover an underlying grammar  – Ability to parse  – Ability to judge grammaticality  solve using parsing 1. CFG 2. CKY form and generate a parsed tree or by Language models like Markov Chain Model  Demonstration of sentence completion using Grammar Induction using NN
  • 53. z NLP - Applications Pragmatic Analysis Disclosure Integration Semantic Analysis Syntactic Analysis Lexical Analysis Search engine, Recommendation system Information retrieval, Text summarization and Information extraction, sentence completion Grammar checking Sentimental / Sarcasm / Opinion, Machine Translations Grammar checking Word Dictionary, Page Ranking
  • 54. z 54 Speech Recognition  Human languages are limited to a set of about 40 to 50 distinct sounds called phones: e.g.,  [ey] bet  [ah] but  [oy] boy  [em] bottom  [en] button  These phones are characterized in terms of acoustic features, e.g., frequency and amplitude, that can be extracted from the sound waves
  • 55. z 55 Difficulties  Why isn't this easy?  just develop a dictionary of pronunciation e.g., coat = [k] + [ow] + [t] = [kowt]  but: recognize speech  wreck a nice beach  Problems:  homophones: different fragments sound the same  e.g., rec and wreck  segmentation: determining breaks between words  e.g., nize speech and nice beach  signal processing problems
  • 56. 56 Speech Recognition Architecture • Large vocabulary, continuous speech (words not separated), speaker- independent Speech Waveform Spectral Feature Vectors Phone Likelihoods P(o|q) Words Feature Extraction (Signal Processing) Phone Likelihood Estimation (Gaussians or Neural Networks) Decoding (Viterbi or Stack Decoder) Neural Net N-gram Grammar HMM Lexicon
  • 57. z 57 Signal Processing  Sound is an analog energy source resulting from pressure waves striking an eardrum or microphone  A device called an analog-to-digital converter can be used to record the speech sounds  sampling rate: the number of times per second that the sound level is measured  quantization factor: the maximum number of bits of precision for the sound level measurements  e.g., telephone: 3 KHz (3000 times per second)  e.g., speech recognizer: 8 KHz with 8 bit samples so that 1 minute takes about 500K bytes
  • 58. z References  https://www.youtube.com/watch?v=GiyMGBuu45w&t=3s  https://www.youtube.com/watch?v=iGmHnICXDss&t=12s  https://www.slideshare.net/HansiThenuwara/natural-language-processing-64271235  https://www.tutorialspoint.com/natural_language_processing/natural_language_processing_syntactic_analysis. htm  https://www.tutorialspoint.com/natural_language_processing/natural_language_processing_syntactic_analysis. htm  https://www.youtube.com/watch?v=DPi5rfTX6mw&t=1050s  https://www.youtube.com/watch?v=SFQ-owZaU_s  https://personal.utdallas.edu/~sanda/courses/NLP/Lecture10.pdf  https://nlp.stanford.edu/projects/project-induction.shtml  https://nlp.stanford.edu/IR-book/html/htmledition/types-of-language-models-1.html

Notas del editor

  1. 2020/11/11
  2. 2020/11/11
  3. 2020/11/11