SlideShare a Scribd company logo
1 of 50
Introduction to Natural Language
Processing
Markus Dickinson
Linguistics
@georgetown.edu
With some modifications
2
Languages and Intelligence
• Languages: Natural (mandarin) and
Artificial (prolog/lisp), and in between?
• Processing of Computer languages is
easy: why?
• Intelligence: Natural? and Artificial (AI).
• What is AI?: behave as humans do!
Whatever that means! Turing Test
• Language related activities: acquisition,
speech,… manifestation of intelligence
3
Languages and Intelligence
• Historically: peaks and valleys.
• Tendency to underestimate the problem
and overestimate the computing power.
• NLP Goes along with AI: Rise and Decline
• Now is a peak, 20 years ago may have
been a valley.
• The internet, small devices may be a
driving force.
4
What is NLP?
• Natural Language Processing (NLP)
– Computers use to process (analyze,
understand, generate) natural language
– A somewhat applied field
• Computational Linguistics (CL)
– Computational aspects of the human
language faculty
– More theoretical
5
Why Study NLP?
• Human language interesting & challenging
– NLP offers insights into language
• Language is the medium of the web
• Interdisciplinary: Linguistics, CS,
psychology, math, EE,
• Help in communication
– With computers (ASR, TTS)-Interfaces-HCI
– With other humans (MT)
• Ambitious yet practical
6
Goals of NLP
• Scientific Goal
–Identify the computational machinery
needed for an agent to exhibit various
forms of linguistic behavior
• Engineering Goal
–Design, implement, and test systems
that process natural languages for
practical applications
• Ups and downs-Historically: more
with the web
7
Applications
• speech processing: get flight information or book
a hotel over the phone, TTS (for the blind)
• information extraction: discover names of people
and events they participate in, from a document
• machine translation: translate a document from
one human language into another
• question answering: find answers to natural
language questions in a text collection or
database
• summarization: generate a short biography of
Noam Chomsky from one or more news articles
• OCR: both print and handwritten.
• More inportant with Web and Mobile devices!
8
General Themes
• Ambiguity of Language
• Language as a formal system
• Rule-based vs. Statistical Methods
• The need for efficiency
• Syntax/Semantics/Data mining
• Multilingual across/languages Support
9
Ambiguity of language
• Phonetic
– [raIt] = write, right, rite (Soundex) ‫سائد‬-‫صائد‬
• Lexical
– can = noun, verb, modal (‫ذهب‬)
• Structural
– I saw the man with the telescope ‫بالنظارة‬ ‫فقط‬ ‫رأيته‬
• Semantic
→dish = physical plate, menu item (‫مشروع‬)
• Referential: The son asked the father to drive him
home
– (‫شعر‬ ‫تصفيف‬ ‫البنت‬ ‫من‬ ‫األم‬ ‫طلبت‬‫ها‬)
→ All of these make NLP difficult
10
Language as a formal system
• We can treat parts of language formally
– Language = a set of acceptable strings in an
alphabet
– Define a model to recognize/generate language
(but recall: language before grammar)
• Works for different levels of language
(phonology, morphology, etc.)
• Can use finite-state automata, context-free
grammars, etc. to represent language
11
Rule-based & Statistical Methods
• Theoretical linguistics captures abstract
properties of language
• NLP can more or less follow theoretical
insights
– Rule-based: model system with linguistic rules
– Statistical: model system with probabilities of
what normally happens: trust what people
usually do/write/say
• Hybrid models combine the two
12
The need for efficiency
• Simply writing down linguistic insights isn’t
sufficient to have a working system
• Programs need to run in real-time, i.e., be
efficient
– There are thousands of grammar rules which might
be applied to a sentence
• Use insights from computer science
– To find the best parse, use chart parsing, a form of
dynamic programming
• Recall: computers are powerful but “never” as
powerful as need be.
13
Preview of Topics
1.Finding Syntactic Patterns in Human
Languages: Lg. as Formal System
2.Meaning from Patterns
3.Patterns from Language in the Large
4.Bridging the Rationalist-Empiricist Divide
5.Applications
6.Conclusion
14
The Problem of Syntactic Analysis
• Assume input sentence S in natural
language L
• Assume you have rules (grammar G) that
describe syntactic regularities (patterns or
structures) found in sentences of L
• Given S & G, find syntactic structure of S
• Such a structure is called a parse tree
15
Example 1
S  NP VP
VP  V NP
VP  V
S
VP
he
NP
V
slept
NP  I
NP  he
V  slept
V  ate
V  drinks
Grammar Parse Tree
S  NS| VS
NS  M K VS  V NP NP
M  N NP  N|NS
K  N
16
Parsing Example 1
• S  NP VP
• VP  V NP
• VP  V
• NP  I
• NP  he
• V  slept
• V  ate
• V  drinks
17
More Complex Sentences
• I can fish.
• I saw the elephant in my pajamas.
• These sentences exhibit ambiguity
• Computers will have to find the acceptable
or most likely meaning(s).
• I can fish. (‫يعلب؟‬ ،‫يستطيع‬)
18
Example 2
S
NP VP
Pronoun V NP
NI can
fish
S
NP VP
Pronoun V NP
NI can
fish
S
NP VP
Pronoun Aux V
I can fish
S
NP VP
Pronoun Aux V
I can fish
19
Example 3
• NP  D Nom
• Nom  Nom RelClause
• Nom  N
• RelClause  RelPro VP
• VP  V NP
• D  the
• D  my
• V  is
• V  hit
• N  dog
• N  boy
• N  brother
• RelPro  who
S
NP VP
NomD
Nom RelClause
RelPro VP
V
who
NP
hit the dog
is my brother
the
N
boy
S
NP VP
NomD
Nom RelClause
RelPro VP
V
who
NP
hit the dog
is my brother
the
N
boy
20
Topics
1.Finding Syntactic Patterns in Human
Languages
2. Meaning from Patterns
3.Patterns from Language in the Large
4.Bridging the Rationalist-Empiricist Divide
5.Applications
6.Conclusion
21
Meaning from a Parse Tree
• I can fish.
• We want to understand
– Who does what?
• the canner is me, the action
is canning, and the thing
canned is fish.
• e.g. Canning(ME, FishStuff)
• This is a logic
representation of meaning
S
NP VP
Pronoun V NP
NI can
fish
S
NP VP
Pronoun V NP
NI can
fish
We can do this by
• associating meanings with lexical items in the tree
• then using rules to figure out what the S as a whole means
22
Meaning from a Parse Tree (Details)
• Let’s augment the
grammar with
feature constraints
• S  NP VP
– <S subj> =<NP>
– <S>=<VP>
• VP V NP
– <VP> = <V>
– <VP obj> =<NP>
S
NP VP
Pronoun V NP
NI can
fish
S
NP VP
Pronoun V NP
NI can
fish
*2:[pred: Canning]
[subj: *1
pred: *2
obj: *3]
*1[sem: ME]
*3[sem: Fish
Stuff]
[pred: *2
obj: *3]
23
Grammar Induction
• Start with a tree bank = collection of parsed
sentences
• Extract grammar rules corresponding to parse
trees, estimating the probability of the grammar
rule based on its frequency
P(Aβ | A) = Count(Aβ) / Count(A)
• You then have a probabilistic grammar, derived
from a corpus of parse trees
• How does this grammar compare to grammars
created by human intuition?
• How do you get the corpus?
24
Finite-State Analysis
A finite-state machine for
recognizing NPs:
 initial=0; final ={2}
 0->N->2
 0->D->1
 1->N->2
 2->N->2
An equivalent regular
expression for NP’s
/D? N+/
A regular expression for recognizing simple sentences
/(Prep D? A* N+)* (D? N) (Prep D? A* N+)* (V_tns|Aux V_ing)
(Prep D? A* N+)*/
We can also “cheat” a bit in our linguistic analysis
25
Topics
1.Finding Syntactic Patterns in Human
Languages
2.Meaning from Patterns
3.Patterns from Language in the Large
4.Bridging the Rationalist-Empiricist Divide
5.Applications
6.Conclusion
26
Empirical Approaches to NLP
• Empiricism: knowledge is derived from experience
• Rationalism: knowledge is derived from reason
• NLP is, by necessity, focused on ‘performance’, in that
naturally-occurring linguistic data has to be processed
– Have to process data characterized by false starts, hesitations,
elliptical sentences, long and complex sentences, input in a
complex format, etc.
• The methodology used is corpus-based
– linguistic analysis (phonological, morphological, syntactic,
semantic, etc.) carried out on a fairly large scale
– rules are derived by humans or machines from looking at
phenomena in situ (with statistics playing an important role)
27
Which Words are the Most
Frequent?
Common Words in Tom
Sawyer (71,730 words),
from
Manning & Schutze p.21
• Will these counts hold in a different corpus
(and genre, cf. Tom)?
• What happens if you have 8-9M words?
28
Data Sparseness
• Many low-frequency
words
• Fewer high-frequency
words.
• Only a few words will
have lots of examples.
• About 50% of word types
occur only once
• Over 90% occur 10 times
or less.
Word Frequency Number of words
of that frequency
1 3993
2 1292
3 664
4 410
5 243
6 199
7 172
8 131
9 82
10 91
11-50 540
51-100 99
>100 102
Frequency of word types in
Tom Sawyer, from M&S 22.
29
Zipf’s Law: Frequency is inversely
proportional to rank
Word Freq f Rank r f.r
the 3332 1 3332
and 2972 2 5944
a 1775 3 5325
he 877 10 8770
but 410 20 8200
be 294 30 8820
there 222 40 8880
one 172 50 8600
about 158 60 9480
more 138 70 9660
never 124 80 9920
oh 116 90 10440
two 104 100 10400
turned 51 200 10200
you’ll 30 300 9000
name 21 400 8400
comes 16 500 8000
group 13 600 7800
lead 11 700 7700
friends 10 800 8000
begin 9 900 8100
family 8 1000 8000
brushed 4 2000 8000
sins 2 3000 6000
could 2 4000 8000
applausive 1 8000 8000
Empirical evaluation of Zipf’s Law
on Tom Sawyer, from M&S 23.
30
Illustration of Zipf’s Law
logarithmic
scale
(Brown Corpus, from M&S p. 30)
31
Empiricism: Part-of-Speech Tagging
• Word statistics are only so useful
• We want to be able to deduce linguistic
properties of the text
• Part-of-speech (POS) Tagging = assigning
a POS (lexical category) to every word in a
text
– Words can be ambiguous
– What is the best way to disambiguate?
32
Part-of-Speech Disambiguation
Secretariat/NNP is/VBZ
expected/VBN to/TO
race/VB tomorrow/NN
The/DT reason/NN for/IN
the/DT race/NN for/IN
outer/JJ space/NN is
…
• Given a sentence
W1…Wn and a tagset of
lexical categories, find the
most likely tag C1..Cn for
each word in the
sentence
• Tagset – e.g., Penn
Treebank (45 tags)
• Note that many of the
words may have
unambiguous tags
• The tagger also has to
deal with unknown words
Penn Tree Bank Tagset
A Statistical Method for POS Tagging
MD NN VB PRP
he 0 0 0 .3
will .8 .2 0 0
race 0 .4 .6 0
lexical generation probs
he|PRP
0.3
will|MD
0.8
race|NN
0.4
race|VB
0.6
will|NN
0.2
.4
.6.3
.7
.8
.2
<s>| 1
C|R MD NN VB PRP
MD .4 .6
NN .3 .7
PRP .8 .2
 1
POS bigram probs
Find the value of C1..Cn which maximizes:
i=1, n P(Wi| Ci) * P(Ci| Ci-1)
lexical generation
probabilities
POS bigram
probabilities
Chomsky’s Critique of Corpus-Based Methods
1. Corpora model performance, while linguistics is aimed at the
explanation of competence
If you define linguistics that way, linguistic theories will never be able
to deal with actual, messy data
2. Natural language is in principle infinite, whereas corpora are finite, so
many examples will be missed
Excellent point, which needs to be understood by anyone working with a
corpus.
But does that mean corpora are useless?
 Introspection is unreliable (prone to performance factors, cf. only
short sentences), and pretty useless with child data.
 Insights from a corpus might lead to generalization/induction
beyond the corpus– if the corpus is a good sample of the “text
population”
3. Ungrammatical examples won’t be available in a corpus
Depends on the corpus, e.g., spontaneous speech, language learners, etc.
The notion of grammaticality is not that clear
- Who did you see [pictures/?a picture/??his picture/*John’s
picture] of?
36
Topics
1.Finding Syntactic Patterns in Human
Languages
2.Meaning from Patterns
3.Patterns from Language in the Large
4.Bridging the Rationalist-Empiricist
Divide
5.Applications
6.Conclusion
37
The Annotation of Data
• If we want to learn linguistic properties
from data, we need to annotate the data
– Train on annotated data
– Test methods on other annotated data
• Through the annotation of corpora, we
encode linguistic information in a
computer-usable way.
An Annotation Tool
39
Knowledge Discovery Methodology
Raw
Corpus
Annotated
Corpus
Initial
Tagger
Annotation
Editor
Annotation
Guidelines
Machine
Learning
Program
Raw
Corpus
Learned
Rules
Annotated
Corpus
Rule
Apply
Knowledge
Base?
40
Topics
1.Finding Syntactic Patterns in Human
Languages
2.Meaning from Patterns
3.Patterns from Language in the Large
4.Bridging the Rationalist-Empiricist Divide
5. Applications
6.Conclusion
41
Application #1: Machine Translation
• Using different techniques for linguistic
analysis, we can:
– Parse the contents of one language
– Generate another language consisting of the
same content
– If we have an intermediate language then we
may be able to translate between pairs
– Success modest: check Google translate!
Machine Translation on the Web
http://complingone.georgetown.edu/~linguist/GU-CLI/GU-CLI-home.html
 ‫اصيب‬ ‫من‬ ‫عليهما‬ ‫االعتداء‬ ‫جراء‬ ‫لحم‬ ‫بيت‬ ‫من‬ ‫الجنوب‬ ‫الى‬ ‫المعصرة‬ ‫قرية‬ ‫في‬ ‫وجروح‬ ‫برضوض‬ ‫متظاهران‬
‫قوات‬ ‫قبل‬ ‫من‬ ‫عليهما‬ ‫االعتداء‬ ‫جراء‬ ‫لحم‬ ‫بيت‬ ‫من‬ ‫الجنوب‬ ‫الى‬ ‫المعصرة‬ ‫قرية‬ ‫في‬ ‫وجروح‬ ‫برضوض‬ ‫تظاهران‬
‫العنصري‬ ‫الفصل‬ ‫جدار‬ ‫ضد‬ ‫تنظم‬ ‫التي‬ ‫االسبوعية‬ ‫المسيرة‬ ‫خالل‬ ‫االسرائيلي‬ ‫.االحتالل‬ ‫امام‬ ‫من‬ ‫المسيرة‬ ‫طلقت‬
‫رفع‬ ‫ذلك‬ ‫وخالل‬ ‫واالجانب‬ ‫العرب‬ ‫المتظاهرين‬ ‫من‬ ‫العشرات‬ ‫بمشاركة‬ ‫شوارعها‬ ‫لتجوب‬ ‫الثانوية‬ ‫القرية‬ ‫مدرسة‬
‫الشعب‬ ‫بحق‬ ‫المتواصلة‬ ‫االسرائيلية‬ ‫والممارسات‬ ‫باالجراءات‬ ‫المنددة‬ ‫والالفتات‬ ‫الفلسطينية‬ ‫االعالم‬ ‫المتظاهرون‬
‫الفلسطيني‬ ‫الشعب‬ ‫لدى‬ ‫االسرى‬ ‫قضية‬ ‫محورية‬ ‫الى‬ ‫اشارة‬ ‫في‬ ‫االسرى‬ ‫من‬ ‫عدد‬ ‫صور‬ ‫المتظاهرون‬ ‫رفع‬ ‫كما‬ ، ‫ي‬
‫االسرائيلي‬ ‫االحتالل‬ ‫جنود‬ ‫من‬ ‫العشرات‬ ‫واجهها‬ ‫العنصري‬ ‫الفصل‬ ‫جدار‬ ‫اقامة‬ ‫مشارف‬ ‫الى‬ ‫المسيرة‬ ‫وصول‬ ‫دى‬
 Injured demonstrators sustained bruises and cuts to the mill in the villag
south of Bethlehem in the attack on them by the Israeli occupation force
during the weekly march organized against the Apartheid Wall.
 The march began in front of the village school secondary to roam the str
with dozens of demonstrators Arabs and foreigners, and during that
demonstrators raised Palestinian flags and banners condemning the actio
and practices of Israel's continued right of the Palestinian people, as the
protesters portrayed a number of prisoners in reference to the central i
of the prisoners to the Palestinian people and the arrival of the march to
outskirts of the establishment of the apartheid wall and faced dozens of
Israeli soldiers
If languages were all very similar….
… then MT would be easier
 Dialects
- http://rinkworks.com/dialect/
 Spanish to Portuguese….
 Spanish to French
 English to Japanese
 …………..
MT Approaches
Interlingua
Semantics Semantics
Syntax Syntax
Morphology MorphologyDirect
Syntactic
Transfer
Semantic
Transfer
MT Using Parallel Treebanks
S
NP VP
NomD
Nom RelClause
RelPro VP
V
who
NP
hit the dog
is my brother
the
N
boy
S
NP VP
NomD
Nom RelClause
RelPro VP
V
who
NP
hit the dog
is my brother
the
N
boy
S
NP VP
Nom
RelClause Nom
VP
V
butta
otootoda
N
otokonokowa
NP
inuo
S
NP VP
Nom
RelClause Nom
VP
V
butta
otootoda
N
otokonokowa
NP
inuo
Application #2: Understanding a Simple Narrative
(Question Answering)
Yesterday Holly was running a marathon
when she twisted her ankle. David had
pushed her.
1. When did the running occur?
Yesterday.
2. When did the twisting occur?
Yesterday, during the running.
3. Did the pushing occur before the twisting?
Yes.
4. Did Holly keep running after twisting her ankle?
Maybe not????
47
Question Answering by Computer
(Temporal Questions)
Yesterday Holly was running a marathon when
she twisted her ankle. David had pushed her.
09042005 09052005
run
twist
ankle
duringfinishes
or
during before
push
before
during
1. When did the running occur?
Yesterday.
2. When did the twisting occur?
Yesterday, during the running.
3. Did the pushing occur before the twisting?
Yes.
4. Did Holly keep running after twisting her ankle?
Maybe not????
Application #3: Information Extraction
KEY:
Trigger word tagging
Named Entity tagging
Chunk parsing: NGs, VGs,
preps, conjunctions
Bridgestone Sports Co. said
Friday it has set up a joint
venture in Taiwan with a local
concern and a Japanese
trading house to produce golf
clubs to be shipped to Japan.
CompanyNG Set-UPVG Joint-
VentureNG with CompanyNG
ProduceVG ProductNG
The joint venture, Bridgestone
Sports Taiwan Cp., capitalized
at 20 million new Taiwan
dollars, will start production in
January 1990 with production
of 20,000 iron and “metal
wood” clubs a month.
Information Extraction: Filling Templates
Activity:
Type: PRODUCTION
Company:
Product: golf clubs
Start-date:
Bridgestone Sports Co. said
Friday it has set up a joint
venture in Taiwan with a local
concern and a Japanese trading
house to produce golf clubs to
be shipped to Japan.
Activity:
Type: PRODUCTION
Company: Bridgestone Sports
Taiwan Co
Product: iron and “metal wood”
clubs
Start-date: DURING 1990
The joint venture, Bridgestone
Sports Taiwan Cp., capitalized
at 20 million new Taiwan
dollars, will start production in
January 1990 with production
of 20,000 iron and “metal
wood” clubs a month.
50
Conclusion
• NLP programs can carry out a number of
very interesting tasks
– Part-of-speech disambiguation
– Parsing
– Information extraction
– Machine Translation
– Question Answering
• These programs have impacts on the way
we communicate
• These capabilities also have important
implications for cognitive science

More Related Content

What's hot

Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...Databricks
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.netwww.myassignmenthelp.net
 
Natural language procssing
Natural language procssing Natural language procssing
Natural language procssing Rajnish Raj
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
Natural Language Processing seminar review
Natural Language Processing seminar review Natural Language Processing seminar review
Natural Language Processing seminar review Jayneel Vora
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processingrohitnayak
 
NLP using transformers
NLP using transformers NLP using transformers
NLP using transformers Arvind Devaraj
 
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)VenkateshMurugadas
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingTed Xiao
 
Introduction to natural language processing, history and origin
Introduction to natural language processing, history and originIntroduction to natural language processing, history and origin
Introduction to natural language processing, history and originShubhankar Mohan
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Glove global vectors for word representation
Glove global vectors for word representationGlove global vectors for word representation
Glove global vectors for word representationhyunyoung Lee
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Alia Hamwi
 
Natural Language Processing (NLP) - Introduction
Natural Language Processing (NLP) - IntroductionNatural Language Processing (NLP) - Introduction
Natural Language Processing (NLP) - IntroductionAritra Mukherjee
 
Natural language processing
Natural language processingNatural language processing
Natural language processingKarenVacca
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRUananth
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
 

What's hot (20)

Language models
Language modelsLanguage models
Language models
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...
Building a Pipeline for State-of-the-Art Natural Language Processing Using Hu...
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
Natural language procssing
Natural language procssing Natural language procssing
Natural language procssing
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
Natural Language Processing seminar review
Natural Language Processing seminar review Natural Language Processing seminar review
Natural Language Processing seminar review
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
NLP using transformers
NLP using transformers NLP using transformers
NLP using transformers
 
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
Introduction to natural language processing, history and origin
Introduction to natural language processing, history and originIntroduction to natural language processing, history and origin
Introduction to natural language processing, history and origin
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Glove global vectors for word representation
Glove global vectors for word representationGlove global vectors for word representation
Glove global vectors for word representation
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing (NLP) - Introduction
Natural Language Processing (NLP) - IntroductionNatural Language Processing (NLP) - Introduction
Natural Language Processing (NLP) - Introduction
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
 

Viewers also liked

Jarrar: Introduction to Information Retrieval
Jarrar: Introduction to Information RetrievalJarrar: Introduction to Information Retrieval
Jarrar: Introduction to Information RetrievalMustafa Jarrar
 
Introduction to Artificial Intelligence
Introduction to Artificial Intelligence Introduction to Artificial Intelligence
Introduction to Artificial Intelligence Mustafa Jarrar
 
Jarrar: Un-informed Search
Jarrar: Un-informed SearchJarrar: Un-informed Search
Jarrar: Un-informed SearchMustafa Jarrar
 
Jarrar: Informed Search
Jarrar: Informed Search  Jarrar: Informed Search
Jarrar: Informed Search Mustafa Jarrar
 
Jarrar: Description Logic
Jarrar: Description LogicJarrar: Description Logic
Jarrar: Description LogicMustafa Jarrar
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingMustafa Jarrar
 
Dependency Parsing
Dependency ParsingDependency Parsing
Dependency ParsingJinho Choi
 
Information retreival, By Hadi Mohammadzadeh
Information retreival, By Hadi MohammadzadehInformation retreival, By Hadi Mohammadzadeh
Information retreival, By Hadi MohammadzadehHadi Mohammadzadeh
 
Всеволод Демкин "Natural language processing на практике"
Всеволод Демкин "Natural language processing на практике"Всеволод Демкин "Natural language processing на практике"
Всеволод Демкин "Natural language processing на практике"GeeksLab Odessa
 
Business Process Implementation
Business Process ImplementationBusiness Process Implementation
Business Process ImplementationMustafa Jarrar
 
Natural Language Processing Crash Course
Natural Language Processing Crash CourseNatural Language Processing Crash Course
Natural Language Processing Crash CourseCharlie Greenbacker
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Daniel Adenew
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Vsevolod Dyomkin
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: ParsingRushdi Shams
 
Jarrar: Propositional Logic Inference Methods
Jarrar: Propositional Logic Inference MethodsJarrar: Propositional Logic Inference Methods
Jarrar: Propositional Logic Inference MethodsMustafa Jarrar
 
Parsing Natural Scenes and Natural Language with Recursive Neural Networks
Parsing Natural Scenes and Natural Language with Recursive Neural NetworksParsing Natural Scenes and Natural Language with Recursive Neural Networks
Parsing Natural Scenes and Natural Language with Recursive Neural Networksjie cao
 
Jarrar: Probabilistic Language Modeling - Introduction to N-grams
Jarrar: Probabilistic Language Modeling - Introduction to N-gramsJarrar: Probabilistic Language Modeling - Introduction to N-grams
Jarrar: Probabilistic Language Modeling - Introduction to N-gramsMustafa Jarrar
 
Habash: Arabic Natural Language Processing
Habash: Arabic Natural Language ProcessingHabash: Arabic Natural Language Processing
Habash: Arabic Natural Language ProcessingMustafa Jarrar
 
Natural Language Processing in Practice
Natural Language Processing in PracticeNatural Language Processing in Practice
Natural Language Processing in PracticeVsevolod Dyomkin
 

Viewers also liked (20)

Jarrar: Introduction to Information Retrieval
Jarrar: Introduction to Information RetrievalJarrar: Introduction to Information Retrieval
Jarrar: Introduction to Information Retrieval
 
Introduction to Artificial Intelligence
Introduction to Artificial Intelligence Introduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
Jarrar: Un-informed Search
Jarrar: Un-informed SearchJarrar: Un-informed Search
Jarrar: Un-informed Search
 
Jarrar: Informed Search
Jarrar: Informed Search  Jarrar: Informed Search
Jarrar: Informed Search
 
Jarrar: Description Logic
Jarrar: Description LogicJarrar: Description Logic
Jarrar: Description Logic
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language Processing
 
Dependency Parsing
Dependency ParsingDependency Parsing
Dependency Parsing
 
Information retreival, By Hadi Mohammadzadeh
Information retreival, By Hadi MohammadzadehInformation retreival, By Hadi Mohammadzadeh
Information retreival, By Hadi Mohammadzadeh
 
Всеволод Демкин "Natural language processing на практике"
Всеволод Демкин "Natural language processing на практике"Всеволод Демкин "Natural language processing на практике"
Всеволод Демкин "Natural language processing на практике"
 
Business Process Implementation
Business Process ImplementationBusiness Process Implementation
Business Process Implementation
 
Jarrar: Games
Jarrar: GamesJarrar: Games
Jarrar: Games
 
Natural Language Processing Crash Course
Natural Language Processing Crash CourseNatural Language Processing Crash Course
Natural Language Processing Crash Course
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Jarrar: Propositional Logic Inference Methods
Jarrar: Propositional Logic Inference MethodsJarrar: Propositional Logic Inference Methods
Jarrar: Propositional Logic Inference Methods
 
Parsing Natural Scenes and Natural Language with Recursive Neural Networks
Parsing Natural Scenes and Natural Language with Recursive Neural NetworksParsing Natural Scenes and Natural Language with Recursive Neural Networks
Parsing Natural Scenes and Natural Language with Recursive Neural Networks
 
Jarrar: Probabilistic Language Modeling - Introduction to N-grams
Jarrar: Probabilistic Language Modeling - Introduction to N-gramsJarrar: Probabilistic Language Modeling - Introduction to N-grams
Jarrar: Probabilistic Language Modeling - Introduction to N-grams
 
Habash: Arabic Natural Language Processing
Habash: Arabic Natural Language ProcessingHabash: Arabic Natural Language Processing
Habash: Arabic Natural Language Processing
 
Natural Language Processing in Practice
Natural Language Processing in PracticeNatural Language Processing in Practice
Natural Language Processing in Practice
 

Similar to Adnan: Introduction to Natural Language Processing

Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
textprocessingboth.pptx
textprocessingboth.pptxtextprocessingboth.pptx
textprocessingboth.pptxbdiot
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptOlusolaTop
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdfSoha82
 
Intro 2 document
Intro 2 documentIntro 2 document
Intro 2 documentUma Kant
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA DATASCIENCE
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsshrey bhate
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Saurabh Kaushik
 

Similar to Adnan: Introduction to Natural Language Processing (20)

Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
textprocessingboth.pptx
textprocessingboth.pptxtextprocessingboth.pptx
textprocessingboth.pptx
 
NLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.pptNLP introduced and in 47 slides Lecture 1.ppt
NLP introduced and in 47 slides Lecture 1.ppt
 
Intro
IntroIntro
Intro
 
Intro
IntroIntro
Intro
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
NLP_guest_lecture.pdf
NLP_guest_lecture.pdfNLP_guest_lecture.pdf
NLP_guest_lecture.pdf
 
Intro 2 document
Intro 2 documentIntro 2 document
Intro 2 document
 
NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 
Introduction to linguistics
Introduction to linguisticsIntroduction to linguistics
Introduction to linguistics
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
1 Introduction.ppt
1 Introduction.ppt1 Introduction.ppt
1 Introduction.ppt
 
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1Engineering Intelligent NLP Applications Using Deep Learning – Part 1
Engineering Intelligent NLP Applications Using Deep Learning – Part 1
 
Nlp
NlpNlp
Nlp
 
NLP
NLPNLP
NLP
 

More from Mustafa Jarrar

Clustering Arabic Tweets for Sentiment Analysis
Clustering Arabic Tweets for Sentiment AnalysisClustering Arabic Tweets for Sentiment Analysis
Clustering Arabic Tweets for Sentiment AnalysisMustafa Jarrar
 
Classifying Processes and Basic Formal Ontology
Classifying Processes  and Basic Formal OntologyClassifying Processes  and Basic Formal Ontology
Classifying Processes and Basic Formal OntologyMustafa Jarrar
 
Discrete Mathematics Course Outline
Discrete Mathematics Course OutlineDiscrete Mathematics Course Outline
Discrete Mathematics Course OutlineMustafa Jarrar
 
Business Process Design and Re-engineering
Business Process Design and Re-engineeringBusiness Process Design and Re-engineering
Business Process Design and Re-engineeringMustafa Jarrar
 
BPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical ConstructsBPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical ConstructsMustafa Jarrar
 
BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs  BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs Mustafa Jarrar
 
Introduction to Business Process Management
Introduction to Business Process ManagementIntroduction to Business Process Management
Introduction to Business Process ManagementMustafa Jarrar
 
Customer Complaint Ontology
Customer Complaint Ontology Customer Complaint Ontology
Customer Complaint Ontology Mustafa Jarrar
 
Subset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion RulesSubset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion RulesMustafa Jarrar
 
Schema Modularization in ORM
Schema Modularization in ORMSchema Modularization in ORM
Schema Modularization in ORMMustafa Jarrar
 
On Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in PalestineOn Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in PalestineMustafa Jarrar
 
Lessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online CoursesLessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online CoursesMustafa Jarrar
 
Presentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-finalPresentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-finalMustafa Jarrar
 
Jarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 CallsJarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 CallsMustafa Jarrar
 
Riestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 ProposalsRiestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 ProposalsMustafa Jarrar
 
Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020Mustafa Jarrar
 
Jarrar: Sparql Project
Jarrar: Sparql ProjectJarrar: Sparql Project
Jarrar: Sparql ProjectMustafa Jarrar
 
Jarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology EngineeringJarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology EngineeringMustafa Jarrar
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesMustafa Jarrar
 
Jarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean MethodologyJarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean MethodologyMustafa Jarrar
 

More from Mustafa Jarrar (20)

Clustering Arabic Tweets for Sentiment Analysis
Clustering Arabic Tweets for Sentiment AnalysisClustering Arabic Tweets for Sentiment Analysis
Clustering Arabic Tweets for Sentiment Analysis
 
Classifying Processes and Basic Formal Ontology
Classifying Processes  and Basic Formal OntologyClassifying Processes  and Basic Formal Ontology
Classifying Processes and Basic Formal Ontology
 
Discrete Mathematics Course Outline
Discrete Mathematics Course OutlineDiscrete Mathematics Course Outline
Discrete Mathematics Course Outline
 
Business Process Design and Re-engineering
Business Process Design and Re-engineeringBusiness Process Design and Re-engineering
Business Process Design and Re-engineering
 
BPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical ConstructsBPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical Constructs
 
BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs  BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs
 
Introduction to Business Process Management
Introduction to Business Process ManagementIntroduction to Business Process Management
Introduction to Business Process Management
 
Customer Complaint Ontology
Customer Complaint Ontology Customer Complaint Ontology
Customer Complaint Ontology
 
Subset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion RulesSubset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion Rules
 
Schema Modularization in ORM
Schema Modularization in ORMSchema Modularization in ORM
Schema Modularization in ORM
 
On Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in PalestineOn Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in Palestine
 
Lessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online CoursesLessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online Courses
 
Presentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-finalPresentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-final
 
Jarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 CallsJarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 Calls
 
Riestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 ProposalsRiestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 Proposals
 
Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020
 
Jarrar: Sparql Project
Jarrar: Sparql ProjectJarrar: Sparql Project
Jarrar: Sparql Project
 
Jarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology EngineeringJarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology Engineering
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing Ontologies
 
Jarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean MethodologyJarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean Methodology
 

Recently uploaded

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Recently uploaded (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Adnan: Introduction to Natural Language Processing

  • 1. Introduction to Natural Language Processing Markus Dickinson Linguistics @georgetown.edu With some modifications
  • 2. 2 Languages and Intelligence • Languages: Natural (mandarin) and Artificial (prolog/lisp), and in between? • Processing of Computer languages is easy: why? • Intelligence: Natural? and Artificial (AI). • What is AI?: behave as humans do! Whatever that means! Turing Test • Language related activities: acquisition, speech,… manifestation of intelligence
  • 3. 3 Languages and Intelligence • Historically: peaks and valleys. • Tendency to underestimate the problem and overestimate the computing power. • NLP Goes along with AI: Rise and Decline • Now is a peak, 20 years ago may have been a valley. • The internet, small devices may be a driving force.
  • 4. 4 What is NLP? • Natural Language Processing (NLP) – Computers use to process (analyze, understand, generate) natural language – A somewhat applied field • Computational Linguistics (CL) – Computational aspects of the human language faculty – More theoretical
  • 5. 5 Why Study NLP? • Human language interesting & challenging – NLP offers insights into language • Language is the medium of the web • Interdisciplinary: Linguistics, CS, psychology, math, EE, • Help in communication – With computers (ASR, TTS)-Interfaces-HCI – With other humans (MT) • Ambitious yet practical
  • 6. 6 Goals of NLP • Scientific Goal –Identify the computational machinery needed for an agent to exhibit various forms of linguistic behavior • Engineering Goal –Design, implement, and test systems that process natural languages for practical applications • Ups and downs-Historically: more with the web
  • 7. 7 Applications • speech processing: get flight information or book a hotel over the phone, TTS (for the blind) • information extraction: discover names of people and events they participate in, from a document • machine translation: translate a document from one human language into another • question answering: find answers to natural language questions in a text collection or database • summarization: generate a short biography of Noam Chomsky from one or more news articles • OCR: both print and handwritten. • More inportant with Web and Mobile devices!
  • 8. 8 General Themes • Ambiguity of Language • Language as a formal system • Rule-based vs. Statistical Methods • The need for efficiency • Syntax/Semantics/Data mining • Multilingual across/languages Support
  • 9. 9 Ambiguity of language • Phonetic – [raIt] = write, right, rite (Soundex) ‫سائد‬-‫صائد‬ • Lexical – can = noun, verb, modal (‫ذهب‬) • Structural – I saw the man with the telescope ‫بالنظارة‬ ‫فقط‬ ‫رأيته‬ • Semantic →dish = physical plate, menu item (‫مشروع‬) • Referential: The son asked the father to drive him home – (‫شعر‬ ‫تصفيف‬ ‫البنت‬ ‫من‬ ‫األم‬ ‫طلبت‬‫ها‬) → All of these make NLP difficult
  • 10. 10 Language as a formal system • We can treat parts of language formally – Language = a set of acceptable strings in an alphabet – Define a model to recognize/generate language (but recall: language before grammar) • Works for different levels of language (phonology, morphology, etc.) • Can use finite-state automata, context-free grammars, etc. to represent language
  • 11. 11 Rule-based & Statistical Methods • Theoretical linguistics captures abstract properties of language • NLP can more or less follow theoretical insights – Rule-based: model system with linguistic rules – Statistical: model system with probabilities of what normally happens: trust what people usually do/write/say • Hybrid models combine the two
  • 12. 12 The need for efficiency • Simply writing down linguistic insights isn’t sufficient to have a working system • Programs need to run in real-time, i.e., be efficient – There are thousands of grammar rules which might be applied to a sentence • Use insights from computer science – To find the best parse, use chart parsing, a form of dynamic programming • Recall: computers are powerful but “never” as powerful as need be.
  • 13. 13 Preview of Topics 1.Finding Syntactic Patterns in Human Languages: Lg. as Formal System 2.Meaning from Patterns 3.Patterns from Language in the Large 4.Bridging the Rationalist-Empiricist Divide 5.Applications 6.Conclusion
  • 14. 14 The Problem of Syntactic Analysis • Assume input sentence S in natural language L • Assume you have rules (grammar G) that describe syntactic regularities (patterns or structures) found in sentences of L • Given S & G, find syntactic structure of S • Such a structure is called a parse tree
  • 15. 15 Example 1 S  NP VP VP  V NP VP  V S VP he NP V slept NP  I NP  he V  slept V  ate V  drinks Grammar Parse Tree S  NS| VS NS  M K VS  V NP NP M  N NP  N|NS K  N
  • 16. 16 Parsing Example 1 • S  NP VP • VP  V NP • VP  V • NP  I • NP  he • V  slept • V  ate • V  drinks
  • 17. 17 More Complex Sentences • I can fish. • I saw the elephant in my pajamas. • These sentences exhibit ambiguity • Computers will have to find the acceptable or most likely meaning(s). • I can fish. (‫يعلب؟‬ ،‫يستطيع‬)
  • 18. 18 Example 2 S NP VP Pronoun V NP NI can fish S NP VP Pronoun V NP NI can fish S NP VP Pronoun Aux V I can fish S NP VP Pronoun Aux V I can fish
  • 19. 19 Example 3 • NP  D Nom • Nom  Nom RelClause • Nom  N • RelClause  RelPro VP • VP  V NP • D  the • D  my • V  is • V  hit • N  dog • N  boy • N  brother • RelPro  who S NP VP NomD Nom RelClause RelPro VP V who NP hit the dog is my brother the N boy S NP VP NomD Nom RelClause RelPro VP V who NP hit the dog is my brother the N boy
  • 20. 20 Topics 1.Finding Syntactic Patterns in Human Languages 2. Meaning from Patterns 3.Patterns from Language in the Large 4.Bridging the Rationalist-Empiricist Divide 5.Applications 6.Conclusion
  • 21. 21 Meaning from a Parse Tree • I can fish. • We want to understand – Who does what? • the canner is me, the action is canning, and the thing canned is fish. • e.g. Canning(ME, FishStuff) • This is a logic representation of meaning S NP VP Pronoun V NP NI can fish S NP VP Pronoun V NP NI can fish We can do this by • associating meanings with lexical items in the tree • then using rules to figure out what the S as a whole means
  • 22. 22 Meaning from a Parse Tree (Details) • Let’s augment the grammar with feature constraints • S  NP VP – <S subj> =<NP> – <S>=<VP> • VP V NP – <VP> = <V> – <VP obj> =<NP> S NP VP Pronoun V NP NI can fish S NP VP Pronoun V NP NI can fish *2:[pred: Canning] [subj: *1 pred: *2 obj: *3] *1[sem: ME] *3[sem: Fish Stuff] [pred: *2 obj: *3]
  • 23. 23 Grammar Induction • Start with a tree bank = collection of parsed sentences • Extract grammar rules corresponding to parse trees, estimating the probability of the grammar rule based on its frequency P(Aβ | A) = Count(Aβ) / Count(A) • You then have a probabilistic grammar, derived from a corpus of parse trees • How does this grammar compare to grammars created by human intuition? • How do you get the corpus?
  • 24. 24 Finite-State Analysis A finite-state machine for recognizing NPs:  initial=0; final ={2}  0->N->2  0->D->1  1->N->2  2->N->2 An equivalent regular expression for NP’s /D? N+/ A regular expression for recognizing simple sentences /(Prep D? A* N+)* (D? N) (Prep D? A* N+)* (V_tns|Aux V_ing) (Prep D? A* N+)*/ We can also “cheat” a bit in our linguistic analysis
  • 25. 25 Topics 1.Finding Syntactic Patterns in Human Languages 2.Meaning from Patterns 3.Patterns from Language in the Large 4.Bridging the Rationalist-Empiricist Divide 5.Applications 6.Conclusion
  • 26. 26 Empirical Approaches to NLP • Empiricism: knowledge is derived from experience • Rationalism: knowledge is derived from reason • NLP is, by necessity, focused on ‘performance’, in that naturally-occurring linguistic data has to be processed – Have to process data characterized by false starts, hesitations, elliptical sentences, long and complex sentences, input in a complex format, etc. • The methodology used is corpus-based – linguistic analysis (phonological, morphological, syntactic, semantic, etc.) carried out on a fairly large scale – rules are derived by humans or machines from looking at phenomena in situ (with statistics playing an important role)
  • 27. 27 Which Words are the Most Frequent? Common Words in Tom Sawyer (71,730 words), from Manning & Schutze p.21 • Will these counts hold in a different corpus (and genre, cf. Tom)? • What happens if you have 8-9M words?
  • 28. 28 Data Sparseness • Many low-frequency words • Fewer high-frequency words. • Only a few words will have lots of examples. • About 50% of word types occur only once • Over 90% occur 10 times or less. Word Frequency Number of words of that frequency 1 3993 2 1292 3 664 4 410 5 243 6 199 7 172 8 131 9 82 10 91 11-50 540 51-100 99 >100 102 Frequency of word types in Tom Sawyer, from M&S 22.
  • 29. 29 Zipf’s Law: Frequency is inversely proportional to rank Word Freq f Rank r f.r the 3332 1 3332 and 2972 2 5944 a 1775 3 5325 he 877 10 8770 but 410 20 8200 be 294 30 8820 there 222 40 8880 one 172 50 8600 about 158 60 9480 more 138 70 9660 never 124 80 9920 oh 116 90 10440 two 104 100 10400 turned 51 200 10200 you’ll 30 300 9000 name 21 400 8400 comes 16 500 8000 group 13 600 7800 lead 11 700 7700 friends 10 800 8000 begin 9 900 8100 family 8 1000 8000 brushed 4 2000 8000 sins 2 3000 6000 could 2 4000 8000 applausive 1 8000 8000 Empirical evaluation of Zipf’s Law on Tom Sawyer, from M&S 23.
  • 30. 30 Illustration of Zipf’s Law logarithmic scale (Brown Corpus, from M&S p. 30)
  • 31. 31 Empiricism: Part-of-Speech Tagging • Word statistics are only so useful • We want to be able to deduce linguistic properties of the text • Part-of-speech (POS) Tagging = assigning a POS (lexical category) to every word in a text – Words can be ambiguous – What is the best way to disambiguate?
  • 32. 32 Part-of-Speech Disambiguation Secretariat/NNP is/VBZ expected/VBN to/TO race/VB tomorrow/NN The/DT reason/NN for/IN the/DT race/NN for/IN outer/JJ space/NN is … • Given a sentence W1…Wn and a tagset of lexical categories, find the most likely tag C1..Cn for each word in the sentence • Tagset – e.g., Penn Treebank (45 tags) • Note that many of the words may have unambiguous tags • The tagger also has to deal with unknown words
  • 33. Penn Tree Bank Tagset
  • 34. A Statistical Method for POS Tagging MD NN VB PRP he 0 0 0 .3 will .8 .2 0 0 race 0 .4 .6 0 lexical generation probs he|PRP 0.3 will|MD 0.8 race|NN 0.4 race|VB 0.6 will|NN 0.2 .4 .6.3 .7 .8 .2 <s>| 1 C|R MD NN VB PRP MD .4 .6 NN .3 .7 PRP .8 .2  1 POS bigram probs Find the value of C1..Cn which maximizes: i=1, n P(Wi| Ci) * P(Ci| Ci-1) lexical generation probabilities POS bigram probabilities
  • 35. Chomsky’s Critique of Corpus-Based Methods 1. Corpora model performance, while linguistics is aimed at the explanation of competence If you define linguistics that way, linguistic theories will never be able to deal with actual, messy data 2. Natural language is in principle infinite, whereas corpora are finite, so many examples will be missed Excellent point, which needs to be understood by anyone working with a corpus. But does that mean corpora are useless?  Introspection is unreliable (prone to performance factors, cf. only short sentences), and pretty useless with child data.  Insights from a corpus might lead to generalization/induction beyond the corpus– if the corpus is a good sample of the “text population” 3. Ungrammatical examples won’t be available in a corpus Depends on the corpus, e.g., spontaneous speech, language learners, etc. The notion of grammaticality is not that clear - Who did you see [pictures/?a picture/??his picture/*John’s picture] of?
  • 36. 36 Topics 1.Finding Syntactic Patterns in Human Languages 2.Meaning from Patterns 3.Patterns from Language in the Large 4.Bridging the Rationalist-Empiricist Divide 5.Applications 6.Conclusion
  • 37. 37 The Annotation of Data • If we want to learn linguistic properties from data, we need to annotate the data – Train on annotated data – Test methods on other annotated data • Through the annotation of corpora, we encode linguistic information in a computer-usable way.
  • 40. 40 Topics 1.Finding Syntactic Patterns in Human Languages 2.Meaning from Patterns 3.Patterns from Language in the Large 4.Bridging the Rationalist-Empiricist Divide 5. Applications 6.Conclusion
  • 41. 41 Application #1: Machine Translation • Using different techniques for linguistic analysis, we can: – Parse the contents of one language – Generate another language consisting of the same content – If we have an intermediate language then we may be able to translate between pairs – Success modest: check Google translate!
  • 42. Machine Translation on the Web http://complingone.georgetown.edu/~linguist/GU-CLI/GU-CLI-home.html  ‫اصيب‬ ‫من‬ ‫عليهما‬ ‫االعتداء‬ ‫جراء‬ ‫لحم‬ ‫بيت‬ ‫من‬ ‫الجنوب‬ ‫الى‬ ‫المعصرة‬ ‫قرية‬ ‫في‬ ‫وجروح‬ ‫برضوض‬ ‫متظاهران‬ ‫قوات‬ ‫قبل‬ ‫من‬ ‫عليهما‬ ‫االعتداء‬ ‫جراء‬ ‫لحم‬ ‫بيت‬ ‫من‬ ‫الجنوب‬ ‫الى‬ ‫المعصرة‬ ‫قرية‬ ‫في‬ ‫وجروح‬ ‫برضوض‬ ‫تظاهران‬ ‫العنصري‬ ‫الفصل‬ ‫جدار‬ ‫ضد‬ ‫تنظم‬ ‫التي‬ ‫االسبوعية‬ ‫المسيرة‬ ‫خالل‬ ‫االسرائيلي‬ ‫.االحتالل‬ ‫امام‬ ‫من‬ ‫المسيرة‬ ‫طلقت‬ ‫رفع‬ ‫ذلك‬ ‫وخالل‬ ‫واالجانب‬ ‫العرب‬ ‫المتظاهرين‬ ‫من‬ ‫العشرات‬ ‫بمشاركة‬ ‫شوارعها‬ ‫لتجوب‬ ‫الثانوية‬ ‫القرية‬ ‫مدرسة‬ ‫الشعب‬ ‫بحق‬ ‫المتواصلة‬ ‫االسرائيلية‬ ‫والممارسات‬ ‫باالجراءات‬ ‫المنددة‬ ‫والالفتات‬ ‫الفلسطينية‬ ‫االعالم‬ ‫المتظاهرون‬ ‫الفلسطيني‬ ‫الشعب‬ ‫لدى‬ ‫االسرى‬ ‫قضية‬ ‫محورية‬ ‫الى‬ ‫اشارة‬ ‫في‬ ‫االسرى‬ ‫من‬ ‫عدد‬ ‫صور‬ ‫المتظاهرون‬ ‫رفع‬ ‫كما‬ ، ‫ي‬ ‫االسرائيلي‬ ‫االحتالل‬ ‫جنود‬ ‫من‬ ‫العشرات‬ ‫واجهها‬ ‫العنصري‬ ‫الفصل‬ ‫جدار‬ ‫اقامة‬ ‫مشارف‬ ‫الى‬ ‫المسيرة‬ ‫وصول‬ ‫دى‬  Injured demonstrators sustained bruises and cuts to the mill in the villag south of Bethlehem in the attack on them by the Israeli occupation force during the weekly march organized against the Apartheid Wall.  The march began in front of the village school secondary to roam the str with dozens of demonstrators Arabs and foreigners, and during that demonstrators raised Palestinian flags and banners condemning the actio and practices of Israel's continued right of the Palestinian people, as the protesters portrayed a number of prisoners in reference to the central i of the prisoners to the Palestinian people and the arrival of the march to outskirts of the establishment of the apartheid wall and faced dozens of Israeli soldiers
  • 43. If languages were all very similar…. … then MT would be easier  Dialects - http://rinkworks.com/dialect/  Spanish to Portuguese….  Spanish to French  English to Japanese  …………..
  • 44. MT Approaches Interlingua Semantics Semantics Syntax Syntax Morphology MorphologyDirect Syntactic Transfer Semantic Transfer
  • 45. MT Using Parallel Treebanks S NP VP NomD Nom RelClause RelPro VP V who NP hit the dog is my brother the N boy S NP VP NomD Nom RelClause RelPro VP V who NP hit the dog is my brother the N boy S NP VP Nom RelClause Nom VP V butta otootoda N otokonokowa NP inuo S NP VP Nom RelClause Nom VP V butta otootoda N otokonokowa NP inuo
  • 46. Application #2: Understanding a Simple Narrative (Question Answering) Yesterday Holly was running a marathon when she twisted her ankle. David had pushed her. 1. When did the running occur? Yesterday. 2. When did the twisting occur? Yesterday, during the running. 3. Did the pushing occur before the twisting? Yes. 4. Did Holly keep running after twisting her ankle? Maybe not????
  • 47. 47 Question Answering by Computer (Temporal Questions) Yesterday Holly was running a marathon when she twisted her ankle. David had pushed her. 09042005 09052005 run twist ankle duringfinishes or during before push before during 1. When did the running occur? Yesterday. 2. When did the twisting occur? Yesterday, during the running. 3. Did the pushing occur before the twisting? Yes. 4. Did Holly keep running after twisting her ankle? Maybe not????
  • 48. Application #3: Information Extraction KEY: Trigger word tagging Named Entity tagging Chunk parsing: NGs, VGs, preps, conjunctions Bridgestone Sports Co. said Friday it has set up a joint venture in Taiwan with a local concern and a Japanese trading house to produce golf clubs to be shipped to Japan. CompanyNG Set-UPVG Joint- VentureNG with CompanyNG ProduceVG ProductNG The joint venture, Bridgestone Sports Taiwan Cp., capitalized at 20 million new Taiwan dollars, will start production in January 1990 with production of 20,000 iron and “metal wood” clubs a month.
  • 49. Information Extraction: Filling Templates Activity: Type: PRODUCTION Company: Product: golf clubs Start-date: Bridgestone Sports Co. said Friday it has set up a joint venture in Taiwan with a local concern and a Japanese trading house to produce golf clubs to be shipped to Japan. Activity: Type: PRODUCTION Company: Bridgestone Sports Taiwan Co Product: iron and “metal wood” clubs Start-date: DURING 1990 The joint venture, Bridgestone Sports Taiwan Cp., capitalized at 20 million new Taiwan dollars, will start production in January 1990 with production of 20,000 iron and “metal wood” clubs a month.
  • 50. 50 Conclusion • NLP programs can carry out a number of very interesting tasks – Part-of-speech disambiguation – Parsing – Information extraction – Machine Translation – Question Answering • These programs have impacts on the way we communicate • These capabilities also have important implications for cognitive science