SlideShare una empresa de Scribd logo
1 de 94
Descargar para leer sin conexión
constructional syntactic analysis for information
                  access tasks

                    jussi karlgren



              yandex, february 16, 2011
jussi karlgren

ph d in (computational) linguistics from stockholm

senior researcher in information access at sics, stockholm

docent in language technology at univ of helsinki

founding partner, gavagai ab, stockholm
• independent non-profit
  research institute
• about 100-200 researchers
• ... networks, distributed
  systems, programming tools,
  collaborative environments,
  information access, design,
  digital art...
• recent startup company
• about 7-8 employees
• extracts actionable
  intelligence from very large
  text streams
why use computational methods and machinery for information
access?

two reasons:
  1   amount of data is overwhelming → reduce data complexity
      let’s call these “simple” tasks

  2   signal is weak and complex → peer closer into data
      let’s call these “difficult” tasks
why use computational methods and machinery for information
access?

two reasons:
  1   amount of data is overwhelming → reduce data complexity
      let’s call these “simple” tasks

  2   signal is weak and complex → peer closer into data
      let’s call these “difficult” tasks
why use computational methods and machinery for information
access?

two reasons:
  1   amount of data is overwhelming → reduce data complexity
      let’s call these “simple” tasks

  2   signal is weak and complex → peer closer into data
      let’s call these “difficult” tasks
why use computational methods and machinery for information
access?

two reasons:
  1   amount of data is overwhelming → reduce data complexity
      let’s call these “simple” tasks

  2   signal is weak and complex → peer closer into data
      let’s call these “difficult” tasks
for the simple tasks the sensible thing to do is to

pound the text into small bits and count the various type of bit.
for the simple tasks the sensible thing to do is to

pound the text into small bits and count the various type of bit.
this works well up to a point.

for search engines.

but not for e.g. authorship attribution.
this works well up to a point.

for search engines.

but not for e.g. authorship attribution.
this works well up to a point.

for search engines.

but not for e.g. authorship attribution.
what is the next sensible thing to do?

try to organise the bits into piles first, generalising them,

or

try to see if the bits have relations to each other, building more
complex structures.
which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
what is the next sensible thing to do?

try to organise the bits into piles first, generalising them,

or

try to see if the bits have relations to each other, building more
complex structures.
which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
what is the next sensible thing to do?

try to organise the bits into piles first, generalising them,

or

try to see if the bits have relations to each other, building more
complex structures.
which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
why is parsing impractical?
  1   new text
  2   categories unfounded in data
  3   dependencies not based on necessity or efficiency
why is parsing impractical?
  1   new text
  2   categories unfounded in data
  3   dependencies not based on necessity or efficiency
why is parsing impractical?
  1   new text
  2   categories unfounded in data
  3   dependencies not based on necessity or efficiency
why is parsing impractical?
  1   new text
  2   categories unfounded in data
  3   dependencies not based on necessity or efficiency
what is in the signal?

“It is this, I think, that commentators mean when they say glibly
that the ‘world changed’ after Sept 11.”

words? or something more?
what is in the signal?

“It is this, I think, that commentators mean when they say glibly
that the ‘world changed’ after Sept 11.”

words? or something more?
linguistics has an answer.

but that answer doesn’t help much in practical applications.
linguistics has an answer.

but that answer doesn’t help much in practical applications.
what is in the signal to begin with?

just words?
and a pattern?
what is in the signal to begin with?

just words?
and a pattern?
what is in the signal to begin with?

just words?
and a pattern?
„ah, but the words are in the clause, the pattern is only expressed
by the words that participate in it.”
so patterns do not exist when not in use?

do words exist outside their usage?

„To ask where a verbal operant is when a response is not in the course of
being emitted is like asking where one’s knee-jerk is when the physician is
not tapping the patellar tendon.”
B.F. Skinner, Verbal Behavior
so patterns do not exist when not in use?

do words exist outside their usage?

„To ask where a verbal operant is when a response is not in the course of
being emitted is like asking where one’s knee-jerk is when the physician is
not tapping the patellar tendon.”
B.F. Skinner, Verbal Behavior
so patterns do not exist when not in use?

do words exist outside their usage?

„To ask where a verbal operant is when a response is not in the course of
being emitted is like asking where one’s knee-jerk is when the physician is
not tapping the patellar tendon.”
B.F. Skinner, Verbal Behavior
we claim that patterns are part of the signal,
- not incidental to it,
- nor secondary to the terms in it.

this appears to be a contentious statement.
we claim that patterns are part of the signal,
- not incidental to it,
- nor secondary to the terms in it.

this appears to be a contentious statement.
radical construction grammar (cxg)


   1   syntax-lexicon continuum
   2   form and function specified in unified model
   3   structurally cohesive
(william croft, 2005)
1. syntax-lexicon continuum
 Construction type                   Examples
 Complex and abstract   syntax       sbj be-tense verb-en by agent
 Complex and concrete   idiom        up-tense the ante
 Complex and bound      morphology   noun-s
 Atomic and abstract    category     adj, clause
 Atomic and concrete    lexicon      this, green


are all equal: constructions are the primitive elements

→ no parts of speech, no syntactic categories necessary
2. form and function specified in unified model

→ no separate syntactic (or semantic) component necessary
3. structurally cohesive

→ everything is constructions and nothing else; everything is
specific and nothing is universal
practically:

the pattern of an utterance is a feature with the same ontological
status as the terms that occur in the utterance.

constructions and lexemes both have conceptual meaning.

constructions or patterns are present even without recourse to the
words in it.
practically:

the pattern of an utterance is a feature with the same ontological
status as the terms that occur in the utterance.

constructions and lexemes both have conceptual meaning.

constructions or patterns are present even without recourse to the
words in it.
practically:

the pattern of an utterance is a feature with the same ontological
status as the terms that occur in the utterance.

constructions and lexemes both have conceptual meaning.

constructions or patterns are present even without recourse to the
words in it.
our claim:

to study pattern occurrences, no coupling between the features and
the words carrying them needs to be done.

this is quite convenient.
which is good.
our claim:

to study pattern occurrences, no coupling between the features and
the words carrying them needs to be done.

this is quite convenient.
which is good.
patterns, in various forms have been used in language technology
for some time:

linguistic string project. (1965-1998)
Naomi Sager et al



leading to

information extraction.
large number of adhoc pattern descriptions, closely based on data as observed in use
patterns, in various forms have been used in language technology
for some time:

linguistic string project. (1965-1998)
Naomi Sager et al



leading to

information extraction.
large number of adhoc pattern descriptions, closely based on data as observed in use
now turn to one example task: identification and analysis of
attitude
attitude analysis can be done on any text source

blogs: unfettered discourse, wom, low publication threshold, no
editorial control

but it’s new text — new processing practice necessary
attitude analysis can be done on any text source

blogs: unfettered discourse, wom, low publication threshold, no
editorial control

but it’s new text — new processing practice necessary
a prototypical attitudinal expression




          Expression    WHO       FEELS WHAT       ABOUT WHAT
 I like sauerkraut      I         like             sauerkraut
   Kissing is nice      ?         nice             kiss
                        someone   sentiment term   topic

is this picture true?
a prototypical attitudinal expression




          Expression    WHO       FEELS WHAT       ABOUT WHAT
 I like sauerkraut      I         like             sauerkraut
   Kissing is nice      ?         nice             kiss
                        someone   sentiment term   topic

is this picture true?
it is this, i think, that commentators mean when they say glibly
that the ‘world changed’ after sept 11.
president hafez al-assad has said that peace was a pressing need for
the region and the world at large and syria, considering peace a
strategic option would take steps towards peace.
mr cohen, beginning an eight-day european tour including a nato
defence ministers’ meeting in brussels today and tomorrow, said he
expected further international action soon, though not necessarily
military intervention.
the designers from house on fire do not like random play.
sauerkraut is damn good but kimchi is even better.
bertram powerboats have a deep v hull and handle well in choppy
sea.
m.a.k. halliday thought it natural to view syntax from a functional
perspective.
our claim:

attitude is not only lexical or lexicon is not only words & terms

                 “He blew me off” vs “He blew off”
  “He has the best result, we cannot fail him” vs “This is the best
                   coffee, we cannot fail with it”
                       “Fifth Avenue”, “9/11”
an experiment
we’ll hand code a number of sample constructions to test our claim
that they might be useful to identify attitudinal expressions.

remember: to study patterns — we do not need to encode explict
linkage to words!
we’ll hand code a number of sample constructions to test our claim
that they might be useful to identify attitudinal expressions.

remember: to study patterns — we do not need to encode explict
linkage to words!
we represent each sentence using three separate sets of features:
           I content words
           F form words
          K constructions
I features




content words – nouns, adjectives, verbs (including verbal uses of
participles), adverbs, abbreviations, numerals, interjections, and
negation
F features




function words – prepositions, determiners, conjunctions, pronouns
K : sentence structure




transitivity, predicate, relative, and object clauses, tense shift within
sentence
K : various adverbials




adverbials of location, time, manner, condition, quantity, clause
adverbial, clause initial adverbials
K : morphology of sentence constituents




present or past tense, adjectives in base, comparative, or superlative
form
K : word dependencies and categories




subordinate conjunctions, negations, prepositional post modifiers,
verb chains, quantifiers, particle verbs, prepositional phrases,
adjective modifiers
“It is this, I think, that commentators mean when they
     say glibly that the ‘world changed’ after Sept 11.”
I   be think commentator mean when say glibly
    world change sept 11
F   it this i that they that the after
K   AdvlTim, AdvlMan, ObjCls, PredCls,
    TRIn, TRtr, TRmix, TnsPres, TnsPast,
    TnsShift
in preliminary experiments with SVM feature selection we find that
several of the K features have high rank for categorisation, notably
TnsShift, TnsPast, TRmix, PredCls.
TnsShift



“Noam Chomsky saidpast that what makes human language unique
ispresent recursive centre embedding”

“M.A.K. Halliday believedpast that grammar, viewed functionally,
ispresent natural”

→ saves us from acquiring and maintaining lists of verbs of
utterance, pronuncement, and cognition.
TnsShift



“Noam Chomsky saidpast that what makes human language unique
ispresent recursive centre embedding”

“M.A.K. Halliday believedpast that grammar, viewed functionally,
ispresent natural”

→ saves us from acquiring and maintaining lists of verbs of
utterance, pronuncement, and cognition.
0.15


                                                               KmpAdj
                    0.10




                                                                   AdjMod
                                     TnsShift
                                                     Neg
                                                               RelCls
                    0.05




                                                                TnsPast
Factor 2 (13.8 %)




                                                                            VChain
                                                                                         k.1
                    0.00




                            ObjCls                                                   k.2
                                       Pos
                                           TRmix                    NoAtt
                                                PredCls
                    -0.05




                                                      SupAdj
                                                Neu
                    -0.10




                                                          AdvlMan                    k
                    -0.15




                                                                            Neg
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
our experiment:
  1   represent sentences using three sets of features: I , F , K .
  2   build a language representation using one year of newsprint:
      test for differences between KT, MD, GH.
  3   test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7,
      MPQA.
  4   put test sentences in word space (random indexing, 2000
      dims) with added feature indicating attitude.
  5   extract feature vector from word space and run through SVM.
  6   test with five-fold crossvalidation.
experimental data:

                     NTCIR 6   NTCIR 7   SEMEVAL   MPQA
       Attitudinal     1 392     1 075        76    6 021
   Non-attitudinal     4 416     3 201       174    4 982
             Total     5 808     4 276       250   11 003
F1


                              NTCIR 6 NTCIR 7         MPQA     SEMEVAL
                     I           46.1       45.2        63.4        42.4
                    F            44.9       47.5        65.4        40.4
                    K            42.3       43.6        63.7        33.8
                   IF            45.9       47.4        67.3        41.4
                   IK            45.9       48.6        67.0        38.6
                  FK             46.1       47.9        68.0        37.5
                 IFK            47.5        48.6        69.2        41.8
     Precision range              approx 40        approx 70   approx 30
        Recall range                         approx 55-65

K features often help and never really hurt.
(karlgren et al, ECIR 2010)
SEMEVAL is different:
 Discovered Boys Bring Shock, Joy                       (+45)
 Iraq Car Bombings Kill 22 People, Wound more than 60   (−98)
1   results tie with reported NTCIR and SEMEVAL results, not far
    from best MPQA results.
2   combinations with K generally better than those without.
3   SEMEVAL data: much lower results, no surprise given
    terseness.
4   background language model has some effect: Glasgow Herald
    better precision; Korea Times better for recall for NTCIR data.
1   results tie with reported NTCIR and SEMEVAL results, not far
    from best MPQA results.
2   combinations with K generally better than those without.
3   SEMEVAL data: much lower results, no surprise given
    terseness.
4   background language model has some effect: Glasgow Herald
    better precision; Korea Times better for recall for NTCIR data.
1   results tie with reported NTCIR and SEMEVAL results, not far
    from best MPQA results.
2   combinations with K generally better than those without.
3   SEMEVAL data: much lower results, no surprise given
    terseness.
4   background language model has some effect: Glasgow Herald
    better precision; Korea Times better for recall for NTCIR data.
1   results tie with reported NTCIR and SEMEVAL results, not far
    from best MPQA results.
2   combinations with K generally better than those without.
3   SEMEVAL data: much lower results, no surprise given
    terseness.
4   background language model has some effect: Glasgow Herald
    better precision; Korea Times better for recall for NTCIR data.
but this was all hand coded.
next (preliminary) experiment: skeletons.
put sentences in word space (random indexing, 2000 dims) with
added feature for each trigram of structural terms
prove utility by better choice of task?

  • sentiment and opinion identification
  • quote identification
  • novelty detection
  • authorship attribution
  • summarisation
  • terminology mining

suggestions?
prove utility by better choice of task?

  • sentiment and opinion identification
  • quote identification
  • novelty detection
  • authorship attribution
  • summarisation
  • terminology mining

suggestions?
prove utility by better choice of task?

  • sentiment and opinion identification
  • quote identification
  • novelty detection
  • authorship attribution
  • summarisation
  • terminology mining

suggestions?
take home


1   constructional models have suitable granularity for simple
    difficult tasks
2   constructional models provide simple methodology to test the
    effect of language structure
3   constructional features are not subsidiary to word occurrence
    features
4   constructional analysis has a long history in language
    technology
5   constructional analysis has an opportunity to influence
    linguistics
take home


1   constructional models have suitable granularity for simple
    difficult tasks
2   constructional models provide simple methodology to test the
    effect of language structure
3   constructional features are not subsidiary to word occurrence
    features
4   constructional analysis has a long history in language
    technology
5   constructional analysis has an opportunity to influence
    linguistics
take home


1   constructional models have suitable granularity for simple
    difficult tasks
2   constructional models provide simple methodology to test the
    effect of language structure
3   constructional features are not subsidiary to word occurrence
    features
4   constructional analysis has a long history in language
    technology
5   constructional analysis has an opportunity to influence
    linguistics
take home


1   constructional models have suitable granularity for simple
    difficult tasks
2   constructional models provide simple methodology to test the
    effect of language structure
3   constructional features are not subsidiary to word occurrence
    features
4   constructional analysis has a long history in language
    technology
5   constructional analysis has an opportunity to influence
    linguistics
take home


1   constructional models have suitable granularity for simple
    difficult tasks
2   constructional models provide simple methodology to test the
    effect of language structure
3   constructional features are not subsidiary to word occurrence
    features
4   constructional analysis has a long history in language
    technology
5   constructional analysis has an opportunity to influence
    linguistics
to discuss




1   what constitutes a construction?
2   what is not a construction?
3   what sort of tasks can use constructions profitably?
4   what sort of abstractions do we want to use for describing
    constructions productively?
5   how can we learn constructions automatically?

Más contenido relacionado

Similar a Constructional Syntactic Analysis for Attitude Identification

Superfast Type Inference Engine
Superfast Type Inference EngineSuperfast Type Inference Engine
Superfast Type Inference EngineGanesh Kumar
 
Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1Eddie Hollon
 
What Is Thesis In Essay.pdf
What Is Thesis In Essay.pdfWhat Is Thesis In Essay.pdf
What Is Thesis In Essay.pdfCrystal Wright
 
Pragmatics and Discourse , context & speech acts
Pragmatics and Discourse , context & speech actsPragmatics and Discourse , context & speech acts
Pragmatics and Discourse , context & speech actsNaeemIqbal88
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Scottish Language Dictionaries
 
Use Your Words: Content Strategy to Influence Behavior
Use Your Words: Content Strategy to Influence BehaviorUse Your Words: Content Strategy to Influence Behavior
Use Your Words: Content Strategy to Influence BehaviorLiz Danzico
 
Evidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docxEvidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docxturveycharlyn
 
Stylistics chapter 3 BY ALEXIS JOHN BENEDICTO
Stylistics chapter 3 BY  ALEXIS JOHN BENEDICTOStylistics chapter 3 BY  ALEXIS JOHN BENEDICTO
Stylistics chapter 3 BY ALEXIS JOHN BENEDICTObenedictojohnalexis
 
vdocuments.mx_language-comprehension.ppt
vdocuments.mx_language-comprehension.pptvdocuments.mx_language-comprehension.ppt
vdocuments.mx_language-comprehension.pptSyedNadeemAbbas6
 
Don't demo facts. Demo stories! (handouts)
Don't demo facts. Demo stories! (handouts)Don't demo facts. Demo stories! (handouts)
Don't demo facts. Demo stories! (handouts)Tudor Girba
 
Communication-Collaboration (1).pptx
Communication-Collaboration (1).pptxCommunication-Collaboration (1).pptx
Communication-Collaboration (1).pptxIanJamesCalanza
 
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...Cornelius Puschmann
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis Naveen Kumar
 
Componential Analysis of Meaning.pptx
Componential Analysis of Meaning.pptxComponential Analysis of Meaning.pptx
Componential Analysis of Meaning.pptxRasha sakr
 

Similar a Constructional Syntactic Analysis for Attitude Identification (20)

Syntactic devices
Syntactic devicesSyntactic devices
Syntactic devices
 
Superfast Type Inference Engine
Superfast Type Inference EngineSuperfast Type Inference Engine
Superfast Type Inference Engine
 
Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
 
What Is Thesis In Essay.pdf
What Is Thesis In Essay.pdfWhat Is Thesis In Essay.pdf
What Is Thesis In Essay.pdf
 
Pragmatics and Discourse , context & speech acts
Pragmatics and Discourse , context & speech actsPragmatics and Discourse , context & speech acts
Pragmatics and Discourse , context & speech acts
 
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
Patrick Hanks - Why lexicographers should take more notice of phraseology, co...
 
Use Your Words: Content Strategy to Influence Behavior
Use Your Words: Content Strategy to Influence BehaviorUse Your Words: Content Strategy to Influence Behavior
Use Your Words: Content Strategy to Influence Behavior
 
A class action
A class actionA class action
A class action
 
Evidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docxEvidenced based practice In this writing, locate an article pert.docx
Evidenced based practice In this writing, locate an article pert.docx
 
Stylistics chapter 3 BY ALEXIS JOHN BENEDICTO
Stylistics chapter 3 BY  ALEXIS JOHN BENEDICTOStylistics chapter 3 BY  ALEXIS JOHN BENEDICTO
Stylistics chapter 3 BY ALEXIS JOHN BENEDICTO
 
vdocuments.mx_language-comprehension.ppt
vdocuments.mx_language-comprehension.pptvdocuments.mx_language-comprehension.ppt
vdocuments.mx_language-comprehension.ppt
 
Don't demo facts. Demo stories! (handouts)
Don't demo facts. Demo stories! (handouts)Don't demo facts. Demo stories! (handouts)
Don't demo facts. Demo stories! (handouts)
 
Communication-Collaboration (1).pptx
Communication-Collaboration (1).pptxCommunication-Collaboration (1).pptx
Communication-Collaboration (1).pptx
 
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...
Discourse Or Document? Issues of adopting Emerging Digital Genres for Scholar...
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 
sent_analysis_report
sent_analysis_reportsent_analysis_report
sent_analysis_report
 
Euro Exam
Euro Exam Euro Exam
Euro Exam
 
How to Do Things with Cases
How to Do Things with CasesHow to Do Things with Cases
How to Do Things with Cases
 
Componential Analysis of Meaning.pptx
Componential Analysis of Meaning.pptxComponential Analysis of Meaning.pptx
Componential Analysis of Meaning.pptx
 
Class5 - Languaging
Class5 - LanguagingClass5 - Languaging
Class5 - Languaging
 

Más de yaevents

Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...
Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...
Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...yaevents
 
Мониторинг со всех сторон. Алексей Симаков, Яндекс
Мониторинг со всех сторон. Алексей Симаков, ЯндексМониторинг со всех сторон. Алексей Симаков, Яндекс
Мониторинг со всех сторон. Алексей Симаков, Яндексyaevents
 
Истории про разработку сайтов. Сергей Бережной, Яндекс
Истории про разработку сайтов. Сергей Бережной, ЯндексИстории про разработку сайтов. Сергей Бережной, Яндекс
Истории про разработку сайтов. Сергей Бережной, Яндексyaevents
 
Разработка приложений для Android на С++. Юрий Береза, Shturmann
Разработка приложений для Android на С++. Юрий Береза, ShturmannРазработка приложений для Android на С++. Юрий Береза, Shturmann
Разработка приложений для Android на С++. Юрий Береза, Shturmannyaevents
 
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...yaevents
 
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...yaevents
 
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндекс
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, ЯндексСканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндекс
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндексyaevents
 
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, FacebookМасштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebookyaevents
 
Контроль зверей: инструменты для управления и мониторинга распределенных сист...
Контроль зверей: инструменты для управления и мониторинга распределенных сист...Контроль зверей: инструменты для управления и мониторинга распределенных сист...
Контроль зверей: инструменты для управления и мониторинга распределенных сист...yaevents
 
Юнит-тестирование и Google Mock. Влад Лосев, Google
Юнит-тестирование и Google Mock. Влад Лосев, GoogleЮнит-тестирование и Google Mock. Влад Лосев, Google
Юнит-тестирование и Google Mock. Влад Лосев, Googleyaevents
 
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...yaevents
 
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...yaevents
 
В поисках математики. Михаил Денисенко, Нигма
В поисках математики. Михаил Денисенко, НигмаВ поисках математики. Михаил Денисенко, Нигма
В поисках математики. Михаил Денисенко, Нигмаyaevents
 
Using classifiers to compute similarities between face images. Prof. Lior Wol...
Using classifiers to compute similarities between face images. Prof. Lior Wol...Using classifiers to compute similarities between face images. Prof. Lior Wol...
Using classifiers to compute similarities between face images. Prof. Lior Wol...yaevents
 
Поисковая технология "Спектр". Андрей Плахов, Яндекс
Поисковая технология "Спектр". Андрей Плахов, ЯндексПоисковая технология "Спектр". Андрей Плахов, Яндекс
Поисковая технология "Спектр". Андрей Плахов, Яндексyaevents
 
Julia Stoyanovich - Making interval-based clustering rank-aware
Julia Stoyanovich - Making interval-based clustering rank-awareJulia Stoyanovich - Making interval-based clustering rank-aware
Julia Stoyanovich - Making interval-based clustering rank-awareyaevents
 
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...Mike Thelwall - Sentiment strength detection for the social web: From YouTube...
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...yaevents
 
Evangelos Kanoulas — Advances in Information Retrieval Evaluation
Evangelos Kanoulas — Advances in Information Retrieval EvaluationEvangelos Kanoulas — Advances in Information Retrieval Evaluation
Evangelos Kanoulas — Advances in Information Retrieval Evaluationyaevents
 
Ben Carterett — Advances in Information Retrieval Evaluation
Ben Carterett — Advances in Information Retrieval EvaluationBen Carterett — Advances in Information Retrieval Evaluation
Ben Carterett — Advances in Information Retrieval Evaluationyaevents
 
Raffaele Perego "Efficient Query Suggestions in the Long Tail"
Raffaele Perego "Efficient Query Suggestions in the Long Tail"Raffaele Perego "Efficient Query Suggestions in the Long Tail"
Raffaele Perego "Efficient Query Suggestions in the Long Tail"yaevents
 

Más de yaevents (20)

Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...
Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...
Администрирование небольших сервисов или один за всех и 100 на одного. Роман ...
 
Мониторинг со всех сторон. Алексей Симаков, Яндекс
Мониторинг со всех сторон. Алексей Симаков, ЯндексМониторинг со всех сторон. Алексей Симаков, Яндекс
Мониторинг со всех сторон. Алексей Симаков, Яндекс
 
Истории про разработку сайтов. Сергей Бережной, Яндекс
Истории про разработку сайтов. Сергей Бережной, ЯндексИстории про разработку сайтов. Сергей Бережной, Яндекс
Истории про разработку сайтов. Сергей Бережной, Яндекс
 
Разработка приложений для Android на С++. Юрий Береза, Shturmann
Разработка приложений для Android на С++. Юрий Береза, ShturmannРазработка приложений для Android на С++. Юрий Береза, Shturmann
Разработка приложений для Android на С++. Юрий Береза, Shturmann
 
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...
Кросс-платформенная разработка под мобильные устройства. Дмитрий Жестилевский...
 
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...
Сложнейшие техники, применяемые буткитами и полиморфными вирусами. Вячеслав З...
 
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндекс
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, ЯндексСканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндекс
Сканирование уязвимостей со вкусом Яндекса. Тарас Иващенко, Яндекс
 
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, FacebookМасштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
Масштабируемость Hadoop в Facebook. Дмитрий Мольков, Facebook
 
Контроль зверей: инструменты для управления и мониторинга распределенных сист...
Контроль зверей: инструменты для управления и мониторинга распределенных сист...Контроль зверей: инструменты для управления и мониторинга распределенных сист...
Контроль зверей: инструменты для управления и мониторинга распределенных сист...
 
Юнит-тестирование и Google Mock. Влад Лосев, Google
Юнит-тестирование и Google Mock. Влад Лосев, GoogleЮнит-тестирование и Google Mock. Влад Лосев, Google
Юнит-тестирование и Google Mock. Влад Лосев, Google
 
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...
C++11 (formerly known as C++0x) is the new C++ language standard. Dave Abraha...
 
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...
Зачем обычному программисту знать языки, на которых почти никто не пишет. Але...
 
В поисках математики. Михаил Денисенко, Нигма
В поисках математики. Михаил Денисенко, НигмаВ поисках математики. Михаил Денисенко, Нигма
В поисках математики. Михаил Денисенко, Нигма
 
Using classifiers to compute similarities between face images. Prof. Lior Wol...
Using classifiers to compute similarities between face images. Prof. Lior Wol...Using classifiers to compute similarities between face images. Prof. Lior Wol...
Using classifiers to compute similarities between face images. Prof. Lior Wol...
 
Поисковая технология "Спектр". Андрей Плахов, Яндекс
Поисковая технология "Спектр". Андрей Плахов, ЯндексПоисковая технология "Спектр". Андрей Плахов, Яндекс
Поисковая технология "Спектр". Андрей Плахов, Яндекс
 
Julia Stoyanovich - Making interval-based clustering rank-aware
Julia Stoyanovich - Making interval-based clustering rank-awareJulia Stoyanovich - Making interval-based clustering rank-aware
Julia Stoyanovich - Making interval-based clustering rank-aware
 
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...Mike Thelwall - Sentiment strength detection for the social web: From YouTube...
Mike Thelwall - Sentiment strength detection for the social web: From YouTube...
 
Evangelos Kanoulas — Advances in Information Retrieval Evaluation
Evangelos Kanoulas — Advances in Information Retrieval EvaluationEvangelos Kanoulas — Advances in Information Retrieval Evaluation
Evangelos Kanoulas — Advances in Information Retrieval Evaluation
 
Ben Carterett — Advances in Information Retrieval Evaluation
Ben Carterett — Advances in Information Retrieval EvaluationBen Carterett — Advances in Information Retrieval Evaluation
Ben Carterett — Advances in Information Retrieval Evaluation
 
Raffaele Perego "Efficient Query Suggestions in the Long Tail"
Raffaele Perego "Efficient Query Suggestions in the Long Tail"Raffaele Perego "Efficient Query Suggestions in the Long Tail"
Raffaele Perego "Efficient Query Suggestions in the Long Tail"
 

Constructional Syntactic Analysis for Attitude Identification

  • 1. constructional syntactic analysis for information access tasks jussi karlgren yandex, february 16, 2011
  • 2. jussi karlgren ph d in (computational) linguistics from stockholm senior researcher in information access at sics, stockholm docent in language technology at univ of helsinki founding partner, gavagai ab, stockholm
  • 3. • independent non-profit research institute • about 100-200 researchers • ... networks, distributed systems, programming tools, collaborative environments, information access, design, digital art...
  • 4. • recent startup company • about 7-8 employees • extracts actionable intelligence from very large text streams
  • 5. why use computational methods and machinery for information access? two reasons: 1 amount of data is overwhelming → reduce data complexity let’s call these “simple” tasks 2 signal is weak and complex → peer closer into data let’s call these “difficult” tasks
  • 6. why use computational methods and machinery for information access? two reasons: 1 amount of data is overwhelming → reduce data complexity let’s call these “simple” tasks 2 signal is weak and complex → peer closer into data let’s call these “difficult” tasks
  • 7. why use computational methods and machinery for information access? two reasons: 1 amount of data is overwhelming → reduce data complexity let’s call these “simple” tasks 2 signal is weak and complex → peer closer into data let’s call these “difficult” tasks
  • 8. why use computational methods and machinery for information access? two reasons: 1 amount of data is overwhelming → reduce data complexity let’s call these “simple” tasks 2 signal is weak and complex → peer closer into data let’s call these “difficult” tasks
  • 9. for the simple tasks the sensible thing to do is to pound the text into small bits and count the various type of bit.
  • 10. for the simple tasks the sensible thing to do is to pound the text into small bits and count the various type of bit.
  • 11. this works well up to a point. for search engines. but not for e.g. authorship attribution.
  • 12. this works well up to a point. for search engines. but not for e.g. authorship attribution.
  • 13. this works well up to a point. for search engines. but not for e.g. authorship attribution.
  • 14. what is the next sensible thing to do? try to organise the bits into piles first, generalising them, or try to see if the bits have relations to each other, building more complex structures. which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
  • 15. what is the next sensible thing to do? try to organise the bits into piles first, generalising them, or try to see if the bits have relations to each other, building more complex structures. which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
  • 16. what is the next sensible thing to do? try to organise the bits into piles first, generalising them, or try to see if the bits have relations to each other, building more complex structures. which involves non-trivial complex decisions and results in a brittle, error-prone procedure.
  • 17. why is parsing impractical? 1 new text 2 categories unfounded in data 3 dependencies not based on necessity or efficiency
  • 18. why is parsing impractical? 1 new text 2 categories unfounded in data 3 dependencies not based on necessity or efficiency
  • 19. why is parsing impractical? 1 new text 2 categories unfounded in data 3 dependencies not based on necessity or efficiency
  • 20. why is parsing impractical? 1 new text 2 categories unfounded in data 3 dependencies not based on necessity or efficiency
  • 21. what is in the signal? “It is this, I think, that commentators mean when they say glibly that the ‘world changed’ after Sept 11.” words? or something more?
  • 22. what is in the signal? “It is this, I think, that commentators mean when they say glibly that the ‘world changed’ after Sept 11.” words? or something more?
  • 23. linguistics has an answer. but that answer doesn’t help much in practical applications.
  • 24. linguistics has an answer. but that answer doesn’t help much in practical applications.
  • 25. what is in the signal to begin with? just words? and a pattern?
  • 26. what is in the signal to begin with? just words? and a pattern?
  • 27. what is in the signal to begin with? just words? and a pattern?
  • 28. „ah, but the words are in the clause, the pattern is only expressed by the words that participate in it.”
  • 29. so patterns do not exist when not in use? do words exist outside their usage? „To ask where a verbal operant is when a response is not in the course of being emitted is like asking where one’s knee-jerk is when the physician is not tapping the patellar tendon.” B.F. Skinner, Verbal Behavior
  • 30. so patterns do not exist when not in use? do words exist outside their usage? „To ask where a verbal operant is when a response is not in the course of being emitted is like asking where one’s knee-jerk is when the physician is not tapping the patellar tendon.” B.F. Skinner, Verbal Behavior
  • 31. so patterns do not exist when not in use? do words exist outside their usage? „To ask where a verbal operant is when a response is not in the course of being emitted is like asking where one’s knee-jerk is when the physician is not tapping the patellar tendon.” B.F. Skinner, Verbal Behavior
  • 32. we claim that patterns are part of the signal, - not incidental to it, - nor secondary to the terms in it. this appears to be a contentious statement.
  • 33. we claim that patterns are part of the signal, - not incidental to it, - nor secondary to the terms in it. this appears to be a contentious statement.
  • 34. radical construction grammar (cxg) 1 syntax-lexicon continuum 2 form and function specified in unified model 3 structurally cohesive (william croft, 2005)
  • 35. 1. syntax-lexicon continuum Construction type Examples Complex and abstract syntax sbj be-tense verb-en by agent Complex and concrete idiom up-tense the ante Complex and bound morphology noun-s Atomic and abstract category adj, clause Atomic and concrete lexicon this, green are all equal: constructions are the primitive elements → no parts of speech, no syntactic categories necessary
  • 36. 2. form and function specified in unified model → no separate syntactic (or semantic) component necessary
  • 37. 3. structurally cohesive → everything is constructions and nothing else; everything is specific and nothing is universal
  • 38. practically: the pattern of an utterance is a feature with the same ontological status as the terms that occur in the utterance. constructions and lexemes both have conceptual meaning. constructions or patterns are present even without recourse to the words in it.
  • 39. practically: the pattern of an utterance is a feature with the same ontological status as the terms that occur in the utterance. constructions and lexemes both have conceptual meaning. constructions or patterns are present even without recourse to the words in it.
  • 40. practically: the pattern of an utterance is a feature with the same ontological status as the terms that occur in the utterance. constructions and lexemes both have conceptual meaning. constructions or patterns are present even without recourse to the words in it.
  • 41. our claim: to study pattern occurrences, no coupling between the features and the words carrying them needs to be done. this is quite convenient. which is good.
  • 42. our claim: to study pattern occurrences, no coupling between the features and the words carrying them needs to be done. this is quite convenient. which is good.
  • 43. patterns, in various forms have been used in language technology for some time: linguistic string project. (1965-1998) Naomi Sager et al leading to information extraction. large number of adhoc pattern descriptions, closely based on data as observed in use
  • 44. patterns, in various forms have been used in language technology for some time: linguistic string project. (1965-1998) Naomi Sager et al leading to information extraction. large number of adhoc pattern descriptions, closely based on data as observed in use
  • 45. now turn to one example task: identification and analysis of attitude
  • 46. attitude analysis can be done on any text source blogs: unfettered discourse, wom, low publication threshold, no editorial control but it’s new text — new processing practice necessary
  • 47. attitude analysis can be done on any text source blogs: unfettered discourse, wom, low publication threshold, no editorial control but it’s new text — new processing practice necessary
  • 48. a prototypical attitudinal expression Expression WHO FEELS WHAT ABOUT WHAT I like sauerkraut I like sauerkraut Kissing is nice ? nice kiss someone sentiment term topic is this picture true?
  • 49. a prototypical attitudinal expression Expression WHO FEELS WHAT ABOUT WHAT I like sauerkraut I like sauerkraut Kissing is nice ? nice kiss someone sentiment term topic is this picture true?
  • 50. it is this, i think, that commentators mean when they say glibly that the ‘world changed’ after sept 11. president hafez al-assad has said that peace was a pressing need for the region and the world at large and syria, considering peace a strategic option would take steps towards peace. mr cohen, beginning an eight-day european tour including a nato defence ministers’ meeting in brussels today and tomorrow, said he expected further international action soon, though not necessarily military intervention. the designers from house on fire do not like random play. sauerkraut is damn good but kimchi is even better. bertram powerboats have a deep v hull and handle well in choppy sea. m.a.k. halliday thought it natural to view syntax from a functional perspective.
  • 51. our claim: attitude is not only lexical or lexicon is not only words & terms “He blew me off” vs “He blew off” “He has the best result, we cannot fail him” vs “This is the best coffee, we cannot fail with it” “Fifth Avenue”, “9/11”
  • 53. we’ll hand code a number of sample constructions to test our claim that they might be useful to identify attitudinal expressions. remember: to study patterns — we do not need to encode explict linkage to words!
  • 54. we’ll hand code a number of sample constructions to test our claim that they might be useful to identify attitudinal expressions. remember: to study patterns — we do not need to encode explict linkage to words!
  • 55. we represent each sentence using three separate sets of features: I content words F form words K constructions
  • 56. I features content words – nouns, adjectives, verbs (including verbal uses of participles), adverbs, abbreviations, numerals, interjections, and negation
  • 57. F features function words – prepositions, determiners, conjunctions, pronouns
  • 58. K : sentence structure transitivity, predicate, relative, and object clauses, tense shift within sentence
  • 59. K : various adverbials adverbials of location, time, manner, condition, quantity, clause adverbial, clause initial adverbials
  • 60. K : morphology of sentence constituents present or past tense, adjectives in base, comparative, or superlative form
  • 61. K : word dependencies and categories subordinate conjunctions, negations, prepositional post modifiers, verb chains, quantifiers, particle verbs, prepositional phrases, adjective modifiers
  • 62. “It is this, I think, that commentators mean when they say glibly that the ‘world changed’ after Sept 11.” I be think commentator mean when say glibly world change sept 11 F it this i that they that the after K AdvlTim, AdvlMan, ObjCls, PredCls, TRIn, TRtr, TRmix, TnsPres, TnsPast, TnsShift
  • 63. in preliminary experiments with SVM feature selection we find that several of the K features have high rank for categorisation, notably TnsShift, TnsPast, TRmix, PredCls.
  • 64. TnsShift “Noam Chomsky saidpast that what makes human language unique ispresent recursive centre embedding” “M.A.K. Halliday believedpast that grammar, viewed functionally, ispresent natural” → saves us from acquiring and maintaining lists of verbs of utterance, pronuncement, and cognition.
  • 65. TnsShift “Noam Chomsky saidpast that what makes human language unique ispresent recursive centre embedding” “M.A.K. Halliday believedpast that grammar, viewed functionally, ispresent natural” → saves us from acquiring and maintaining lists of verbs of utterance, pronuncement, and cognition.
  • 66. 0.15 KmpAdj 0.10 AdjMod TnsShift Neg RelCls 0.05 TnsPast Factor 2 (13.8 %) VChain k.1 0.00 ObjCls k.2 Pos TRmix NoAtt PredCls -0.05 SupAdj Neu -0.10 AdvlMan k -0.15 Neg
  • 67. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 68. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 69. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 70. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 71. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 72. our experiment: 1 represent sentences using three sets of features: I , F , K . 2 build a language representation using one year of newsprint: test for differences between KT, MD, GH. 3 test sets of attitudinal sentences: SEMEVAL, NTCIR 6 & 7, MPQA. 4 put test sentences in word space (random indexing, 2000 dims) with added feature indicating attitude. 5 extract feature vector from word space and run through SVM. 6 test with five-fold crossvalidation.
  • 73. experimental data: NTCIR 6 NTCIR 7 SEMEVAL MPQA Attitudinal 1 392 1 075 76 6 021 Non-attitudinal 4 416 3 201 174 4 982 Total 5 808 4 276 250 11 003
  • 74. F1 NTCIR 6 NTCIR 7 MPQA SEMEVAL I 46.1 45.2 63.4 42.4 F 44.9 47.5 65.4 40.4 K 42.3 43.6 63.7 33.8 IF 45.9 47.4 67.3 41.4 IK 45.9 48.6 67.0 38.6 FK 46.1 47.9 68.0 37.5 IFK 47.5 48.6 69.2 41.8 Precision range approx 40 approx 70 approx 30 Recall range approx 55-65 K features often help and never really hurt. (karlgren et al, ECIR 2010)
  • 75. SEMEVAL is different: Discovered Boys Bring Shock, Joy (+45) Iraq Car Bombings Kill 22 People, Wound more than 60 (−98)
  • 76. 1 results tie with reported NTCIR and SEMEVAL results, not far from best MPQA results. 2 combinations with K generally better than those without. 3 SEMEVAL data: much lower results, no surprise given terseness. 4 background language model has some effect: Glasgow Herald better precision; Korea Times better for recall for NTCIR data.
  • 77. 1 results tie with reported NTCIR and SEMEVAL results, not far from best MPQA results. 2 combinations with K generally better than those without. 3 SEMEVAL data: much lower results, no surprise given terseness. 4 background language model has some effect: Glasgow Herald better precision; Korea Times better for recall for NTCIR data.
  • 78. 1 results tie with reported NTCIR and SEMEVAL results, not far from best MPQA results. 2 combinations with K generally better than those without. 3 SEMEVAL data: much lower results, no surprise given terseness. 4 background language model has some effect: Glasgow Herald better precision; Korea Times better for recall for NTCIR data.
  • 79. 1 results tie with reported NTCIR and SEMEVAL results, not far from best MPQA results. 2 combinations with K generally better than those without. 3 SEMEVAL data: much lower results, no surprise given terseness. 4 background language model has some effect: Glasgow Herald better precision; Korea Times better for recall for NTCIR data.
  • 80. but this was all hand coded.
  • 82.
  • 83. put sentences in word space (random indexing, 2000 dims) with added feature for each trigram of structural terms
  • 84.
  • 85.
  • 86. prove utility by better choice of task? • sentiment and opinion identification • quote identification • novelty detection • authorship attribution • summarisation • terminology mining suggestions?
  • 87. prove utility by better choice of task? • sentiment and opinion identification • quote identification • novelty detection • authorship attribution • summarisation • terminology mining suggestions?
  • 88. prove utility by better choice of task? • sentiment and opinion identification • quote identification • novelty detection • authorship attribution • summarisation • terminology mining suggestions?
  • 89. take home 1 constructional models have suitable granularity for simple difficult tasks 2 constructional models provide simple methodology to test the effect of language structure 3 constructional features are not subsidiary to word occurrence features 4 constructional analysis has a long history in language technology 5 constructional analysis has an opportunity to influence linguistics
  • 90. take home 1 constructional models have suitable granularity for simple difficult tasks 2 constructional models provide simple methodology to test the effect of language structure 3 constructional features are not subsidiary to word occurrence features 4 constructional analysis has a long history in language technology 5 constructional analysis has an opportunity to influence linguistics
  • 91. take home 1 constructional models have suitable granularity for simple difficult tasks 2 constructional models provide simple methodology to test the effect of language structure 3 constructional features are not subsidiary to word occurrence features 4 constructional analysis has a long history in language technology 5 constructional analysis has an opportunity to influence linguistics
  • 92. take home 1 constructional models have suitable granularity for simple difficult tasks 2 constructional models provide simple methodology to test the effect of language structure 3 constructional features are not subsidiary to word occurrence features 4 constructional analysis has a long history in language technology 5 constructional analysis has an opportunity to influence linguistics
  • 93. take home 1 constructional models have suitable granularity for simple difficult tasks 2 constructional models provide simple methodology to test the effect of language structure 3 constructional features are not subsidiary to word occurrence features 4 constructional analysis has a long history in language technology 5 constructional analysis has an opportunity to influence linguistics
  • 94. to discuss 1 what constitutes a construction? 2 what is not a construction? 3 what sort of tasks can use constructions profitably? 4 what sort of abstractions do we want to use for describing constructions productively? 5 how can we learn constructions automatically?