SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
Empirical Validation of Reichenbach’s Tense
                               Framework

                             Leon Derczynski and Robert Gaizauskas

                                               University of Sheffield


                                                 21 March 2013




Leon Derczynski and Robert Gaizauskas                                  University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
The Role of Time


       Why is time important in language processing?
              World state changes constantly
              Every empirical assertion has temporal bounds
              “The sky is blue”, but it was not always
              Without it, na¨ knowledge extraction will fail (given an
                            ıve
              Almanac of Presidents, who is President?)
       Temporal relations critical




Leon Derczynski and Robert Gaizauskas                              University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Representations




       Attempts to reify time in discourse
              (ISO-)TimeML: XML-like standard
              TimeBank: about 180 documents of newswire




Leon Derczynski and Robert Gaizauskas                     University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Utility



       Automatic temporal IE immediately useful for:
              Fact-bounding (TAC KBP)
              Clinical: summarisation, event ordering
              NLG: Carsim, Babytalk
              Machine translation




Leon Derczynski and Robert Gaizauskas                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Relations

       Temporal relations difficult to extract
       What do they look like?
              A before B
              A includes B
       TempEval 2 results suggest event-event are hardest       0




       0. Verhagen, M. et al. 2010. “SemEval-2010 task 13: TempEval-2” in Proc.
       Int’l Workshop on Semantic Evaluation


Leon Derczynski and Robert Gaizauskas                                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
The problem seems “ML-resistant”
       TE2 best performance: 0.65 accurate; MCC baseline: 0.59
       accurate
       Sophisticated features: 1.5% improvement




       We need more insight
       Many event-event relations are between tensed verbs
Leon Derczynski and Robert Gaizauskas                            University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Formal framework


       Tripartite perception of time                    1




       What about the perfect?
       1. McTaggart, J.M.E. 1908 “The Unreality of Time” Mind: A Quarterly
       Review of Psychology and Philosophy, 17




Leon Derczynski and Robert Gaizauskas                                  University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Formal framework



       Another similar partitioning, for “perspective”




       “By 9p.m., I will have left”




Leon Derczynski and Robert Gaizauskas                    University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Reichenbach

       Let’s introduce a framework of tense and aspect       2

              Each verb happens at event time, E
              The verb is uttered at speech time, S
              Past tense: E < S John ran.
              Present tense: E = S I’m free!
       Reflects basic tripartite model


       2. Reichenbach, H. 1947 “The Tense of Verbs” in Elements of Symbolic Logic,
       Macmillan


Leon Derczynski and Robert Gaizauskas                                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Reference time


       What differentiates simple past from past perfect?
       Add reference time - three points S, E , R
              John ran. is not the same as John had run.
              Introduce abstract reference time, R
              John had run. E < R < S
       R acts as abstract focus
       Corresponds to centre of advanced tripartite




Leon Derczynski and Robert Gaizauskas                      University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Reichenbachian tenses

       What tenses can we have?
              Relation       Reichenbach’s Tense Name   English Tense Name   Example
              E<R<S          Anterior past              Past perfect         I had slept
              E=R<S         Simple past                Simple past          I slept
              R<E<S      
              R<S=E          Posterior past                                  I expected that I
              R<S<E                                                             would sleep
              E<S=R          Anterior present           Present perfect      I have slept
              S=R=E          Simple present             Simple present       I sleep
              S=R<E         Posterior present          Simple future        I will sleep (Je vais dormir)
              S<E<R      
              S=E<R          Anterior future            Future perfect       I will have slept
              E<S<R      
              S<R=E          Simple future              Simple future        I will sleep (Je dormirai)
              S<R<E          Posterior future                                I shall be going to sleep

                                        Table : Reichenbach’s tenses


       Total 19 combinations: the above are useful for English

Leon Derczynski and Robert Gaizauskas                                                              University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Permanence of the reference point



       How can we use this for temporal relations?
       Principle of permanence
       “although the events referred to in the clauses may occupy different
       time points, the reference point should be the same for all clauses”
       Shared R
       Applies when verb events are in the same context: “sequence of
       tenses”
       More on this later!




Leon Derczynski and Robert Gaizauskas                             University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Time to validate

       With permanence, we can reason about event order
       This seems great, but first:

       Is Reichenbach’s framework correct?

       Let’s look at the data
              7935 EVENTs
              6418 TLINKs
       We’ll have to connect Reichenbach’s framework with TimeML
       semantics first


Leon Derczynski and Robert Gaizauskas                       University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
TimeML tense and aspect



                                TimeML tense            TimeML aspect
                                     past                    none
                                   present                  perfect
                                    future                progressive
                                                             both

       Progressive? This isn’t in the framework




Leon Derczynski and Robert Gaizauskas                                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Progressive

       As TimeML assumes events are intervals, let’s do the same:




       Decompose progressives into incipitive and concluding instants
       E → Es , E f
       Event is viewed at a point where it is ongoing
       Place R between Es and Ef

Leon Derczynski and Robert Gaizauskas                            University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Connect the two

       Now we can describe tensed TimeML events in Reichenbachian
       terms:
                     TimeML Tense            TimeML Aspect   Reichenbach structure
                     PAST                    NONE                 E =R<S
                     PAST                    PROGRESSIVE      Es < R < S, R < Ef
                     PAST                    PERFECTIVE           Ef < R < S
                     PRESENT                 NONE                 E =R=S
                     PRESENT                 PROGRESSIVE       Es < R = S < Ef
                     PRESENT                 PERFECTIVE           Ef < R = S
                     FUTURE                  NONE                 S<R=E
                     FUTURE                  PROGRESSIVE      S < R < Ef , Es < R
                     FUTURE                  PERFECTIVE        S < Es < Ef < R

       Table : TimeML tense/aspect combinations, in terms of the Reichenbach
       framework.


Leon Derczynski and Robert Gaizauskas                                                University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Relation ambiguity

       The target for validation: temporal relations
       Follow Allen’s relation set of 14 3
       Our Reichenbach triples underspecific for the precise interval
       relation. E.g.:
              If E1 is simple past and E2 simple future
              Tense suggests that E1 starts before E2
              There are many Allen interval relation types for this - before,
              during, includes

       3. Allen, J. 1983 “Maintaining Knowledge about Temporal Intervals” Comm.
       ACM 26 (11)


Leon Derczynski and Robert Gaizauskas                                  University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
TimeML relation disjunctions



       Solution: use disjunctions
       Use Reichenbach to just constrain the relation type
              Tense suggests that E1 starts before E2
              The available Allen relation types for E1 / E2 are:
              before, ibefore, during, ended by and includes.
       Any one of these relation types is a valid response.




Leon Derczynski and Robert Gaizauskas                               University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Freksa’s Semi-Intervals
                            Surprise Observation Slide!
       Build set of Allen disjunctions from all possible combs. of R’bach
       triples that come from TimeML tenses
       Identical to groups in Freska’s semi-interval algebra 4


         X is older than Y                              X [before, ibefore, ended by, in-
         Y is younger than X                            cludes, during] Y


       – which was designed for annotating natural language

       (are Freksa relations more appropriate than Allen’s,
       for this task?)
       4. Freksa, C. 1992 “Temporal reasoning based on semi-
       intervals” AI 54 (1)
Leon Derczynski and Robert Gaizauskas                                          University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Recap



       So: now we can
              Map TimeML verb events into Reichenbach triples
              Temporally relate Reichenbach verb events
              Map Reichenbach event relations back to TimeML
       Which pairs of verbs, e.g. which temporally related events to
       choose?




Leon Derczynski and Robert Gaizauskas                            University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Temporal context


       TLINK requirements:
              Event-Event;
              PoS = verb;
              same temporal context
       Reichenbach unclear – “sequence of tenses”
       Possible for expert annotator to label
       We prefer an automatic method!




Leon Derczynski and Robert Gaizauskas                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Context modelling


       Need to model context
       Perhaps proximity in text can hint at relatedness?
              same sentence
              same or adjacent sent
       Same S R order
              R should be in same place
              S shouldn’t move




Leon Derczynski and Robert Gaizauskas                       University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Results

       For TLINKs with event verb arguments in the same context
       What proportion have relation types within constraints of R’bach’s
       framework?
                  Context model                         TLINKs    Consistent
                  None (all pairs)                        1 167       81.5%
                  Same sentence, same SR                    300       88.0%
                  Same sentence                             600       71.2%
                  Same / adjacent sentence, same SR         566      91.9%
                  Same / adjacent sentence                  913       78.3%

       Table : Consistency of relation types suggested by Reichenbach’s
       framework with ground-truth.


       Relation type IAA: 0.77

Leon Derczynski and Robert Gaizauskas                                      University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Super-stringent results

       Sometimes no constraint is possible
       e.g. “Jack went to school, Jill went to the circus”
       What if we exclude these?
          Context model                                 Non-“all” TLINKs    Consistent
          None (all pairs)                                            481       55.1%
          Same sentence, same SR                                       95       62.1%
          Same sentence                                               346       50.0%
          Same / adjacent sentence, same SR                           143      67.8%
          Same / adjacent sentence                                    422       53.1%

       Table : Consistency of relation types suggested by Reichenbach’s
       framework with ground-truth.


       Relation type IAA: 0.77

Leon Derczynski and Robert Gaizauskas                                           University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Summary


       What have we done?
              Extended Reichenbach’s framework to account for progressive
              Described a mapping between R’bach and TimeML
              Applied this to event-event relations
       Finding: Reichenbach’s framework appropriately constrains
       TimeML relation type
       The model is not contradicted by data, but in fact supported




Leon Derczynski and Robert Gaizauskas                            University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Comments


       Temporal annotation is hard for humans, which gives machines
       problems
       New problem: temporal context
       Are the Allen full-interval relations over-specific for linguistic
       annotation?
       Annotation of Reichenbach in TimeML              5


       5. Derczynski, Gaizauskas. 2011 “An Annotation Scheme for Reichenbach’s
       Verbal Tense Structure” in Proc. ISA-6




Leon Derczynski and Robert Gaizauskas                                   University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Thank you




                                                   Thank you!
                                         Are there any questions?




Leon Derczynski and Robert Gaizauskas                               University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework
Events and Times
       How else can we use the model?
       Positional use

              Sets R to equal a timex (At 10p.m. I had showered)
              Select event-time relations using dependency parses
              Only consider cases where the event and time are linguistically
              connected
              Add a feature hinting at the ordering
              We reach 75% accuracy from a 66% baseline

       Also used for timex standard transduction        6

       6. Derczynski et al. 2012 “Massively increasing TIMEX3
       resources: a transduction approach” in Proc. LREC
Leon Derczynski and Robert Gaizauskas                               University of Sheffield
Empirical Validation of Reichenbach’s Tense Framework

Más contenido relacionado

Más de Leon Derczynski

Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesLeon Derczynski
 
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Leon Derczynski
 
Starting to Process Social Media
Starting to Process Social MediaStarting to Process Social Media
Starting to Process Social MediaLeon Derczynski
 
Christmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doChristmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doLeon Derczynski
 
Recognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsRecognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsLeon Derczynski
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextLeon Derczynski
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy DataLeon Derczynski
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Leon Derczynski
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseLeon Derczynski
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyLeon Derczynski
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataLeon Derczynski
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseLeon Derczynski
 
TIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceTIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceLeon Derczynski
 
Review of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesReview of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesLeon Derczynski
 
A data driven approach to query expansion in question answering
A data driven approach to query expansion in question answeringA data driven approach to query expansion in question answering
A data driven approach to query expansion in question answeringLeon Derczynski
 
A Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsA Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsLeon Derczynski
 
Word Sense Disambiguation and Induction
Word Sense Disambiguation and InductionWord Sense Disambiguation and Induction
Word Sense Disambiguation and InductionLeon Derczynski
 
Using signals to improve automatic classification of temporal relations
Using signals to improve automatic classification of temporal relationsUsing signals to improve automatic classification of temporal relations
Using signals to improve automatic classification of temporal relationsLeon Derczynski
 
An Annotation Scheme for Reichenbach's Verbal Tense Structure
An Annotation Scheme for Reichenbach's Verbal Tense StructureAn Annotation Scheme for Reichenbach's Verbal Tense Structure
An Annotation Scheme for Reichenbach's Verbal Tense StructureLeon Derczynski
 
RTMBank: Capturing Verbs with Reichenbach's Tense Model
RTMBank: Capturing Verbs with Reichenbach's Tense ModelRTMBank: Capturing Verbs with Reichenbach's Tense Model
RTMBank: Capturing Verbs with Reichenbach's Tense ModelLeon Derczynski
 

Más de Leon Derczynski (20)

Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
 
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
Passive-Aggressive Sequence Labeling with Discriminative Post-Editing for Rec...
 
Starting to Process Social Media
Starting to Process Social MediaStarting to Process Social Media
Starting to Process Social Media
 
Christmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I doChristmas Presentation at Aarhus: What I do
Christmas Presentation at Aarhus: What I do
 
Recognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal ExpressionsRecognising and Interpreting Named Temporal Expressions
Recognising and Interpreting Named Temporal Expressions
 
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog TextTwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in Discourse
 
Microblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracyMicroblog-genre noise and its impact on semantic annotation accuracy
Microblog-genre noise and its impact on semantic annotation accuracy
 
Towards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media DataTowards Context-Aware Search and Analysis on Social Media Data
Towards Context-Aware Search and Analysis on Social Media Data
 
Determining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in DiscourseDetermining the Types of Temporal Relations in Discourse
Determining the Types of Temporal Relations in Discourse
 
TIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation ResourceTIMEN: An Open Temporal Expression Normalisation Resource
TIMEN: An Open Temporal Expression Normalisation Resource
 
Review of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologiesReview of: Challenges of migrating to agile methodologies
Review of: Challenges of migrating to agile methodologies
 
A data driven approach to query expansion in question answering
A data driven approach to query expansion in question answeringA data driven approach to query expansion in question answering
A data driven approach to query expansion in question answering
 
A Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal SignalsA Corpus-based Study of Temporal Signals
A Corpus-based Study of Temporal Signals
 
Word Sense Disambiguation and Induction
Word Sense Disambiguation and InductionWord Sense Disambiguation and Induction
Word Sense Disambiguation and Induction
 
Using signals to improve automatic classification of temporal relations
Using signals to improve automatic classification of temporal relationsUsing signals to improve automatic classification of temporal relations
Using signals to improve automatic classification of temporal relations
 
An Annotation Scheme for Reichenbach's Verbal Tense Structure
An Annotation Scheme for Reichenbach's Verbal Tense StructureAn Annotation Scheme for Reichenbach's Verbal Tense Structure
An Annotation Scheme for Reichenbach's Verbal Tense Structure
 
RTMBank: Capturing Verbs with Reichenbach's Tense Model
RTMBank: Capturing Verbs with Reichenbach's Tense ModelRTMBank: Capturing Verbs with Reichenbach's Tense Model
RTMBank: Capturing Verbs with Reichenbach's Tense Model
 

Empirical Validation of Reichenbach's Tense Framework

  • 1. Empirical Validation of Reichenbach’s Tense Framework Leon Derczynski and Robert Gaizauskas University of Sheffield 21 March 2013 Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 2. The Role of Time Why is time important in language processing? World state changes constantly Every empirical assertion has temporal bounds “The sky is blue”, but it was not always Without it, na¨ knowledge extraction will fail (given an ıve Almanac of Presidents, who is President?) Temporal relations critical Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 3. Representations Attempts to reify time in discourse (ISO-)TimeML: XML-like standard TimeBank: about 180 documents of newswire Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 4. Utility Automatic temporal IE immediately useful for: Fact-bounding (TAC KBP) Clinical: summarisation, event ordering NLG: Carsim, Babytalk Machine translation Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 5. Relations Temporal relations difficult to extract What do they look like? A before B A includes B TempEval 2 results suggest event-event are hardest 0 0. Verhagen, M. et al. 2010. “SemEval-2010 task 13: TempEval-2” in Proc. Int’l Workshop on Semantic Evaluation Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 6. The problem seems “ML-resistant” TE2 best performance: 0.65 accurate; MCC baseline: 0.59 accurate Sophisticated features: 1.5% improvement We need more insight Many event-event relations are between tensed verbs Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 7. Formal framework Tripartite perception of time 1 What about the perfect? 1. McTaggart, J.M.E. 1908 “The Unreality of Time” Mind: A Quarterly Review of Psychology and Philosophy, 17 Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 8. Formal framework Another similar partitioning, for “perspective” “By 9p.m., I will have left” Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 9. Reichenbach Let’s introduce a framework of tense and aspect 2 Each verb happens at event time, E The verb is uttered at speech time, S Past tense: E < S John ran. Present tense: E = S I’m free! Reflects basic tripartite model 2. Reichenbach, H. 1947 “The Tense of Verbs” in Elements of Symbolic Logic, Macmillan Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 10. Reference time What differentiates simple past from past perfect? Add reference time - three points S, E , R John ran. is not the same as John had run. Introduce abstract reference time, R John had run. E < R < S R acts as abstract focus Corresponds to centre of advanced tripartite Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 11. Reichenbachian tenses What tenses can we have? Relation Reichenbach’s Tense Name English Tense Name Example E<R<S Anterior past Past perfect I had slept E=R<S  Simple past Simple past I slept R<E<S  R<S=E Posterior past I expected that I R<S<E  would sleep E<S=R Anterior present Present perfect I have slept S=R=E Simple present Simple present I sleep S=R<E  Posterior present Simple future I will sleep (Je vais dormir) S<E<R  S=E<R Anterior future Future perfect I will have slept E<S<R  S<R=E Simple future Simple future I will sleep (Je dormirai) S<R<E Posterior future I shall be going to sleep Table : Reichenbach’s tenses Total 19 combinations: the above are useful for English Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 12. Permanence of the reference point How can we use this for temporal relations? Principle of permanence “although the events referred to in the clauses may occupy different time points, the reference point should be the same for all clauses” Shared R Applies when verb events are in the same context: “sequence of tenses” More on this later! Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 13. Time to validate With permanence, we can reason about event order This seems great, but first: Is Reichenbach’s framework correct? Let’s look at the data 7935 EVENTs 6418 TLINKs We’ll have to connect Reichenbach’s framework with TimeML semantics first Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 14. TimeML tense and aspect TimeML tense TimeML aspect past none present perfect future progressive both Progressive? This isn’t in the framework Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 15. Progressive As TimeML assumes events are intervals, let’s do the same: Decompose progressives into incipitive and concluding instants E → Es , E f Event is viewed at a point where it is ongoing Place R between Es and Ef Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 16. Connect the two Now we can describe tensed TimeML events in Reichenbachian terms: TimeML Tense TimeML Aspect Reichenbach structure PAST NONE E =R<S PAST PROGRESSIVE Es < R < S, R < Ef PAST PERFECTIVE Ef < R < S PRESENT NONE E =R=S PRESENT PROGRESSIVE Es < R = S < Ef PRESENT PERFECTIVE Ef < R = S FUTURE NONE S<R=E FUTURE PROGRESSIVE S < R < Ef , Es < R FUTURE PERFECTIVE S < Es < Ef < R Table : TimeML tense/aspect combinations, in terms of the Reichenbach framework. Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 17. Relation ambiguity The target for validation: temporal relations Follow Allen’s relation set of 14 3 Our Reichenbach triples underspecific for the precise interval relation. E.g.: If E1 is simple past and E2 simple future Tense suggests that E1 starts before E2 There are many Allen interval relation types for this - before, during, includes 3. Allen, J. 1983 “Maintaining Knowledge about Temporal Intervals” Comm. ACM 26 (11) Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 18. TimeML relation disjunctions Solution: use disjunctions Use Reichenbach to just constrain the relation type Tense suggests that E1 starts before E2 The available Allen relation types for E1 / E2 are: before, ibefore, during, ended by and includes. Any one of these relation types is a valid response. Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 19. Freksa’s Semi-Intervals Surprise Observation Slide! Build set of Allen disjunctions from all possible combs. of R’bach triples that come from TimeML tenses Identical to groups in Freska’s semi-interval algebra 4 X is older than Y X [before, ibefore, ended by, in- Y is younger than X cludes, during] Y – which was designed for annotating natural language (are Freksa relations more appropriate than Allen’s, for this task?) 4. Freksa, C. 1992 “Temporal reasoning based on semi- intervals” AI 54 (1) Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 20. Recap So: now we can Map TimeML verb events into Reichenbach triples Temporally relate Reichenbach verb events Map Reichenbach event relations back to TimeML Which pairs of verbs, e.g. which temporally related events to choose? Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 21. Temporal context TLINK requirements: Event-Event; PoS = verb; same temporal context Reichenbach unclear – “sequence of tenses” Possible for expert annotator to label We prefer an automatic method! Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 22. Context modelling Need to model context Perhaps proximity in text can hint at relatedness? same sentence same or adjacent sent Same S R order R should be in same place S shouldn’t move Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 23. Results For TLINKs with event verb arguments in the same context What proportion have relation types within constraints of R’bach’s framework? Context model TLINKs Consistent None (all pairs) 1 167 81.5% Same sentence, same SR 300 88.0% Same sentence 600 71.2% Same / adjacent sentence, same SR 566 91.9% Same / adjacent sentence 913 78.3% Table : Consistency of relation types suggested by Reichenbach’s framework with ground-truth. Relation type IAA: 0.77 Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 24. Super-stringent results Sometimes no constraint is possible e.g. “Jack went to school, Jill went to the circus” What if we exclude these? Context model Non-“all” TLINKs Consistent None (all pairs) 481 55.1% Same sentence, same SR 95 62.1% Same sentence 346 50.0% Same / adjacent sentence, same SR 143 67.8% Same / adjacent sentence 422 53.1% Table : Consistency of relation types suggested by Reichenbach’s framework with ground-truth. Relation type IAA: 0.77 Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 25. Summary What have we done? Extended Reichenbach’s framework to account for progressive Described a mapping between R’bach and TimeML Applied this to event-event relations Finding: Reichenbach’s framework appropriately constrains TimeML relation type The model is not contradicted by data, but in fact supported Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 26. Comments Temporal annotation is hard for humans, which gives machines problems New problem: temporal context Are the Allen full-interval relations over-specific for linguistic annotation? Annotation of Reichenbach in TimeML 5 5. Derczynski, Gaizauskas. 2011 “An Annotation Scheme for Reichenbach’s Verbal Tense Structure” in Proc. ISA-6 Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 27. Thank you Thank you! Are there any questions? Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework
  • 28. Events and Times How else can we use the model? Positional use Sets R to equal a timex (At 10p.m. I had showered) Select event-time relations using dependency parses Only consider cases where the event and time are linguistically connected Add a feature hinting at the ordering We reach 75% accuracy from a 66% baseline Also used for timex standard transduction 6 6. Derczynski et al. 2012 “Massively increasing TIMEX3 resources: a transduction approach” in Proc. LREC Leon Derczynski and Robert Gaizauskas University of Sheffield Empirical Validation of Reichenbach’s Tense Framework