filannim@cs.man.ac.uk 
School of Computer Science 
presentation 1st AHA! Workshop, COLING 2014 
Dublin, 23/08/2014 
Mining temporal footprints from Wikipedia
Michele Filannino, Goran Nenadic
introduction 
■ Temporal information is crucial for organising structured and unstructured data
■ Several temporal information extraction (TIE) systems are now available
● thanks to the TempEval challenge series
ManTIME 
URL: http://www.cs.man.ac.uk/~filannim/mantime.html 
Test with long text
temporal footprint 
A temporal footprint is a continuous period on the time-line that temporally defines the existence of a particular concept.
Immanuel Kant, Paul Guyer, and Allen W Wood. 1998. Critique of pure reason. Cambridge University Press.
problem 
Can we predict temporal footprints from encyclopaedic descriptions of concepts?
■ input: textual description of a concept
■ output: prediction of a temporal interval
[Figure: Examples of temporal footprints. Time-line from 1000 to 2000 showing the footprints of Web, Cellphone, Computer, Car, Richard Feynman, Bicycle, Carl Friedrich Gauss, French revolution, Age of Enlightenment, Galileo Galilei, Leonardo Da Vinci, Christopher Columbus, Renaissance, Arming sword, High Middle Ages and Genghis Khan. Legend: Object, Person, Historical period.]
methodology 
1. date mention extraction 
2. outlier filtering 
3. normal distribution fitting 
4. prediction 
date mention extraction
[Figure: frequency of extracted date mentions over time. x-axis: time (in years), 1360 to 1810; y-axis: freq, 0.000 to 0.050.]
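A minimal sketch of a RegEx-style extraction step. It assumes that a date mention is a standalone four-digit year between 1000 and 2099; the pattern, the year range and the names `extract_years` and `year_frequencies` are illustrative assumptions, not the authors' implementation. The second function computes the relative frequency per year, the quantity plotted on the freq axis.

```python
import re
from collections import Counter

# Assumption: a date mention is a standalone four-digit year in 1000-2099.
YEAR_PATTERN = re.compile(r"\b(1\d{3}|20\d{2})\b")


def extract_years(text):
    """Return every year mention found in the text, in order of appearance."""
    return [int(match) for match in YEAR_PATTERN.findall(text)]


def year_frequencies(years):
    """Map each year to its relative frequency among all mentions."""
    counts = Counter(years)
    total = sum(counts.values())
    return {year: count / total for year, count in sorted(counts.items())}


sample = (
    "Galileo was born in 1564, published in 1610 and 1632, "
    "and died in 1642. A 1610 letter survives."
)
years = extract_years(sample)
freqs = year_frequencies(years)
```

Longer digit runs such as phone numbers are skipped because the word boundaries require exactly four digits.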
outlier filtering
[Figure: frequency of date mentions over time with the γ param. marked. x-axis: time (in years), 1360 to 1810; y-axis: freq, 0.000 to 0.050.]
The gamma (γ) parameter controls the boundaries of the outlier region.
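The slide does not state the filtering rule itself, so the sketch below is only one plausible reading: it assumes a median absolute deviation (MAD) rule in which years farther than gamma times the MAD from the median are discarded. The function name `filter_outliers` and the default value of `gamma` are hypothetical.

```python
import statistics


def filter_outliers(years, gamma=3.0):
    """Keep the years within gamma * MAD of the median (assumed rule)."""
    if not years:
        return []
    median = statistics.median(years)
    # Median absolute deviation: a spread estimate that outliers barely move.
    mad = statistics.median(abs(year - median) for year in years)
    return [year for year in years if abs(year - median) <= gamma * mad]


mentions = [1564, 1581, 1589, 1592, 1610, 1616, 1632, 1633, 1642, 1992]
kept = filter_outliers(mentions, gamma=3.0)
```

Under this rule a smaller gamma narrows the accepted region, while a larger one lets more distant mentions through.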
normal distribution fitting
[Figure: a Gaussian bell fitted to the frequency of date mentions, with the α param. and β param. marked. x-axis: time (in years), 1360 to 1810; y-axis: freq, 0.000 to 0.050.]
The alpha (α) and beta (β) parameters control the size and offset of the Gaussian bell.
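A minimal sketch of the fitting and prediction steps under a loudly stated assumption: the slides say only that alpha and beta control the size and offset of the bell, so here alpha is taken to scale the half-width of the predicted interval in standard deviations and beta to shift its centre in years. The function name `predict_footprint` and both defaults are hypothetical.

```python
import statistics


def predict_footprint(years, alpha=1.0, beta=0.0):
    """Fit a normal distribution to the years and return (start, end).

    Assumed reading of the slide: alpha scales the half-width of the
    interval in standard deviations, beta shifts its centre in years.
    """
    mu = statistics.mean(years)
    sigma = statistics.pstdev(years)  # maximum-likelihood estimate of sigma
    centre = mu + beta
    return round(centre - alpha * sigma), round(centre + alpha * sigma)


filtered = [1590, 1600, 1610, 1620, 1630]
interval = predict_footprint(filtered, alpha=2.0)
shifted = predict_footprint(filtered, alpha=2.0, beta=5.0)
```

With this parameterisation the two values can be tuned independently: alpha trades coverage against precision, and beta corrects a systematic shift of the mentions relative to the true footprint.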
error measure
[Figure: gold and predicted intervals on the time-line, with their union and overlap marked.]
Fatima De Carvalho. 1996. Histogrammes et indices de proximité en analyse données symboliques. Actes de l’école d’été sur l’analyse des données symboliques. LISE-CEREMADE, Université de Paris IX Dauphine, pages 101–127.
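The slides show the measure only as a diagram. The sketch below assumes the error is one minus the ratio of overlap to union, with the union taken as the full span covered by the two intervals. That assumption reproduces the E values reported in the results slides for Galileo Galilei, Robin Williams and Christopher Columbus. The function name `distance_error` is hypothetical.

```python
def distance_error(gold, predicted):
    """Return 1 - overlap/union for two (start, end) year intervals."""
    gold_start, gold_end = gold
    pred_start, pred_end = predicted
    # Overlap is zero when the two intervals do not intersect.
    overlap = max(0, min(gold_end, pred_end) - max(gold_start, pred_start))
    # Assumption: the union is the full span covered by both intervals.
    union = max(gold_end, pred_end) - min(gold_start, pred_start)
    if union == 0:
        return 0.0  # both intervals are the same single year
    return 1 - overlap / union


galileo = distance_error((1564, 1642), (1556, 1654))
williams = distance_error((1951, 2014), (1953, 2006))
columbus = distance_error((1451, 1506), (1366, 2057))
```

For Galileo the overlap is 78 years and the union 98, so the error is 1 - 78/98, which rounds to 0.204. A perfect prediction gives 0 and disjoint intervals give 1.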
strategies 
A. RegEx 
B. RegEx + Filtering 
C. RegEx + Filtering + Gaussian fitting 
D. HeidelTime + Filtering + Gaussian fitting 
evaluation 
■ subject: people
■ who lived between 1000 AD and 2014
● text from Wikipedia web pages
● year of birth and death from DBpedia
■ 228,824 people collected
■ simple definition of temporal footprint
● birth and death dates
[Figure: People per textual length. x-axis: #words, 0 to 3750; y-axis: #people, 0 to 500.]
aggregate results 
Strategy                                     Mean Distance Error   Standard Deviation
RegEx                                        0.2636                0.3409
RegEx + Filtering                            0.2596                0.3090
RegEx + Filtering + Gaussian fitting         0.3503                0.2430
HeidelTime + Filtering + Gaussian fitting    0.5980                0.2470
results
[Figure: MDE (0.0 to 1.0) against #words (1112 to 27804) for the four strategies: RegEx; RegEx + Filtering; RegEx + Filtering + Gaussian fitting; HeidelTime + Filtering + Gaussian fitting.]
■ Galileo Galilei (1564-1642), prediction: 1556-1654, E: 0.204
■ Robin Williams (1951-2014), prediction: 1953-2006, E: 0.159
other types of temporal footprint?
■ Christopher Columbus will die in 2057 ?!
Prediction: 1366-2057 (gold: 1451-1506), E: 0.92
AHA!
physical existence vs. social coverage
■ Anne Frank’s footprint is shifted into the future
conclusions
■ How does the methodology behave on different languages? And on different sources?
■ oracle-like side-effect behaviour:
• Apple Inc. will be closed down this year
• Stanford University will be closed down in 2029
■ Future work
• mixture of normal distributions
Thank you.
Questions?
Contact: filannim@cs.man.ac.uk
Visit: tinyurl.com/temporal-footprints
