SlideShare una empresa de Scribd logo
1 de 91
eDictor:(a chronology)
eDictor:(a chronology)
Roundtable: e-dictor, Advances and Perspectives.
Workshop: Construction and use
of large annotated corpora
Campinas, Sept. 9, 2013.
2004-2006
2004-2006
Preliminary Ideas
The preliminary ideas that would result
in the development of eDictor in 2007
started in 2004 with a project that aimed
at restructuring the text-preparation
system at the Tycho Brahe Corpus.
>
2004-2006
http://www.ime.usp.br/~tycho/participants/psousa/memorias/index.html
PAIXÃO DE SOUSA, M.C. Memórias do Texto: Aspectos tecnológicos
na construção de um corpus histórico do português. Post-doc Research
Project, 2004-2007. Unicamp/Fapesp.
Essentially, the idea was that the Corpus
would be constituted of
single-source documents
that could contain all relevant annotations
(textual, philological, linguistic).
>
2004-2006
This was achieved in partnership with
computer scientist Thorsten Trippel, from the
University of Bielefeld.
He suggested we used the XML annotation
language to re-encode the Corpus, and XSLT to
transform each document into different
presentations of the encoded information.
>
2004-2006
Our central idea was to encapsulate edition
interferences at the word level, i.e. for each
token in the corpus – so that each element of
the pair would be available to different modules
of analysis.
>
2004-2006
This first idea was applied to a few pilot texts, and
published as a poster at the annual conference of the
ALLC in 2004
PAIXÃO DE SOUSA, M. C.; TRIPPEL, T. Single source process
Historic corpora for diverse uses.
In: Proceedings of the Association for Literary and Linguistic
Computing (ALLC) Annual Conference, 2004.
>
2004-2006
In 2005, the Corpus went through a complete
re-encoding process.
2004-2006
>
The restructured Corpus was composed
of XML documents that, via XSLT
transformations, would render different
(HTML and TXT) versions, adequate
for different visualization and processing needs,
as we had originally planned.
>
2004-2006
The
Tycho Brahe
Corpus,
restructured
(XML base)
2004-2006
The Tycho Brahe Corpus, restructured
(“catalogue” view)
The Tycho Brahe Corpus, restructured
(“original” view)
The Tycho Brahe Corpus, restructured
(“modernized” view)
The Tycho Brahe Corpus, restructured (simple text for further processing)
[ prologue (author: P.M. Gandavo)]
[ title: AO MUITO ILUSTRE SENHOR DOM LIONIS PEREIRA, Epístola de Pero de Magalhães. ]
[g_008_s_43] Neste pequeno serviço (muito ilustre senhor ) que ofereço a Vossa Mercê das primícias de meu
fraco entendimento, poderá em alguma maneira conhecer os desejos que tenho de pagar com minha
possibilidade alguma parte do muito que se deve à ínclita fama de vosso heróico nome.
[g_008_s_44] E isto assim pelo merecimento do nobilíssimo sangue e clara progênie de onde traz sua origem,
como pelos troféus das grandes vitórias , e casos bem afortunados que lhe hão sucedido nessas partes do
Oriente em que Deus o quis favorecer com tão larga mão, que não cuido ser toda minha vida bastante para
satisfazer à menor parte de seus louvores .
[g_008_s_45] E como todas estas razões me ponham em tanta obrigação , e eu entenda que outra nenhuma
coisa deve ser mais aceita a pessoas de altos ânimos que a lição das escrituras , por cujos meios se alcançam
os segredos de todas as ciências , e os homens vêm a ilustrar seus nomes e perpetuar os na terra com fama
imortal , determinei escolher a Vossa Mercê entre os mais senhores da terra , e dedicar lhe esta breve história .
[g_008_s_46] A qual espero que folgue de ver com atenção e receber me a benignamente debaixo de seu
amparo : assim por ser coisa nova , e eu a escrever como testemunha de vista : como por saber quão particular
afeição Vossa Mercê tem às coisas do engenho , e que por esta causa lhe não será menos aceito o exercício das
escrituras , que o das armas.
[g_008_s_47] Por onde com muita razão favorecido desta confiança possa seguramente sair a luz com esta
pequena empresa e divulgar a pela terra sem nenhum receio , tendo por defensor dela a Vossa Mercê Cuja muito
ilustre pessoa nosso Senhor guarde e acrescente sua vida e estado por longos e felizes anos .
[ end prologue ]
Along with the application of the new single-
source system to the Corpus, new ideas started
to pop up.
Some of them were carried on, some were not.
2004-2006
>
The main thing that we wanted to do back then
and still have not done is...
... to integrate syntactic annotation
into this same, single-source system...
2004-2006
>
Other ideas were a little more fruitful: the
integration of other, less complex levels of
linguistic annotation (such as items of
lexicological interest); and the expansion of the
system to include the possibility of critical
editions, in which more than one version of the
same text could be compared.
2004-2006
>
PAIXÃO DE SOUSA, M. C. A Anotação da variação de grafia no Corpus
Histórico do Português Tycho Brahe: Frentes abertas para estudos do léxico. V
Encontro de Corpora: Lingüística de Corpus: a aplicabilidade nos estudos sobre
Léxico, São Carlos, 2005.
PAIXÃO DE SOUSA, M. C. Memórias do Texto. Mesa-redonda “Bibliotecas e bancos de
dados digitais de literatura”, II Simpósio Nacional de Literatura e Informática,
Florianópolis, 2005.
Published in 2006 as:
PAIXÃO DE SOUSA, M. C. Memórias do Texto. Texto Digital (UERJ), v. 1, p. 10, 2006.
PAIXÃO DE SOUSA, M. C. Critical Hipereditions and the new challenges for text-
critique. Seminário Internacional Literaturas: Del texto al hipertexto. Madri, Universidade
Complutense, setembro de 2006.
Published in 2007 as:
PAIXÃO DE SOUSA, M. C. Digital Text: Conceptual and methodological frontiers. In: Dolores
Romero; Amelia Sanz. (Org.). Literatures in the Digital Era: Theory and Praxis. Cambridge:
Cambridge Scholarly, 2007.
By 2006 the single-source encoding system was
mature; a first manual was prepared and a more
complete paper on these results was published.
>
2004-2006
http://www.ime.usp.br/~tycho/participants/psousa/memorias/critical_hyper/ece_Frameset.html
Electronic Editions and Tycho Brahe Text Preparation Manual
June 2006
TRIPPEL, T.; PAIXÃO DE SOUSA, M. C. Metadata and XML standards
at work: a corpus repository of Historical Portuguese texts. V
International Conference on Language Resources and Evaluation (LREC),
2006.
TRIPPEL, T.; PAIXÃO DE SOUSA, M. C. Metadata and XML standards
at work: a corpus repository of Historical Portuguese texts. V
International Conference on Language Resources and Evaluation (LREC),
2006.
Meanwhile...
... as the system was presented to a wider range
of potential users outside Tycho Brahe,
new challenges emerged.
>
2004-2006
I Oficina de Anotação – Projeto CorPorA.
Salvador, 19-21 de abril, 2006.
The 1st annotation workshop outside the Tycho
Brahe team, in 2006 in Salvador, was an
important breakthrough.
It was then that we noticed that the original
techniques used to annotate the XML
documents (“by hand”, in E-Macs) and to
transform them (by coding XSL into the system
via Saxon) was not adequate for teams with a less
computational, and more philological
background.
>
2004-2006
I Oficina de Anotação – Projeto CorPorA.
Salvador, 19-21 de abril, 2006.
After the workshop in 2006 it became clear that
if we wanted more teams to use the single-
source annotation system, we would have to
build a software that could perform the
annotation and transformation tasks in a
user-friendly interface.
In other words... it was then that the idea of
eDictor took shape.
>
2004-2006
2007
2007
eDictor is launched!
eDictor beta 1.0 was developed in 2007 by
Prof. Fabio N. Kepler (then a post-
graduate student at IME-USP’s computer science
program), and was first presented in the same
year at the VI Encontro de Linguística
de Corpus, at USP.
2007
>
PAIXÃO DE SOUSA, M. C.; KEPLER, F. N. E-dictor: uma
ferramenta integrada para a anotação de edição e classe de
palavras. VI Encontro de Lingüística de Corpus, São Paulo, 2007.
2007
This first version of eDictor
contained the core functions
of the original text encoding system:
an XML annotation module
and the possibility of XSLT
transformation exportation.
>
2007
Plus... it included a
morphosyntactic tagging function!
This first version of eDictor
contained the core functions
of the original text encoding system:
an XML annotation module
and the possibility of XSLT
transformation exportation.
>
Interface of eDictor 1.0 beta 01
2008-2012
2008-2012
years of growing into new uses
Two important aspects mark the years
2008 to 2012 for the development of eDictor.
The first was the arrival of a new team member,
Pablo P. F. Faria, who joined F. Kepler in
developing the software after the first version.
>
2008-2012
The second important aspect was that, while
up to 2008 the main application of the single-
source system (first manually and later with
eDictor) was the restructuring of the Tycho
Brahe Corpus, after 2008 the system started to
be used beyond Tycho Brahe.
>
2008-2012
>
2008-2012
This was important because, as the different
projects have different aims, the tool started to
include new technical aspects.
The second important aspect was that, while up to
2008 the main application of the single-source
system (first manually and later with eDictor)
was the restructuring of the Tycho Brahe
Corpus, after 2008 the system started to be
used beyond Tycho Brahe.
> For instance, in 2009 eDictor started to be used
by the Brasiliana USP team.
One of the main particularities of this context
was that eDictor was used as a corrector for
automatic character recognition (OCR)
– and new edition categories had to be created.
2008-2012
PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros
experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP,
2009.
PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. P. F. O Processamento
automático de textos antigos: Desafios e Experiências. Workshop de Linguística de Corpus
do Projeto Para a História do Português Brasileiro (PHPB), São Paulo, 2010.
PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros
experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP,
2009.
PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros
experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP,
2009.
(Abbyy Finereader 10.0 training module)
<w id="s_6#86">
<o> amiſjade</o>
<e t="ocr">amiſſade</e>
<e t="gra">amissade</e>
<e t="mod">amizade </e>
<m v="N"/>
</w>
PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros
experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP,
2009.
> One important consequence for eDictor was
the possibility of adding new edition categories
to the tools Preference archive.
> Some of these developments were presented
at the VIII Encontro de Linguística
de Corpus in 2009 by Pablo Faria; this
presentation would be published as a book
chapter in 2010.
PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. E-dictor: Novas
perspectivas na codificação e edição de corpora de textos históricos. In:
VIII Encontro de Linguística de Corpus, 2009, Rio de Janeiro. 2009.
Interface of eDictor in 2009 – Edition Module
Example of changes after 1.0 beta 001:
Edition Tab – “edition” became an open category
> More importantly, researchers that used
manuscript documents became interested in
eDictor.
The special needs of this kind of material led
to very important developments in the tool.
2008-2012
> The first group of manuscript documents to
be worked with the tool was the corpus of
XIXth century letters from the PhD thesis of
Zenaide Carneiro (2005) – now part of the
corpus CEDOH.
The edition of this corpus in XML had been
idealized at the time of the 2006 workshop in
Salvador - and from the start, it brought to
the development of eDictor the challenge of
dealing with particular categories and edition
needs of manuscripts.
2008-2012
> One important example of developments
brought by the needs of manuscript editors
are the fac-simile view functionalities.
They were developed by Pablo Faria after
eDictor started to be used by the team at
CEDOH and by the team lead by Celia
Lopes at LaborHistórico, at UFRJ.
2008-2012
The CEDOH corpus, with integrated fac-simile view of
manuscripts.>
The CEDOH corpus, with integrated fac-simile view of manuscripts.
This new exporting format - Hypertext with fac-
simile view – was integrated in later versions of
eDictor, and is currently used by other projects.
LaborHistorico – Laboratório para a História do Português Brasileiro,
Universidade Federal do Rio de Janeiro. Coord. Célia Lopes
Workshop: “Edição Digital e Divulgação de Textos Antigos”,
Rio de Janeiro, 3-5 de fevereiro, 2010.
The corpus at LaborHistorico,
with integrated fac-simile view of manuscripts.>
> The corpus at LaborHistorico,
with integrated fac-simile view of manuscripts.
> The workshops with the new teams of
users, organized between 2010-2012,
resulted in the development of new builds
for eDictor beta 1.0 – and also, thanks to
the expansion in the number of users,
in 2010 we finally got to make a
manual...
2008-2012
First Version of eDictor’s Manual (2010)
First Version of eDictor’s Manual (2010)
(... actually, the only version so far)
> As a result of this
expansion, between
2009 and 2012
ten builds of eDictor
beta 1.0 were made,
reflecting the additions
that were pointed out as
necessary by the
different user teams.
2008-2012
Two important publications were prepared
during this period: a poster session at the
ALC meeting of 2010, presented by P. Faria,
and the chapter for the book “Caminhos da
Linguística de Corpus”.
In these papers we tried to cover the
backgound on eDictor’s creation, the new
developments, and the challenges ahead.
2008-2012
>
FARIA, P. P. F.; PAIXÃO DE SOUSA, M. C.; KEPLER, F. N. An Integrated Tool for
Annotating Historical Corpora. The Fourth Linguistic Annotation Workshop (LAW IV) at
The 48th Annual Meeting of the Association for Computational Linguistics (ALC 2010),
Uppsala, 2010.
PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. E-dictor: Novas
perspectivas na codificação e edição de corpora de textos históricos. In: Tania
Shepherd; Tony Berber Sardinha; Marcia Veirano Pinto. (Org.). Caminhos da
linguística de corpus. Campinas: Mercado de Letras, 2010.
2013
2013
and now, what?
> eDictor 1.0 beta build 010 is the current
version under use. The main differences
in comparison to beta 001 are the
additions related to fac-simile
integration (in transcription module
and in export functionalities) and some
bug-fixing in the editions module.
But there are still bugs to be busted!
2013
Interface of eDictor 1.0 beta b010
Interface of eDictor 1.0 beta b010
2013
> In the end of 2012, a new, web-based
version of eDictor was idealized by Luiz
Veronesi, and is currently under
construction
Web-based version of eDictor,
under construction
by Luiz Veronezi
Version 1.0 beta b010 of eDictor is currently being used
by seven projects in Brazil and in Portugal
>
Corpus Anotado do Português Tycho Brahe
(Universidade Estadual de Campinas)
Grupo de Pesquisas Humanidades Digitais
(Universidade de São Paulo)
Laboratório de História do Português Brasileiro
(Universidade Federal do Rio de Janeiro)
P.S. – Projeto Arquivo Digital de Escrita Quotidiana em Portugal e Espanha na Época Moderna
(Universidade de Lisboa)
Corpus Eletrônico de Documentos Históricos do Sertão, CEDOHS
(Universidade Federal de Feira de Santana)
Memória Conquistense
(Universidade Estadual do Sudoeste da Bahia)
> Version 1.0 beta b010 of eDictor is currently being used
by seven projects in Brazil and in Portugal
There is still a lot to be done
if we want to make eDictor
a stable and fully transferrable
tool.
but of course ...>
The spirit of this tool has been one of
growing into the users’ needs and
requests. It will become a better
tool if we work together on what
we want it to be.
>
So we are very excited
about this workshop!
>
So we are very excited
about this workshop!
Here’s one idea of
how we could work:
>
We are launching today (09/09/2013) a new webpage for eDictor, at
http://manualedictor.wordpress.com/.
We are launching today (09/09/2013) a new webpage for eDictor, at
http://manualedictor.wordpress.com/.
We could use these days at the workshop
to build more documentation and group it on the page.
That was it.
That was it.
Thank you!
That was it.
Thank you!
Universidade de São Paulo
Maria Clara Paixão de Sousa
mariaclara@usp.br
eDictor:•(a chronology)
Roundtable: e-dictor, Advances and Perspectives.
Workshop: Construction and use
of large annotated corpora
Campinas, Sept. 9, 2013.
Roundtable: e-dictor, Advances and Perspectives.
Workshop: Construction and use
of large annotated corpora
Campinas, Sept. 9, 2013.

Más contenido relacionado

Similar a 2013 e dictor_a_chronology

USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT ecij
 
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT ecij
 
OpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allOpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allAlexandre Rademaker
 
2011 Pharo Roadmap explained
2011 Pharo Roadmap explained2011 Pharo Roadmap explained
2011 Pharo Roadmap explainedPharo
 
Baroque Music Essay Conclusion. Online assignment writing service.
Baroque Music Essay Conclusion. Online assignment writing service.Baroque Music Essay Conclusion. Online assignment writing service.
Baroque Music Essay Conclusion. Online assignment writing service.Alexandra Romero
 
Logics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingLogics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingValeria de Paiva
 
A Little Smalltalk.pdf
A Little Smalltalk.pdfA Little Smalltalk.pdf
A Little Smalltalk.pdfssuser0d34762
 
Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Shawn Day
 
Organization of Patent as Open Source Software based Open Access Repository Item
Organization of Patent as Open Source Software based Open Access Repository ItemOrganization of Patent as Open Source Software based Open Access Repository Item
Organization of Patent as Open Source Software based Open Access Repository ItemMoumita Ash
 
Applying Linked Open Data to a digital library: best practices and lessons le...
Applying Linked Open Data to a digital library: best practices and lessons le...Applying Linked Open Data to a digital library: best practices and lessons le...
Applying Linked Open Data to a digital library: best practices and lessons le...IMPACT Centre of Competence
 
Poster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsPoster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsBecky Yoose
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017openminted_eu
 
Collaborations with Collection Holding Institutions
Collaborations with Collection Holding InstitutionsCollaborations with Collection Holding Institutions
Collaborations with Collection Holding InstitutionsParthenos
 
Good Behavior Essay In English
Good Behavior Essay In EnglishGood Behavior Essay In English
Good Behavior Essay In EnglishEmily Garcia
 
Hub Innovations Spaceforall 2009
Hub Innovations Spaceforall 2009Hub Innovations Spaceforall 2009
Hub Innovations Spaceforall 2009Jane Stevenson
 
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...Mining, Representation and Reasoning with Temporal Expressions in the Legal D...
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...María Navas Loro
 
BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - PragueChris Freeland
 
Henry Iii Fine Rolls Project
Henry Iii Fine Rolls ProjectHenry Iii Fine Rolls Project
Henry Iii Fine Rolls ProjectMatteo Starri
 

Similar a 2013 e dictor_a_chronology (20)

USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
 
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
USING MACHINE LEARNING TO BUILD A SEMI-INTELLIGENT BOT
 
OpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for allOpenWN-PT: a Brazilian Wordnet for all
OpenWN-PT: a Brazilian Wordnet for all
 
2011 Pharo Roadmap explained
2011 Pharo Roadmap explained2011 Pharo Roadmap explained
2011 Pharo Roadmap explained
 
Baroque Music Essay Conclusion. Online assignment writing service.
Baroque Music Essay Conclusion. Online assignment writing service.Baroque Music Essay Conclusion. Online assignment writing service.
Baroque Music Essay Conclusion. Online assignment writing service.
 
Logics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese UnderstandingLogics and Ontologies for Portuguese Understanding
Logics and Ontologies for Portuguese Understanding
 
A Little Smalltalk.pdf
A Little Smalltalk.pdfA Little Smalltalk.pdf
A Little Smalltalk.pdf
 
Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?Does DH Scholarship Take Place in the Lab?
Does DH Scholarship Take Place in the Lab?
 
Socializing and disseminating the academic and intellectual creation: experie...
Socializing and disseminating the academic and intellectual creation: experie...Socializing and disseminating the academic and intellectual creation: experie...
Socializing and disseminating the academic and intellectual creation: experie...
 
Organization of Patent as Open Source Software based Open Access Repository Item
Organization of Patent as Open Source Software based Open Access Repository ItemOrganization of Patent as Open Source Software based Open Access Repository Item
Organization of Patent as Open Source Software based Open Access Repository Item
 
Applying Linked Open Data to a digital library: best practices and lessons le...
Applying Linked Open Data to a digital library: best practices and lessons le...Applying Linked Open Data to a digital library: best practices and lessons le...
Applying Linked Open Data to a digital library: best practices and lessons le...
 
Poster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History CollectionsPoster: Using Open Source Tools to Improve Access to Oral History Collections
Poster: Using Open Source Tools to Improve Access to Oral History Collections
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017
 
Collaborations with Collection Holding Institutions
Collaborations with Collection Holding InstitutionsCollaborations with Collection Holding Institutions
Collaborations with Collection Holding Institutions
 
Good Behavior Essay In English
Good Behavior Essay In EnglishGood Behavior Essay In English
Good Behavior Essay In English
 
Hub Innovations Spaceforall 2009
Hub Innovations Spaceforall 2009Hub Innovations Spaceforall 2009
Hub Innovations Spaceforall 2009
 
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...Mining, Representation and Reasoning with Temporal Expressions in the Legal D...
Mining, Representation and Reasoning with Temporal Expressions in the Legal D...
 
CALICO 2010 Workshop
CALICO 2010  Workshop CALICO 2010  Workshop
CALICO 2010 Workshop
 
BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - Prague
 
Henry Iii Fine Rolls Project
Henry Iii Fine Rolls ProjectHenry Iii Fine Rolls Project
Henry Iii Fine Rolls Project
 

Último

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Último (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

2013 e dictor_a_chronology

  • 2. eDictor:(a chronology) Roundtable: e-dictor, Advances and Perspectives. Workshop: Construction and use of large annotated corpora Campinas, Sept. 9, 2013.
  • 5. The preliminary ideas that would result in the development of eDictor in 2007 started in 2004 with a project that aimed at restructuring the text-preparation system at the Tycho Brahe Corpus. > 2004-2006
  • 6. http://www.ime.usp.br/~tycho/participants/psousa/memorias/index.html PAIXÃO DE SOUSA, M.C. Memórias do Texto: Aspectos tecnológicos na construção de um corpus histórico do português. Post-doc Research Project, 2004-2007. Unicamp/Fapesp.
  • 7. Essentially, the idea was that the Corpus would be constituted of single-source documents that could contain all relevant annotations (textual, philological, linguistic). > 2004-2006
  • 8. This was achieved in partnership with computer scientist Thorsten Trippel, from the University of Bielefeld. He suggested we used the XML annotation language to re-encode the Corpus, and XSLT to transform each document into different presentations of the encoded information. > 2004-2006
  • 9. Our central idea was to encapsulate edition interferences at the word level, i.e. for each token in the corpus – so that each element of the pair would be available to different modules of analysis. > 2004-2006
  • 10. This first idea was applied to a few pilot texts, and published as a poster at the annual conference of the ALLC in 2004 PAIXÃO DE SOUSA, M. C.; TRIPPEL, T. Single source process Historic corpora for diverse uses. In: Proceedings of the Association for Literary and Linguistic Computing (ALLC) Annual Conference, 2004. > 2004-2006
  • 11. In 2005, the Corpus went through a complete re-encoding process. 2004-2006 >
  • 12. The restructured Corpus was composed of XML documents that, via XSLT transformations, would render different (HTML and TXT) versions, adequate for different visualization and processing needs, as we had originally planned. > 2004-2006
  • 14. The Tycho Brahe Corpus, restructured (“catalogue” view)
  • 15. The Tycho Brahe Corpus, restructured (“original” view)
  • 16. The Tycho Brahe Corpus, restructured (“modernized” view)
  • 17. The Tycho Brahe Corpus, restructured (simple text for further processing) [ prologue (author: P.M. Gandavo)] [ title: AO MUITO ILUSTRE SENHOR DOM LIONIS PEREIRA, Epístola de Pero de Magalhães. ] [g_008_s_43] Neste pequeno serviço (muito ilustre senhor ) que ofereço a Vossa Mercê das primícias de meu fraco entendimento, poderá em alguma maneira conhecer os desejos que tenho de pagar com minha possibilidade alguma parte do muito que se deve à ínclita fama de vosso heróico nome. [g_008_s_44] E isto assim pelo merecimento do nobilíssimo sangue e clara progênie de onde traz sua origem, como pelos troféus das grandes vitórias , e casos bem afortunados que lhe hão sucedido nessas partes do Oriente em que Deus o quis favorecer com tão larga mão, que não cuido ser toda minha vida bastante para satisfazer à menor parte de seus louvores . [g_008_s_45] E como todas estas razões me ponham em tanta obrigação , e eu entenda que outra nenhuma coisa deve ser mais aceita a pessoas de altos ânimos que a lição das escrituras , por cujos meios se alcançam os segredos de todas as ciências , e os homens vêm a ilustrar seus nomes e perpetuar os na terra com fama imortal , determinei escolher a Vossa Mercê entre os mais senhores da terra , e dedicar lhe esta breve história . [g_008_s_46] A qual espero que folgue de ver com atenção e receber me a benignamente debaixo de seu amparo : assim por ser coisa nova , e eu a escrever como testemunha de vista : como por saber quão particular afeição Vossa Mercê tem às coisas do engenho , e que por esta causa lhe não será menos aceito o exercício das escrituras , que o das armas. [g_008_s_47] Por onde com muita razão favorecido desta confiança possa seguramente sair a luz com esta pequena empresa e divulgar a pela terra sem nenhum receio , tendo por defensor dela a Vossa Mercê Cuja muito ilustre pessoa nosso Senhor guarde e acrescente sua vida e estado por longos e felizes anos . [ end prologue ]
  • 18. Along with the application of the new single- source system to the Corpus, new ideas started to pop up. Some of them were carried on, some were not. 2004-2006 >
  • 19. The main thing that we wanted to do back then and still have not done is... ... to integrate syntactic annotation into this same, single-source system... 2004-2006 >
  • 20. Other ideas were a little more fruitful: the integration of other, less complex levels of linguistic annotation (such as items of lexicological interest); and the expansion of the system to include the possibility of critical editions, in which more than one version of the same text could be compared. 2004-2006 >
  • 21. PAIXÃO DE SOUSA, M. C. A Anotação da variação de grafia no Corpus Histórico do Português Tycho Brahe: Frentes abertas para estudos do léxico. V Encontro de Corpora: Lingüística de Corpus: a aplicabilidade nos estudos sobre Léxico, São Carlos, 2005.
  • 22. PAIXÃO DE SOUSA, M. C. Memórias do Texto. Mesa-redonda “Bibliotecas e bancos de dados digitais de literatura”, II Simpósio Nacional de Literatura e Informática, Florianópolis, 2005. Published in 2006 as: PAIXÃO DE SOUSA, M. C. Memórias do Texto. Texto Digital (UERJ), v. 1, p. 10, 2006.
  • 23. PAIXÃO DE SOUSA, M. C. Critical Hipereditions and the new challenges for text- critique. Seminário Internacional Literaturas: Del texto al hipertexto. Madri, Universidade Complutense, setembro de 2006. Published in 2007 as: PAIXÃO DE SOUSA, M. C. Digital Text: Conceptual and methodological frontiers. In: Dolores Romero; Amelia Sanz. (Org.). Literatures in the Digital Era: Theory and Praxis. Cambridge: Cambridge Scholarly, 2007.
  • 24. By 2006 the single-source encoding system was mature; a first manual was prepared and a more complete paper on these results was published. > 2004-2006
  • 26. TRIPPEL, T.; PAIXÃO DE SOUSA, M. C. Metadata and XML standards at work: a corpus repository of Historical Portuguese texts. V International Conference on Language Resources and Evaluation (LREC), 2006.
  • 27. TRIPPEL, T.; PAIXÃO DE SOUSA, M. C. Metadata and XML standards at work: a corpus repository of Historical Portuguese texts. V International Conference on Language Resources and Evaluation (LREC), 2006.
  • 28. Meanwhile... ... as the system was presented to a wider range of potential users outside Tycho Brahe, new challenges emerged. > 2004-2006
  • 29. I Oficina de Anotação – Projeto CorPorA. Salvador, 19-21 de abril, 2006.
  • 30. The 1st annotation workshop outside the Tycho Brahe team, in 2006 in Salvador, was an important breakthrough. It was then that we noticed that the original techniques used to annotate the XML documents (“by hand”, in E-Macs) and to transform them (by coding XSL into the system via Saxon) was not adequate for teams with a less computational, and more philological background. > 2004-2006
  • 31. I Oficina de Anotação – Projeto CorPorA. Salvador, 19-21 de abril, 2006.
  • 32. After the workshop in 2006 it became clear that if we wanted more teams to use the single- source annotation system, we would have to build a software that could perform the annotation and transformation tasks in a user-friendly interface. In other words... it was then that the idea of eDictor took shape. > 2004-2006
  • 33. 2007
  • 35. eDictor beta 1.0 was developed in 2007 by Prof. Fabio N. Kepler (then a post- graduate student at IME-USP’s computer science program), and was first presented in the same year at the VI Encontro de Linguística de Corpus, at USP. 2007 >
  • 36. PAIXÃO DE SOUSA, M. C.; KEPLER, F. N. E-dictor: uma ferramenta integrada para a anotação de edição e classe de palavras. VI Encontro de Lingüística de Corpus, São Paulo, 2007.
  • 37. 2007 This first version of eDictor contained the core functions of the original text encoding system: an XML annotation module and the possibility of XSLT transformation exportation. >
  • 38. 2007 Plus... it included a morphosyntactic tagging function! This first version of eDictor contained the core functions of the original text encoding system: an XML annotation module and the possibility of XSLT transformation exportation. >
  • 39. Interface of eDictor 1.0 beta 01
  • 41. 2008-2012 years of growing into new uses
  • 42. Two important aspects mark the years 2008 to 2012 for the development of eDictor. The first was the arrival of a new team member, Pablo P. F. Faria, who joined F. Kepler in developing the software after the first version. > 2008-2012
  • 43. The second important aspect was that, while up to 2008 the main application of the single- source system (first manually and later with eDictor) was the restructuring of the Tycho Brahe Corpus, after 2008 the system started to be used beyond Tycho Brahe. > 2008-2012
  • 44. > 2008-2012 This was important because, as the different projects have different aims, the tool started to include new technical aspects. The second important aspect was that, while up to 2008 the main application of the single-source system (first manually and later with eDictor) was the restructuring of the Tycho Brahe Corpus, after 2008 the system started to be used beyond Tycho Brahe.
  • 45. > For instance, in 2009 eDictor started to be used by the Brasiliana USP team. One of the main particularities of this context was that eDictor was used as a corrector for automatic character recognition (OCR) – and new edition categories had to be created. 2008-2012
  • 46. PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP, 2009.
  • 47. PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. P. F. O Processamento automático de textos antigos: Desafios e Experiências. Workshop de Linguística de Corpus do Projeto Para a História do Português Brasileiro (PHPB), São Paulo, 2010.
  • 48. PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP, 2009.
  • 49. PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP, 2009. (Abbyy Finereader 10.0 training module)
  • 50. <w id="s_6#86"> <o> amiſjade</o> <e t="ocr">amiſſade</e> <e t="gra">amissade</e> <e t="mod">amizade </e> <m v="N"/> </w> PAIXÃO DE SOUSA, M. C. Desafios do processamento de textos antigos: primeiros experimentos na Brasiliana Digital . I Workshop de Linguística Computacional da USP, 2009.
  • 51. > One important consequence for eDictor was the possibility of adding new edition categories to the tools Preference archive.
  • 52. > Some of these developments were presented at the VIII Encontro de Linguística de Corpus in 2009 by Pablo Faria; this presentation would be published as a book chapter in 2010.
  • 53. PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. E-dictor: Novas perspectivas na codificação e edição de corpora de textos históricos. In: VIII Encontro de Linguística de Corpus, 2009, Rio de Janeiro. 2009.
  • 54. Interface of eDictor in 2009 – Edition Module
  • 55. Example of changes after 1.0 beta 001: Edition Tab – “edition” became an open category
  • 56. > More importantly, researchers that used manuscript documents became interested in eDictor. The special needs of this kind of material led to very important developments in the tool. 2008-2012
  • 57. > The first group of manuscript documents to be worked with the tool was the corpus of XIXth century letters from the PhD thesis of Zenaide Carneiro (2005) – now part of the corpus CEDOH. The edition of this corpus in XML had been idealized at the time of the 2006 workshop in Salvador - and from the start, it brought to the development of eDictor the challenge of dealing with particular categories and edition needs of manuscripts. 2008-2012
  • 58. > One important example of developments brought by the needs of manuscript editors are the fac-simile view functionalities. They were developed by Pablo Faria after eDictor started to be used by the team at CEDOH and by the team lead by Celia Lopes at LaborHistórico, at UFRJ. 2008-2012
  • 59. The CEDOH corpus, with integrated fac-simile view of manuscripts.>
  • 60. The CEDOH corpus, with integrated fac-simile view of manuscripts.
  • 61. This new exporting format - Hypertext with fac- simile view – was integrated in later versions of eDictor, and is currently used by other projects.
  • 62. LaborHistorico – Laboratório para a História do Português Brasileiro, Universidade Federal do Rio de Janeiro. Coord. Célia Lopes Workshop: “Edição Digital e Divulgação de Textos Antigos”, Rio de Janeiro, 3-5 de fevereiro, 2010.
  • 63. The corpus at LaborHistorico, with integrated fac-simile view of manuscripts.>
  • 64. > The corpus at LaborHistorico, with integrated fac-simile view of manuscripts.
  • 65. > The workshops with the new teams of users, organized between 2010-2012, resulted in the development of new builds for eDictor beta 1.0 – and also, thanks to the expansion in the number of users, in 2010 we finally got to make a manual... 2008-2012
  • 66. First Version of eDictor’s Manual (2010)
  • 67. First Version of eDictor’s Manual (2010) (... actually, the only version so far)
  • 68. > As a result of this expansion, between 2009 and 2012 ten builds of eDictor beta 1.0 were made, reflecting the additions that were pointed out as necessary by the different user teams. 2008-2012
  • 69. Two important publications were prepared during this period: a poster session at the ALC meeting of 2010, presented by P. Faria, and the chapter for the book “Caminhos da Linguística de Corpus”. In these papers we tried to cover the backgound on eDictor’s creation, the new developments, and the challenges ahead. 2008-2012 >
  • 70. FARIA, P. P. F.; PAIXÃO DE SOUSA, M. C.; KEPLER, F. N. An Integrated Tool for Annotating Historical Corpora. The Fourth Linguistic Annotation Workshop (LAW IV) at The 48th Annual Meeting of the Association for Computational Linguistics (ALC 2010), Uppsala, 2010.
  • 71. PAIXÃO DE SOUSA, M. C.; KEPLER, F. N.; FARIA, P. E-dictor: Novas perspectivas na codificação e edição de corpora de textos históricos. In: Tania Shepherd; Tony Berber Sardinha; Marcia Veirano Pinto. (Org.). Caminhos da linguística de corpus. Campinas: Mercado de Letras, 2010.
  • 72. 2013
  • 74. > eDictor 1.0 beta build 010 is the current version under use. The main differences in comparison to beta 001 are the additions related to fac-simile integration (in transcription module and in export functionalities) and some bug-fixing in the editions module. But there are still bugs to be busted! 2013
  • 75. Interface of eDictor 1.0 beta b010
  • 76. Interface of eDictor 1.0 beta b010
  • 77. 2013 > In the end of 2012, a new, web-based version of eDictor was idealized by Luiz Veronesi, and is currently under construction
  • 78. Web-based version of eDictor, under construction by Luiz Veronezi
  • 79. Version 1.0 beta b010 of eDictor is currently being used by seven projects in Brazil and in Portugal >
  • 80. Corpus Anotado do Português Tycho Brahe (Universidade Estadual de Campinas) Grupo de Pesquisas Humanidades Digitais (Universidade de São Paulo) Laboratório de História do Português Brasileiro (Universidade Federal do Rio de Janeiro) P.S. – Projeto Arquivo Digital de Escrita Quotidiana em Portugal e Espanha na Época Moderna (Universidade de Lisboa) Corpus Eletrônico de Documentos Históricos do Sertão, CEDOHS (Universidade Federal de Feira de Santana) Memória Conquistense (Universidade Estadual do Sudoeste da Bahia) > Version 1.0 beta b010 of eDictor is currently being used by seven projects in Brazil and in Portugal
  • 81. There is still a lot to be done if we want to make eDictor a stable and fully transferrable tool. but of course ...>
  • 82. The spirit of this tool has been one of growing into the users’ needs and requests. It will become a better tool if we work together on what we want it to be. >
  • 83. So we are very excited about this workshop! >
  • 84. So we are very excited about this workshop! Here’s one idea of how we could work: >
  • 85. We are launching today (09/09/2013) a new webpage for eDictor, at http://manualedictor.wordpress.com/.
  • 86. We are launching today (09/09/2013) a new webpage for eDictor, at http://manualedictor.wordpress.com/. We could use these days at the workshop to build more documentation and group it on the page.
  • 89. That was it. Thank you! Universidade de São Paulo Maria Clara Paixão de Sousa mariaclara@usp.br
  • 90. eDictor:•(a chronology) Roundtable: e-dictor, Advances and Perspectives. Workshop: Construction and use of large annotated corpora Campinas, Sept. 9, 2013.
  • 91. Roundtable: e-dictor, Advances and Perspectives. Workshop: Construction and use of large annotated corpora Campinas, Sept. 9, 2013.