SlideShare una empresa de Scribd logo
1 de 27
Corpora, Tracked Changes,
and PDFs
Some useful tips at no cost!
The Translation and Localization
Conference 2017
Patricia M. Ferreira Larrieux
EN <> ES <> IT Medical & Technical Translator
Agenda
About me
Purpose of this presentation
Working with corpora: your way to specialized
terminology
Extracting tracked changes
Searching in multiple PDF files at once
The Translation and Localization Conference 2017 2
About me
The Translation and Localization Conference 2017 3
 Born in Uruguay, living in Italy since
1990
 Degree in English<>Spanish Translation
 Ran my own translation company for 7
years
 10 years at Johnson & Johnson (2003-
2013)
 May 2013: returned to freelancing
 Currently freelance medical & technical
EN<>ES<>IT translator
 +300K words translated in 2016
 Member of: CTPU, ITI, ASETRAD,
TREMÉDICA, MET
Purpose of This Presentation
Sharing tips on:
Corpora – how to use BootCat & AntConc
Tracked changes – how to use DocTools
ExtractData
PDFs – searching multiple PDFs with Acrobat
Reader
Note: I am in no way connected with the
respective owners of these software programs!
The Translation and Localization Conference 2017 4
What is a corpus?
The Translation and Localization Conference 2017 5
What is a corpus?
The Translation and Localization Conference 2017 6
Why are Corpora useful for Translators?
They are a great resource for terminology and
phraseology.
Monolingual corpora in the target language have
proved to be an outstanding terminological tool for
specialized translation (Bowker, 1998)
The Translation and Localization Conference 2017 7
Online Corpora
The British National Corpus
http://corpus.byu.edu/bnc/
A collection of English corpora
http://corpus.leeds.ac.uk/protected/query.html
Michigan Corpus of Academic Spoken English
http://quod.lib.umich.edu/cgi/c/corpus/corpus?c=mic
ase;page=simple
The Translation and Localization Conference 2017 8
Online Corpora (cont’d)
Corpus de Referencia del Español Actual (CREA)
http://www.rae.es/recursos/banco-de-datos/crea
Corpora created by Mark Davies, Professor of
Linguistics at Brigham Young University.
http://corpus.byu.edu/corpora.asp
Paisà
http://www.corpusitaliano.it/
The Translation and Localization Conference 2017 9
Building Your Own Corpora:
BootCat Front End
The Translation and Localization Conference 2017 10
BootCat Front End is a free software developed by a group of
linguists from the Universities of Bologna (Forlì Campus),
Trento and Zagreb:
Marco Baroni (Trento) & Silvia Bernardini (Forlì) — wrote the
original scripts
Eros Zanchetta (Forlì) — wrote the BootCaT front-end and the
Bing URL collector, updated a few other scripts and maintains
this website
Nikola Ljubešić (Zagreb) — wrote the BootCaTExtractor
included since version 0.7 of the frontend and version 0.1.8 of
the toolkit.
Cyrus Shaoul (University of Alberta) — contributed the (now
retired) script to collect pages from Yahoo
Building Your Own Corpora:
BootCat Front End
The Translation and Localization Conference 2017 11
Download the app from this link:
http://bootcat.dipintra.it/?section=download
Get a Search Engine Key. See instructions here:
http://bit.ly/SearchEngineKey
Check the online Tutorial:
http://bit.ly/BC_Tutorial
Using BootCat Frontend
The Translation and Localization Conference 2017 12
Using BootCat Frontend
The Translation and Localization Conference 2017 13
AntConc: Exploring Your Corpus
The Translation and Localization Conference 2017 14
A free software developed by Dr. Laurence
Anthony, a Professor in the Faculty of Science
and Engineering at Waseda University, Japan. He
is a former director of the Center for English
Language Education (CELESE) and coordinator
of the CELESE technical English program.
AntConc: Exploring Your Corpus
The Translation and Localization Conference 2017 15
Download the app from this link:
http://www.laurenceanthony.net/software.html
Download the manual from this link:
http://bit.ly/AC_Manual
AntConc: Exploring Your Corpus – The
Concordance Window
The Translation and Localization Conference 2017 16
AntConc: Exploring Your Corpus – The
Collocates Window
The Translation and Localization Conference 2017 17
DocTools: Extracting Tracked Changes
The Translation and Localization Conference 2017 18
DocTools: Extracting Tracked Changes
The Translation and Localization Conference 2017 19
ExtractData: a free word add-in developed by Lene Fredborg
Some highlights from her website: https://wordaddins.com
Established DocTools in 2006.
+20 years working professionally with Word and programming
add-ins and macros in Visual Basic for Applications (VBA)
Developed several add-ins that can function as stand-alone
products
Via her website, she makes add-ins available to Word users in
general
Her motto: “Time-saving tools made for you!”
DocTools: Extracting Tracked Changes
The Translation and Localization Conference 2017 20
A Word add-in that works in Word 2007, Word
2010, Word 2013, and Word 2016 (Windows
only).
Send a request to get the app, and check the
installation instructions, from this link:
http://bit.ly/DocTools_Request
After installation, you will see a new «DocTools»
tab in Word
DocTools: Extracting Tracked Changes
The Translation and Localization Conference 2017 21
Acrobat Reader: Searching in multiple
PDFs
The Translation and Localization Conference 2017 22
Here’s how to proceed:
1) Save all the PDF files where you would like to
search in a single folder.
2) Open one file with Acrobat Reader.
3) Click «Shift+CTRL+F» or choose «Advanced
Search» from the Edit menu.
Acrobat Reader: The Advanced Search
Window
The Translation and Localization Conference 2017 23
4) Select “All PDF Documents in”.
5) Navigate to the folder where you
saved all your files (Step 1).
6) Type the word(s) to search for in the
search box.
Acrobat Reader: The Advanced Search
Window
24
7) When this window pops up, click “Allow”.
8) After a few seconds, your search results will display
in the advanced search window.
9) Click the plus sign (+) to see all results in each file.
10) Click on the result line to jump to the PDF
document.
The Translation and Localization Conference 2017
Acrobat Reader: The Advanced Search
Window
The Translation and Localization Conference 2017 25
9) Click the plus sign (+) to see all
results in each file.
10) Click on the result line to jump to
the PDF document.
Patricia María Ferreira Larrieux
E-mail: patricia.ferreira@language.proz.com
Website: www.pmferreira-larrieux.it
Linkedin profile: https://www.linkedin.com/in/pmferreiralarrieux/
ProZ profile: http://www.proz.com/profile/4437
Twitter: @PFerreiraLarr
The Translation and Localization Conference 2017 26
Thank you!
The Translation and Localization Conference 2017 27

Más contenido relacionado

Destacado

The Skills Cross-over: building a career through science communication
The Skills Cross-over: building a career through science communicationThe Skills Cross-over: building a career through science communication
The Skills Cross-over: building a career through science communicationEsther De Smet
 
Campamentos de verano El Alamo 2017 Madrid
Campamentos de verano El Alamo 2017 MadridCampamentos de verano El Alamo 2017 Madrid
Campamentos de verano El Alamo 2017 MadridVeleta3000
 
今日こそ理解するHot変換
今日こそ理解するHot変換今日こそ理解するHot変換
今日こそ理解するHot変換Yuki Takahashi
 
Inside Developer Relations at AWS
Inside Developer Relations at AWSInside Developer Relations at AWS
Inside Developer Relations at AWSAdam FitzGerald
 
Alla Scoperta di Wikipedia - 25.03.2017
Alla Scoperta di Wikipedia - 25.03.2017Alla Scoperta di Wikipedia - 25.03.2017
Alla Scoperta di Wikipedia - 25.03.2017La Scuola Open Source
 
The top 5 Kubernetes metrics to monitor
The top 5 Kubernetes metrics to monitorThe top 5 Kubernetes metrics to monitor
The top 5 Kubernetes metrics to monitorSysdig
 
Social media privacy and safety
Social media privacy and safetySocial media privacy and safety
Social media privacy and safetySarah K Miller
 
CP3P Public Private Partnerships Training
CP3P Public Private Partnerships TrainingCP3P Public Private Partnerships Training
CP3P Public Private Partnerships TrainingTraining Bytesize
 
Architecting the Digital Enterprise
Architecting the Digital EnterpriseArchitecting the Digital Enterprise
Architecting the Digital EnterpriseNuwan Bandara
 
Gamification and education
Gamification and educationGamification and education
Gamification and educationRoman Rackwitz
 
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015Franck Perrier
 

Destacado (15)

The Skills Cross-over: building a career through science communication
The Skills Cross-over: building a career through science communicationThe Skills Cross-over: building a career through science communication
The Skills Cross-over: building a career through science communication
 
Campamentos de verano El Alamo 2017 Madrid
Campamentos de verano El Alamo 2017 MadridCampamentos de verano El Alamo 2017 Madrid
Campamentos de verano El Alamo 2017 Madrid
 
Kiara collagen serum
Kiara collagen serumKiara collagen serum
Kiara collagen serum
 
今日こそ理解するHot変換
今日こそ理解するHot変換今日こそ理解するHot変換
今日こそ理解するHot変換
 
Inside Developer Relations at AWS
Inside Developer Relations at AWSInside Developer Relations at AWS
Inside Developer Relations at AWS
 
Alla Scoperta di Wikipedia - 25.03.2017
Alla Scoperta di Wikipedia - 25.03.2017Alla Scoperta di Wikipedia - 25.03.2017
Alla Scoperta di Wikipedia - 25.03.2017
 
The top 5 Kubernetes metrics to monitor
The top 5 Kubernetes metrics to monitorThe top 5 Kubernetes metrics to monitor
The top 5 Kubernetes metrics to monitor
 
20160126 université act 5 stratégies observance
20160126 université act 5 stratégies observance20160126 université act 5 stratégies observance
20160126 université act 5 stratégies observance
 
Social media privacy and safety
Social media privacy and safetySocial media privacy and safety
Social media privacy and safety
 
CP3P Public Private Partnerships Training
CP3P Public Private Partnerships TrainingCP3P Public Private Partnerships Training
CP3P Public Private Partnerships Training
 
Architecting the Digital Enterprise
Architecting the Digital EnterpriseArchitecting the Digital Enterprise
Architecting the Digital Enterprise
 
Predictive Analytics as a Product
Predictive Analytics as a Product Predictive Analytics as a Product
Predictive Analytics as a Product
 
Gamification and education
Gamification and educationGamification and education
Gamification and education
 
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015
Baromètre IDAOS Lab "Digital & Social" 3ème édition 2015
 
Fedaia17
Fedaia17Fedaia17
Fedaia17
 

Similar a Corpora, tracked changes, and PDFs: some useful tips, at no cost!

OpenOffice.org around the Globe
OpenOffice.org around the GlobeOpenOffice.org around the Globe
OpenOffice.org around the GlobeAlexandro Colorado
 
Research Tool - End Note
Research Tool - End NoteResearch Tool - End Note
Research Tool - End Noteador
 
2007 acendio portenier_lucien_w_1130
2007 acendio portenier_lucien_w_11302007 acendio portenier_lucien_w_1130
2007 acendio portenier_lucien_w_1130tbnext
 
LRC XIII Localisation Conference - Using community feedback to improve social...
LRC XIII Localisation Conference - Using community feedback to improve social...LRC XIII Localisation Conference - Using community feedback to improve social...
LRC XIII Localisation Conference - Using community feedback to improve social...sarni
 
FINOS June 2018 Members Meeting - Plotting Your Journey in Open Source
FINOS June 2018 Members Meeting - Plotting Your Journey in Open SourceFINOS June 2018 Members Meeting - Plotting Your Journey in Open Source
FINOS June 2018 Members Meeting - Plotting Your Journey in Open SourceFINOS
 
Department information system
Department information systemDepartment information system
Department information systemSUMIT MIshra
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...Andrea Bollini
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...4Science
 
Dictionary project report.docx
Dictionary project report.docxDictionary project report.docx
Dictionary project report.docxkishoreadhikari2
 
Doc.next - The Future of the Documentation Project
Doc.next - The Future of the Documentation ProjectDoc.next - The Future of the Documentation Project
Doc.next - The Future of the Documentation ProjectAlexandro Colorado
 
Sharepoint Document Conversion
Sharepoint Document ConversionSharepoint Document Conversion
Sharepoint Document ConversionColin Gardner
 
Toward FAIR Semantic Resources
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic ResourcesEUDAT
 
Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Gunther Eysenbach
 
Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Gunther Eysenbach
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyVassilis Protonotarios
 

Similar a Corpora, tracked changes, and PDFs: some useful tips, at no cost! (20)

OpenOffice.org around the Globe
OpenOffice.org around the GlobeOpenOffice.org around the Globe
OpenOffice.org around the Globe
 
Research Tool - End Note
Research Tool - End NoteResearch Tool - End Note
Research Tool - End Note
 
2007 acendio portenier_lucien_w_1130
2007 acendio portenier_lucien_w_11302007 acendio portenier_lucien_w_1130
2007 acendio portenier_lucien_w_1130
 
LRC XIII Localisation Conference - Using community feedback to improve social...
LRC XIII Localisation Conference - Using community feedback to improve social...LRC XIII Localisation Conference - Using community feedback to improve social...
LRC XIII Localisation Conference - Using community feedback to improve social...
 
FINOS June 2018 Members Meeting - Plotting Your Journey in Open Source
FINOS June 2018 Members Meeting - Plotting Your Journey in Open SourceFINOS June 2018 Members Meeting - Plotting Your Journey in Open Source
FINOS June 2018 Members Meeting - Plotting Your Journey in Open Source
 
semantify.it
semantify.itsemantify.it
semantify.it
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
 
Department information system
Department information systemDepartment information system
Department information system
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
 
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
COAR Venice 2017 Next Generation Repository Session: What can be done, right ...
 
Knowledge Organization Systems (KOS): Management of Classification Systems in...
Knowledge Organization Systems (KOS): Management of Classification Systems in...Knowledge Organization Systems (KOS): Management of Classification Systems in...
Knowledge Organization Systems (KOS): Management of Classification Systems in...
 
Dictionary project report.docx
Dictionary project report.docxDictionary project report.docx
Dictionary project report.docx
 
Doc.next - The Future of the Documentation Project
Doc.next - The Future of the Documentation ProjectDoc.next - The Future of the Documentation Project
Doc.next - The Future of the Documentation Project
 
Sharepoint Document Conversion
Sharepoint Document ConversionSharepoint Document Conversion
Sharepoint Document Conversion
 
Toward FAIR Semantic Resources
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic Resources
 
Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...
 
Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...Developing of a web-based application to facilitate patient treatment adheren...
Developing of a web-based application to facilitate patient treatment adheren...
 
KOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet OntologyKOS Management - The case of the Organic.Edunet Ontology
KOS Management - The case of the Organic.Edunet Ontology
 
Foss Presentation
Foss PresentationFoss Presentation
Foss Presentation
 
Olf2016
Olf2016Olf2016
Olf2016
 

Último

Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 

Último (20)

Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 

Corpora, tracked changes, and PDFs: some useful tips, at no cost!

  • 1. Corpora, Tracked Changes, and PDFs Some useful tips at no cost! The Translation and Localization Conference 2017 Patricia M. Ferreira Larrieux EN <> ES <> IT Medical & Technical Translator
  • 2. Agenda About me Purpose of this presentation Working with corpora: your way to specialized terminology Extracting tracked changes Searching in multiple PDF files at once The Translation and Localization Conference 2017 2
  • 3. About me The Translation and Localization Conference 2017 3  Born in Uruguay, living in Italy since 1990  Degree in English<>Spanish Translation  Ran my own translation company for 7 years  10 years at Johnson & Johnson (2003- 2013)  May 2013: returned to freelancing  Currently freelance medical & technical EN<>ES<>IT translator  +300K words translated in 2016  Member of: CTPU, ITI, ASETRAD, TREMÉDICA, MET
  • 4. Purpose of This Presentation Sharing tips on: Corpora – how to use BootCat & AntConc Tracked changes – how to use DocTools ExtractData PDFs – searching multiple PDFs with Acrobat Reader Note: I am in no way connected with the respective owners of these software programs! The Translation and Localization Conference 2017 4
  • 5. What is a corpus? The Translation and Localization Conference 2017 5
  • 6. What is a corpus? The Translation and Localization Conference 2017 6
  • 7. Why are Corpora useful for Translators? They are a great resource for terminology and phraseology. Monolingual corpora in the target language have proved to be an outstanding terminological tool for specialized translation (Bowker, 1998) The Translation and Localization Conference 2017 7
  • 8. Online Corpora The British National Corpus http://corpus.byu.edu/bnc/ A collection of English corpora http://corpus.leeds.ac.uk/protected/query.html Michigan Corpus of Academic Spoken English http://quod.lib.umich.edu/cgi/c/corpus/corpus?c=mic ase;page=simple The Translation and Localization Conference 2017 8
  • 9. Online Corpora (cont’d) Corpus de Referencia del Español Actual (CREA) http://www.rae.es/recursos/banco-de-datos/crea Corpora created by Mark Davies, Professor of Linguistics at Brigham Young University. http://corpus.byu.edu/corpora.asp Paisà http://www.corpusitaliano.it/ The Translation and Localization Conference 2017 9
  • 10. Building Your Own Corpora: BootCat Front End The Translation and Localization Conference 2017 10 BootCat Front End is a free software developed by a group of linguists from the Universities of Bologna (Forlì Campus), Trento and Zagreb: Marco Baroni (Trento) & Silvia Bernardini (Forlì) — wrote the original scripts Eros Zanchetta (Forlì) — wrote the BootCaT front-end and the Bing URL collector, updated a few other scripts and maintains this website Nikola Ljubešić (Zagreb) — wrote the BootCaTExtractor included since version 0.7 of the frontend and version 0.1.8 of the toolkit. Cyrus Shaoul (University of Alberta) — contributed the (now retired) script to collect pages from Yahoo
  • 11. Building Your Own Corpora: BootCat Front End The Translation and Localization Conference 2017 11 Download the app from this link: http://bootcat.dipintra.it/?section=download Get a Search Engine Key. See instructions here: http://bit.ly/SearchEngineKey Check the online Tutorial: http://bit.ly/BC_Tutorial
  • 12. Using BootCat Frontend The Translation and Localization Conference 2017 12
  • 13. Using BootCat Frontend The Translation and Localization Conference 2017 13
  • 14. AntConc: Exploring Your Corpus The Translation and Localization Conference 2017 14 A free software developed by Dr. Laurence Anthony, a Professor in the Faculty of Science and Engineering at Waseda University, Japan. He is a former director of the Center for English Language Education (CELESE) and coordinator of the CELESE technical English program.
  • 15. AntConc: Exploring Your Corpus The Translation and Localization Conference 2017 15 Download the app from this link: http://www.laurenceanthony.net/software.html Download the manual from this link: http://bit.ly/AC_Manual
  • 16. AntConc: Exploring Your Corpus – The Concordance Window The Translation and Localization Conference 2017 16
  • 17. AntConc: Exploring Your Corpus – The Collocates Window The Translation and Localization Conference 2017 17
  • 18. DocTools: Extracting Tracked Changes The Translation and Localization Conference 2017 18
  • 19. DocTools: Extracting Tracked Changes The Translation and Localization Conference 2017 19 ExtractData: a free word add-in developed by Lene Fredborg Some highlights from her website: https://wordaddins.com Established DocTools in 2006. +20 years working professionally with Word and programming add-ins and macros in Visual Basic for Applications (VBA) Developed several add-ins that can function as stand-alone products Via her website, she makes add-ins available to Word users in general Her motto: “Time-saving tools made for you!”
  • 20. DocTools: Extracting Tracked Changes The Translation and Localization Conference 2017 20 A Word add-in that works in Word 2007, Word 2010, Word 2013, and Word 2016 (Windows only). Send a request to get the app, and check the installation instructions, from this link: http://bit.ly/DocTools_Request After installation, you will see a new «DocTools» tab in Word
  • 21. DocTools: Extracting Tracked Changes The Translation and Localization Conference 2017 21
  • 22. Acrobat Reader: Searching in multiple PDFs The Translation and Localization Conference 2017 22 Here’s how to proceed: 1) Save all the PDF files where you would like to search in a single folder. 2) Open one file with Acrobat Reader. 3) Click «Shift+CTRL+F» or choose «Advanced Search» from the Edit menu.
  • 23. Acrobat Reader: The Advanced Search Window The Translation and Localization Conference 2017 23 4) Select “All PDF Documents in”. 5) Navigate to the folder where you saved all your files (Step 1). 6) Type the word(s) to search for in the search box.
  • 24. Acrobat Reader: The Advanced Search Window 24 7) When this window pops up, click “Allow”. 8) After a few seconds, your search results will display in the advanced search window. 9) Click the plus sign (+) to see all results in each file. 10) Click on the result line to jump to the PDF document. The Translation and Localization Conference 2017
  • 25. Acrobat Reader: The Advanced Search Window The Translation and Localization Conference 2017 25 9) Click the plus sign (+) to see all results in each file. 10) Click on the result line to jump to the PDF document.
  • 26. Patricia María Ferreira Larrieux E-mail: patricia.ferreira@language.proz.com Website: www.pmferreira-larrieux.it Linkedin profile: https://www.linkedin.com/in/pmferreiralarrieux/ ProZ profile: http://www.proz.com/profile/4437 Twitter: @PFerreiraLarr The Translation and Localization Conference 2017 26
  • 27. Thank you! The Translation and Localization Conference 2017 27

Notas del editor

  1. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  2. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  3. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  4. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  5. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  6. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  7. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  8. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.
  9. The World Wide Web is a mine of language data of unprecedented richness and ease of access. It is also the only viable source of "disposable" corpora, built ad hoc for a specific purpose (e.g. a translation or interpreting task). These corpora are essential resources for language professionals who routinely work with specialized languages, often in areas where neologisms and new terms are introduced at a fast pace and where standard reference corpora have to be complemented by easy-to-construct, focused, up-to-date text collections.