SlideShare una empresa de Scribd logo
1 de 40
Crowdsourcing Transcription
with Open Source Software
Ben Brumfield
MAC Fall Symposium 2013
Why Transcribe?

Crowdsourcing can be
− Tagging
− Georectification
− Identification

But if you've got scanned documents, you've got
a problem
Serendipity: One Volunteer's Story
Nat Wooding
– Semi-retired data analyst
– 200 pages of Julia Brumfield's 1923 diary in nine
months
– No relation to diarist
Serendipity: One Volunteer's Story
Nat Wooding
– Semi-retired data analyst
– 200 pages of Julia Brumfield's 1923 diary in nine
months
– No relation to diarist
– Great uncle was diarist's letter carrier, also
named Nat Wooding
Why Crowdsource?
Free Labor!
Why Crowdsource?
Free Labor!
“Free as in beer”
“Free as in speech”
“Free as in....
Free as in puppy!
http://www.flickr.com/photos/magnusbrath/7614518858/
Why Crowdsource?
“At its best, crowdsourcing is not about
getting someone to do work for you, it is
about offering your users the
opportunity to participate in public
memory.”
– Trevor Owens, “Crowdsourcing Cultural Heritage:
The Objectives are Upside-down”
Why Crowdsource?
“By engaging the public in digitising our
collections, we are
− Increasing the scientific literacy of the public
− Providing increased access to our collections
− Building an advocacy network for our collections
and our institutions.”
– Paul Flemons, Australian Museum
Why Crowdsource?

Convert website visitors into volunteers

Convert volunteers into advocates

What's next?
Questions?
Choosing a Transcription Platform

The good news:
– More than 30 tools to choose from!
Choosing a Transcription Platform

The good news:
– More than 30 tools to choose from!

The bad news:
– More than 30 tools to choose from!
Selection Factors
● Source Material
● Transcript Purpose
● Organizational/Project Management Fit
● Financial and Technical Resources
Source Material
● Is it of interest to anyone else?
● Is it under copyright?
● Does it need restricted access?
● Is it composed of “text” or “records”?
● How complex is the layout? How
important is that layout?
Purpose
•How will you be using the transcribed data?
– Traditional print editions
– Searchable online editions
•Do you want to use the system to analyze
the text?
•Do you need to import the transcripts into
other systems?
•Is public engagement the only goal?
Organizational Fit
•How important is traditional editorial
workflow?
•Will you rely on volunteers? How will you
find and motivate them?
•What is the duration of the project?
•Is there a "final version"?
•Is TEI a mandate?
Financial and Technical Resources
•System administrators to install non-hosted
software?
•Money to pay hosting costs?
•Programming skills to customize a tool?
•Money to pay programmers for
customization?
•Support for on-going costs to keep the site
running, however small?
The Tools
● Recent (oldest started in 2005)
● Influenced by origin
● Still pretty raw
● Most require tech expertise for set-up and
customization
● All require making trade-offs
http://tinyurl.com/TranscriptionToolGDoc
Open-source, On-site Tools
Scripto
Bentham Transcription Desk
NARA Transcribr Drupal Module
Zooniverse Scribe
Quick Definitions
MediaWiki: Popular software framework for
runnning wiki projects
Wikipedia, Wikisource, Wiktionary, Wikitravel:
Projects running on MediaWiki
WikiMedia: Organization running many—but not
all—MediaWiki-based wiki projects.
Hosted Tools
Virtual Transcription Laboratory
Wikisource.org
FromThePage.com
Virtual Transcription Laboratory
Virtual Transcription Laboratory
Wikisource
Live demo of State Library of Queensland on
Wikisource showing project page, edit screen,
and editorial workflow.
Recommendation of Lori and the GLAMWiki
group to help organizations navigate the
community.
FromThePage
Live demo of FromThePage showing edit
screen, wiki-linking a single term, read pages
for a subject, full-text search on name variants,
and auto-link.
Thanks!
Ben Brumfield
benwbrum@gmail.com
@benwbrum
http://manuscripttranscription.blogspot.com
My transcription tools:
– FromThePage.com
– OpenSourceIndexing.org
http://tinyurl.com/TranscriptionToolGDoc
Crowdsourcing Transcription with Open Source Software

Más contenido relacionado

Similar a Crowdsourcing Transcription with Open Source Software

Geek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceGeek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceRussell Pavlicek
 
F+ presentation public en
F+ presentation public enF+ presentation public en
F+ presentation public enSergiy Gladkyy
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Anselm Hook
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 KeynotePeter Wang
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereJ T "Tom" Johnson
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Marianijessicamariani
 
The Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationThe Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationMargaret-Anne Storey
 
Why Computer Science is a Great Choice
Why Computer Science is a Great ChoiceWhy Computer Science is a Great Choice
Why Computer Science is a Great Choiceturingfan
 
Accessibility & Universal Design
Accessibility & Universal DesignAccessibility & Universal Design
Accessibility & Universal DesignSrutiVijaykumar
 
Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Tim O'Reilly
 
What is open source?
What is open source?What is open source?
What is open source?Ahmet Bulut
 
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriCommunity, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriDemi Ben-Ari
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected FacilityRyan Duggan
 
Open source for Libraries
Open source for LibrariesOpen source for Libraries
Open source for LibrariesNicole Baratta
 
Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)Dorothea Salo
 
Open Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesOpen Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesNicole Baratta
 
Of Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryOf Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryIndranil Das Gupta
 
Digital Libraries and the quest for information curation
Digital Libraries and the quest for information curationDigital Libraries and the quest for information curation
Digital Libraries and the quest for information curationLuis Borges Gouveia
 

Similar a Crowdsourcing Transcription with Open Source Software (20)

Geek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open SourceGeek Empowerment - The Real Heart of Open Source
Geek Empowerment - The Real Heart of Open Source
 
F+ presentation public en
F+ presentation public enF+ presentation public en
F+ presentation public en
 
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
Ubiquitous Angels; ambient sensor networks to crowd source crisis response an...
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 Keynote
 
Analytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the DatasphereAnalytic Journalism: Digital Evolution in the Datasphere
Analytic Journalism: Digital Evolution in the Datasphere
 
HEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_MarianiHEL_Data_Journalism_Jessica_Mariani
HEL_Data_Journalism_Jessica_Mariani
 
Evc2014
Evc2014Evc2014
Evc2014
 
The Elusive Nature of Software Documentation
The Elusive Nature of Software DocumentationThe Elusive Nature of Software Documentation
The Elusive Nature of Software Documentation
 
Why Computer Science is a Great Choice
Why Computer Science is a Great ChoiceWhy Computer Science is a Great Choice
Why Computer Science is a Great Choice
 
Accessibility & Universal Design
Accessibility & Universal DesignAccessibility & Universal Design
Accessibility & Universal Design
 
Ficod 2011 (keynote file)
Ficod 2011 (keynote file)Ficod 2011 (keynote file)
Ficod 2011 (keynote file)
 
What is open source?
What is open source?What is open source?
What is open source?
 
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-AriCommunity, Unifying the Geeks to Create Value - Demi Ben-Ari
Community, Unifying the Geeks to Create Value - Demi Ben-Ari
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected Facility
 
Open source for Libraries
Open source for LibrariesOpen source for Libraries
Open source for Libraries
 
Open Source for Libraries
Open Source for LibrariesOpen Source for Libraries
Open Source for Libraries
 
Open Sesame (and other open movements)
Open Sesame (and other open movements)Open Sesame (and other open movements)
Open Sesame (and other open movements)
 
Open Your Mind: Open Source in Libraries
Open Your Mind: Open Source in LibrariesOpen Your Mind: Open Source in Libraries
Open Your Mind: Open Source in Libraries
 
Of Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the LibraryOf Dodos, 'Karma' & Free Software in the Library
Of Dodos, 'Karma' & Free Software in the Library
 
Digital Libraries and the quest for information curation
Digital Libraries and the quest for information curationDigital Libraries and the quest for information curation
Digital Libraries and the quest for information curation
 

Último

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Último (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Crowdsourcing Transcription with Open Source Software

  • 1. Crowdsourcing Transcription with Open Source Software Ben Brumfield MAC Fall Symposium 2013
  • 2. Why Transcribe?  Crowdsourcing can be − Tagging − Georectification − Identification  But if you've got scanned documents, you've got a problem
  • 3.
  • 4. Serendipity: One Volunteer's Story Nat Wooding – Semi-retired data analyst – 200 pages of Julia Brumfield's 1923 diary in nine months – No relation to diarist
  • 5. Serendipity: One Volunteer's Story Nat Wooding – Semi-retired data analyst – 200 pages of Julia Brumfield's 1923 diary in nine months – No relation to diarist – Great uncle was diarist's letter carrier, also named Nat Wooding
  • 6.
  • 7.
  • 9. Why Crowdsource? Free Labor! “Free as in beer” “Free as in speech” “Free as in....
  • 10. Free as in puppy! http://www.flickr.com/photos/magnusbrath/7614518858/
  • 11. Why Crowdsource? “At its best, crowdsourcing is not about getting someone to do work for you, it is about offering your users the opportunity to participate in public memory.” – Trevor Owens, “Crowdsourcing Cultural Heritage: The Objectives are Upside-down”
  • 12.
  • 13.
  • 14. Why Crowdsource? “By engaging the public in digitising our collections, we are − Increasing the scientific literacy of the public − Providing increased access to our collections − Building an advocacy network for our collections and our institutions.” – Paul Flemons, Australian Museum
  • 15. Why Crowdsource?  Convert website visitors into volunteers  Convert volunteers into advocates  What's next?
  • 17. Choosing a Transcription Platform  The good news: – More than 30 tools to choose from!
  • 18. Choosing a Transcription Platform  The good news: – More than 30 tools to choose from!  The bad news: – More than 30 tools to choose from!
  • 19. Selection Factors ● Source Material ● Transcript Purpose ● Organizational/Project Management Fit ● Financial and Technical Resources
  • 20. Source Material ● Is it of interest to anyone else? ● Is it under copyright? ● Does it need restricted access? ● Is it composed of “text” or “records”? ● How complex is the layout? How important is that layout?
  • 21. Purpose •How will you be using the transcribed data? – Traditional print editions – Searchable online editions •Do you want to use the system to analyze the text? •Do you need to import the transcripts into other systems? •Is public engagement the only goal?
  • 22. Organizational Fit •How important is traditional editorial workflow? •Will you rely on volunteers? How will you find and motivate them? •What is the duration of the project? •Is there a "final version"? •Is TEI a mandate?
  • 23. Financial and Technical Resources •System administrators to install non-hosted software? •Money to pay hosting costs? •Programming skills to customize a tool? •Money to pay programmers for customization? •Support for on-going costs to keep the site running, however small?
  • 24. The Tools ● Recent (oldest started in 2005) ● Influenced by origin ● Still pretty raw ● Most require tech expertise for set-up and customization ● All require making trade-offs http://tinyurl.com/TranscriptionToolGDoc
  • 25. Open-source, On-site Tools Scripto Bentham Transcription Desk NARA Transcribr Drupal Module Zooniverse Scribe
  • 26. Quick Definitions MediaWiki: Popular software framework for runnning wiki projects Wikipedia, Wikisource, Wiktionary, Wikitravel: Projects running on MediaWiki WikiMedia: Organization running many—but not all—MediaWiki-based wiki projects.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34. Hosted Tools Virtual Transcription Laboratory Wikisource.org FromThePage.com
  • 37. Wikisource Live demo of State Library of Queensland on Wikisource showing project page, edit screen, and editorial workflow. Recommendation of Lori and the GLAMWiki group to help organizations navigate the community.
  • 38. FromThePage Live demo of FromThePage showing edit screen, wiki-linking a single term, read pages for a subject, full-text search on name variants, and auto-link.
  • 39. Thanks! Ben Brumfield benwbrum@gmail.com @benwbrum http://manuscripttranscription.blogspot.com My transcription tools: – FromThePage.com – OpenSourceIndexing.org http://tinyurl.com/TranscriptionToolGDoc