This presentation will discuss two current crowdsourcing activities initiated with BHL content: Science Gossip (implemented by ConSciCom on top of the Zooniverse platform) and two online games (Beanstalk and Smorball developed by Tiltfactor @ Darmouth)
Engaging the Citizen Scientist in Content Enhancement for BHL
1. Engaging the citizen scientist in
content enhancement for BHL
Trish Rose-Sandler, William Ulate, Max Seidman,
Mary Flanagan, Geoffrey Belknap, Victoria Van
Hyning, Jim O'Donnell
TDWG 2015
Nairobi, Kenya
2. Introduction
This presentation will discuss two current
crowdsourcing activities that the Biodiversity
Heritage Library (BHL) has initiated.
• Zooniverse platform – Science Gossip
• Two online games – Beanstalk and Smorball
6. Why we are doing this?
• Content enhancement.
• We want to search the data in
ways that we currently can't.
• Engage new audiences with BHL
content outside of our own portal
users.
9. Illustrations discovery, tagging and description
• Illustration discovery is the need,
• Illustration tagging is the task and
• Science Gossip is the site in the
Zooniverse platform.
10. Science Gossip – Zooniverse
• Partner with a team in UK called ConSciCom
(Constructing Scientific Communities) that
analyses Citizen Science of the 19th Century
and the 21st Century.
– ConSciCom provided the Subject Expertise,
– BHL provided the content (19th Century
Periodicals) and
– Zooniverse provided the platform (and IT
Support) in which users are asked to classify
and describe natural history illustrations from
19th century periodicals found within the
BHL.
11. • Step 1. Are there illustrations on the page?
• Step 2. Ask users to draw boundaries around an
illustration in the page
• Step 3. Classify it.
• Step 4. Mark species, inscriptions and
contributors
• Step 5. Talk! - A key piece to user engagement!
Quality control. Frequent users turned into
moderators. FAQ creators.
12. Advantages
• Details of Talk – After seeing in action, giving heavy users a place to
commune with other taggers. First time comments around BHL
content from an audience that was mostly new to BHL.
• We had a labor-ready workforce in the Zooniverse platform was
already formed, they had the critical mass community.
• Took a moderate quantity of our own resources and time to develop
because of the possibility of leveraging the platform that already
existed – website up in a couple of months.
• Embedded quality control
• Experience in different fields by each of the stakeholders
17. Thank you
• Trish Rose-Sandler
Principal Investigator
Purposeful Gaming and BHL
• Smorball and Beanstalk were designed as part of the Purposeful Gaming
and BHL project, which explores how digital games can make scanned
content more accessible and searchable for cultural institutions.
• Based at the Missouri Botanical Garden in St. Louis, Missouri, “Purposeful
Gaming and BHL” was established in 2013 through an Institute of Museum
and Library Services (IMLS) grant and includes partners at Harvard
University, Cornell University, and The New York Botanical Garden.
Notas del editor
The first is through the Zooniverse platform in which users are asked to classify and describe natural history illustrations from 19th century periodicals found within the BHL.
The second is through two online games, called Beanstalk and Smorball, which BHL commissioned from Tiltfactor (http://www.tiltfactor.org/), based at Dartmouth College.
The first is through the Zooniverse platform in which users are asked to classify and describe natural history illustrations from 19th century periodicals found within the BHL.
The second is through two online games, called Beanstalk and Smorball, which BHL commissioned from Tiltfactor (http://www.tiltfactor.org/), based at Dartmouth College.
The first is through the Zooniverse platform in which users are asked to classify and describe natural history illustrations from 19th century periodicals found within the BHL.
The second is through two online games, called Beanstalk and Smorball, which BHL commissioned from Tiltfactor (http://www.tiltfactor.org/), based at Dartmouth College.
The first is through the Zooniverse platform in which users are asked to classify and describe natural history illustrations from 19th century periodicals found within the BHL.
What’s the Purpose?
When a book is first digitized, its pages are merely image files and the text cannot be searched. Optical character recognition software (OCR) converts these page images into machine encoded text that can be searched, but historic literature has many idiosyncrasies that inhibit accurate OCR. BHL wanted to harness the power of crowdsourcing and the fun of gaming to allow humans to help correct inaccurate OCR. The games present extracted words from BHL books that users type out, thus verifying the spelling. These submissions are used to correct the OCR in BHL. By presenting users with a high volume of words in rapid succession during each play-through, we can receive a large number of word corrections and achieve a significant level of OCR correction.
Corpus, lots of valuable illustrations hidden, hard to identify specifically, discoverability.
After initially running automatic algorithms for basic identification, that reduced the amount of pages to review, we used crowdsourcing to filter out the false positives and identify more specifically the type of illustration available
The second is through two online games, called Beanstalk and Smorball, which BHL commissioned from Tiltfactor (http://www.tiltfactor.org/), based at Dartmouth College.
Different audiences.