SlideShare una empresa de Scribd logo
1 de 5
Descargar para leer sin conexión
Image access via FlickrThe Biodiversity Heritage
Library (BHL) is....
Art of Life project
More than just a pretty picture: improving the discoverability of
illustrations in the Biodiversity Heritage Library
by Gilbert Borrego, Grace Costantino, Bianca Crowley, Kyle Jaebker, Trish Rose-Sandler
Hidden within BHL literature are
millions of rich illustrations
• An open access digital library
for historic biodiversity literature
• An open data repository of
taxonomic names and
bibliographic information
BHL staff manually identify and push BHL images to a
Flickr stream (www.flickr.com/photos/biodivlibrary) but
the process does not scale to the millions of images
available
The Art of Life project , enabled by a grant from NEH,
aims to automate the process of identifying and
tagging images via algorithms
Users can add tags to images
in Flickr so that they are
searchable. They are also
encouraged to add species
names via machine tags so
BHL can automatically share
these images with the
Encyclopedia of Life
(http://eol.org/collections/53002)
The project defined a metadata schema for natural history
illustrations that will help crowdsource more detailed
descriptions via image portals such as Wikimedia Commons
(http://tinyurl.com/9hm7nsb)
www.biodiversitylibrary.org
Uploading Images to Flickr
The Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org) provides access to thousands of scientific
illustrations through the social media site, Flickr. To expedite the process of uploading these images to Flickr, a
workflow was developed within BHL’s backend database. When paginating, or enhancing a book’s page metada-
ta, staff can click a single button to upload all illustrations within that book to Flickr. Bibliographic information
and a link to the image in BHL are also embedded during the process.
This workflow was internally documented in the form of a tutorial to ensure that all BHL partners can contribute
to this effort and be part of the program’s expanding outreach efforts.
The use of Flickr as an outreach platform exposes our rich image collection to search engines and new users.
Additionally, it allows us to provide images of species to include on the Encyclopedia of Life’s taxon pages. While
the original intention of BHL’ Flickr account was to provide easy access to scientific figures, plates and illustra-
tions, the site has taken on a life of its own and is being repurposed by users all around the world in the most
imaginative ways.
From BHL’s backend dashboard, staff select the
pages to upload to Flickr.
Final view in Flickr.
Once images are uploaded, staff can create sets, add
additional bibliographic information, and assign
sets to collections.
Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary
Learn how you can help add species names to BHL Images:
http://www.flickr.com/groups/encyclopedia_of_life/discuss/72157629515768640/
The Flickr Tagging Process
Crowdsourcing Species Identification and Image Tagging
The Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org), an open access digital library consortium
for biodiversity literature, utilizes Flickr to provide access to thousands of images extracted from its digital
collections. In order to improve discoverability and usability of these images, BHL crowdsources the task of
adding species name machine tags to images in Flickr.
Tags are searchable keywords that users can apply to images in Flickr. Machine tags are specially formatted to be
read by computers: taxonomy:binomial=“Genus species”
BHL encourages its users to identify the species depicted in an image using the book’s image descriptions and
add that species name to the image as a machine tag. By adding these tags to BHL images, users can search
within Flickr for images of specific species and BHL can automatically share these images with the Encyclopedia
of Life (EOL, www.eol.org).
EOL is an open access project dedicated to providing a webpage for every species. EOL harvests machine-tagged
images from the BHL Flickr, uploads them to a BHL Image Collection in EOL, and automatically associates the
images with the matching species page. To date, thousands of machine-tagged images have been added to EOL.
Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary
Learn how you can help add species names to BHL Images:
http://www.flickr.com/groups/encyclopedia_of_life/discuss/72157629515768640/
Find an image in Flickr
Add a species name machine tag
The image is automatically ingested into the BHL Image Collection in EOL
And automatically associated
with the corresponding species
page in EOL
Users clamor for the Art of Life
The Art of Life project evolved out of a need to improve access to the rich corpus of natural history illustrations
hidden within the digitized pages of books and journals in the Biodiversity Heritage Library (BHL,
www.biodiversitylibrary.org). Currently, these illustrations have no descriptive metadata such as title, creator or
subject matter that can be searched. The only way to uncover these gems is by opening up a BHL book or vol-
ume and scrolling through page by page.
One solution has been for BHL staff to manually identify pages that contain illustrations and to push those pages
into a BHL Flickr stream which allows for discovery through themed collections and in some cases species
names. While this approach has resulted in improved access to some of BHL’s illustrations, it requires significant
staff time and the process does not scale well to the millions of images that are present within the BHL pages.
Example of an illustration described using Art of Life schemaIllustration schema elements.
Visit the BHL Flickr today!
http://www.flickr.com/photos/biodivlibrary
Read more about the Art of Life project:
http://biodivlib.wikispaces.com/Art+of+Life
Elements chosen
were a mix of VRA
Core 4.0 and
Darwin Core
Workflow diagram that outlines how each illustration will move through the Art of Life processes.
Thus, the Art of Life project was designed as a solution for automating the process of image identification and
crowdsourcing their descriptions. The project is a partnership between the Missouri Botanical Garden and the
Indianapolis Museum of Art and supported by the National Endowment for the Humanities. It runs from May
2012-April 2014. The Art of Life has five primary objectives: 1) define a metadata schema appropriate for nat-
ural history illustrations, 2) build algorithms to automatically identify BHL pages with illustrations, 3) sort and
classify the illustrations, 4) crowdsource descriptions through tagging applications; and 5) integrate descriptive
metadata back into BHL and share images and descriptions with audiences outside of BHL. These illustrations
will be of interest to a diversity of audiences including: artists; biologists; humanities scholars; librarians; educa-
tors; citizen scientists.
Automating the Heavy Lifting
Using Algorithms to Identify Images in BHL
In the Art of Life project, the Indianapolis Museum of
Art (IMA) and the Biodiversity Heritage Library (BHL,
www.biodiversitylibrary.org) have been working to
develop algorithms to identify images from the pages
of books and journals digitized from the BHL. Multiple
algorithms are being developed including ABBYY
OCR, contrast, color, and compression. These
algorithms are being tested to determine the most
efficient and accurate means of identifying images.
The IMA developed a set of software tools for running
and analyzing the results of the algorithms. This
software allows for the import of publications and
journals determined to be good test samples for the
algorithms. These samples termed the “Gold Standard”
are being used to evaluate the algorithms for how
useful they will be in determining if a scan contains a
sketch or drawing. Using a custom built interface for
reviewing the results, accurate processing results can be
seen as well as false positives. In addition to the visual
review of results, analysis across the entire “Gold Stan-
dard” is ongoing to determine the best combination of
algorithms.
Once completed, the algorithms will be deployed on a
cluster to process the entire BHL collection. After the
processing has been completed the metadata will be
used to add additional descriptive and finding aides.
This will allow users to discover and process
illustrations from the books and journals that used to
be very hard to discover.
Visit the BHL Flickr today!
http://www.flickr.com/photos/biodivlibrary
Read more about the Art of Life project:
http://biodivlib.wikispaces.com/Art+of+Life
Learn how you can help add species names to
BHL Images:
http://www.flickr.com/groups/encyclopedia_of_life/
discuss/72157629515768640/
Algorithm Results Viewer
Compression Ratio Algorithm Analysis
Close-up Algorithm Result

Más contenido relacionado

Similar a More than just a pretty picture: improving the discoverability of illustrations in the Biodiversity Heritage Library

Biodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and processBiodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and processPhil Cryer
 
BHL Developments thru Jan-Sep 2009
BHL Developments thru Jan-Sep 2009BHL Developments thru Jan-Sep 2009
BHL Developments thru Jan-Sep 2009Chris Freeland
 
BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - PragueChris Freeland
 
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...Bianca Crowley
 
Crowdsourcing your cultural heritage collections: considerations when choosi...
Crowdsourcing your cultural heritage collections:  considerations when choosi...Crowdsourcing your cultural heritage collections:  considerations when choosi...
Crowdsourcing your cultural heritage collections: considerations when choosi...Trish Rose-Sandler
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHLWilliam Ulate
 
BHL / EOL technology sit down
BHL / EOL technology sit downBHL / EOL technology sit down
BHL / EOL technology sit downChris Freeland
 
Hekman van vliet commons paper
Hekman van vliet commons paperHekman van vliet commons paper
Hekman van vliet commons paperArthurBrussee
 
Finding Images Workshop for Graduate Students
Finding Images Workshop for Graduate StudentsFinding Images Workshop for Graduate Students
Finding Images Workshop for Graduate StudentsJenna Rinalducci
 
Collection assessment in a collaborative environment: Biodiversity Heritage L...
Collection assessment in a collaborative environment: Biodiversity Heritage L...Collection assessment in a collaborative environment: Biodiversity Heritage L...
Collection assessment in a collaborative environment: Biodiversity Heritage L...Connie Rinaldo
 
Creating An Ideal Image Library Website And Database
Creating An Ideal Image Library Website And DatabaseCreating An Ideal Image Library Website And Database
Creating An Ideal Image Library Website And Databasebecky bristol
 
Next Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounNext Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounKaren S Calhoun
 
The Biodiversity Heritage Library and bibliographic citations: towards new u...
The Biodiversity Heritage Library and bibliographic citations: towards new u...The Biodiversity Heritage Library and bibliographic citations: towards new u...
The Biodiversity Heritage Library and bibliographic citations: towards new u...Trish Rose-Sandler
 
Handout for Tagging and User-Contributed Metadata
Handout for Tagging and User-Contributed MetadataHandout for Tagging and User-Contributed Metadata
Handout for Tagging and User-Contributed MetadataJenn Riley
 
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Ross Mounce
 
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...Martin Kalfatovic
 
Biodiversity Heritage Library & JSTOR Plant Science
Biodiversity Heritage Library & JSTOR Plant ScienceBiodiversity Heritage Library & JSTOR Plant Science
Biodiversity Heritage Library & JSTOR Plant ScienceChris Freeland
 
Biodiversity Heritage Library: Content liberator
Biodiversity Heritage Library: Content liberatorBiodiversity Heritage Library: Content liberator
Biodiversity Heritage Library: Content liberatorKeri Thompson
 

Similar a More than just a pretty picture: improving the discoverability of illustrations in the Biodiversity Heritage Library (20)

Biodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and processBiodiversity Heritiage Library: progress and process
Biodiversity Heritiage Library: progress and process
 
MLA Handout 2012
MLA Handout 2012MLA Handout 2012
MLA Handout 2012
 
BHL @ #TDWG09
BHL @ #TDWG09BHL @ #TDWG09
BHL @ #TDWG09
 
BHL Developments thru Jan-Sep 2009
BHL Developments thru Jan-Sep 2009BHL Developments thru Jan-Sep 2009
BHL Developments thru Jan-Sep 2009
 
BHL Developments - Prague
BHL Developments - PragueBHL Developments - Prague
BHL Developments - Prague
 
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...
Library 2.013: Greasing Squeaky Wheels: A User-Centered Approach to Collectio...
 
Crowdsourcing your cultural heritage collections: considerations when choosi...
Crowdsourcing your cultural heritage collections:  considerations when choosi...Crowdsourcing your cultural heritage collections:  considerations when choosi...
Crowdsourcing your cultural heritage collections: considerations when choosi...
 
Bibliographic References in BHL
Bibliographic References in BHLBibliographic References in BHL
Bibliographic References in BHL
 
BHL / EOL technology sit down
BHL / EOL technology sit downBHL / EOL technology sit down
BHL / EOL technology sit down
 
Hekman van vliet commons paper
Hekman van vliet commons paperHekman van vliet commons paper
Hekman van vliet commons paper
 
Finding Images Workshop for Graduate Students
Finding Images Workshop for Graduate StudentsFinding Images Workshop for Graduate Students
Finding Images Workshop for Graduate Students
 
Collection assessment in a collaborative environment: Biodiversity Heritage L...
Collection assessment in a collaborative environment: Biodiversity Heritage L...Collection assessment in a collaborative environment: Biodiversity Heritage L...
Collection assessment in a collaborative environment: Biodiversity Heritage L...
 
Creating An Ideal Image Library Website And Database
Creating An Ideal Image Library Website And DatabaseCreating An Ideal Image Library Website And Database
Creating An Ideal Image Library Website And Database
 
Next Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 CalhounNext Generation Technical Services May 2009 Calhoun
Next Generation Technical Services May 2009 Calhoun
 
The Biodiversity Heritage Library and bibliographic citations: towards new u...
The Biodiversity Heritage Library and bibliographic citations: towards new u...The Biodiversity Heritage Library and bibliographic citations: towards new u...
The Biodiversity Heritage Library and bibliographic citations: towards new u...
 
Handout for Tagging and User-Contributed Metadata
Handout for Tagging and User-Contributed MetadataHandout for Tagging and User-Contributed Metadata
Handout for Tagging and User-Contributed Metadata
 
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
Liberating OA figures from PDF to Flickr (A Pro-iBiosphere talk)
 
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...
The Biodiversity Heritage Library & Botany: Empowering Discovery through Free...
 
Biodiversity Heritage Library & JSTOR Plant Science
Biodiversity Heritage Library & JSTOR Plant ScienceBiodiversity Heritage Library & JSTOR Plant Science
Biodiversity Heritage Library & JSTOR Plant Science
 
Biodiversity Heritage Library: Content liberator
Biodiversity Heritage Library: Content liberatorBiodiversity Heritage Library: Content liberator
Biodiversity Heritage Library: Content liberator
 

Más de Trish Rose-Sandler

Botanists and annotations: use cases and their relevance for the larger scie...
Botanists and annotations:  use cases and their relevance for the larger scie...Botanists and annotations:  use cases and their relevance for the larger scie...
Botanists and annotations: use cases and their relevance for the larger scie...Trish Rose-Sandler
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Trish Rose-Sandler
 
The history of biodiversity through words and pictures
The history of biodiversity through words and picturesThe history of biodiversity through words and pictures
The history of biodiversity through words and picturesTrish Rose-Sandler
 
The Art of Life: merging the worlds of art and science
The Art of Life:  merging the worlds of art and scienceThe Art of Life:  merging the worlds of art and science
The Art of Life: merging the worlds of art and scienceTrish Rose-Sandler
 
Special libraries association meeting march 2014
Special libraries association meeting march 2014Special libraries association meeting march 2014
Special libraries association meeting march 2014Trish Rose-Sandler
 
Finding a goldmine of natural history illustrations within BHL texts: the Ar...
Finding a goldmine of natural history illustrations within BHL texts:  the Ar...Finding a goldmine of natural history illustrations within BHL texts:  the Ar...
Finding a goldmine of natural history illustrations within BHL texts: the Ar...Trish Rose-Sandler
 
Breathing new life into old data - How opening your collection can spark imag...
Breathing new life into old data - How opening your collection can spark imag...Breathing new life into old data - How opening your collection can spark imag...
Breathing new life into old data - How opening your collection can spark imag...Trish Rose-Sandler
 
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...Trish Rose-Sandler
 
Reach Out! Opportunities for the Visual Resource Center
Reach Out!  Opportunities for the Visual Resource CenterReach Out!  Opportunities for the Visual Resource Center
Reach Out! Opportunities for the Visual Resource CenterTrish Rose-Sandler
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeTrish Rose-Sandler
 
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...Trish Rose-Sandler
 

Más de Trish Rose-Sandler (12)

Botanists and annotations: use cases and their relevance for the larger scie...
Botanists and annotations:  use cases and their relevance for the larger scie...Botanists and annotations:  use cases and their relevance for the larger scie...
Botanists and annotations: use cases and their relevance for the larger scie...
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
 
The history of biodiversity through words and pictures
The history of biodiversity through words and picturesThe history of biodiversity through words and pictures
The history of biodiversity through words and pictures
 
The Art of Life: merging the worlds of art and science
The Art of Life:  merging the worlds of art and scienceThe Art of Life:  merging the worlds of art and science
The Art of Life: merging the worlds of art and science
 
Special libraries association meeting march 2014
Special libraries association meeting march 2014Special libraries association meeting march 2014
Special libraries association meeting march 2014
 
Finding a goldmine of natural history illustrations within BHL texts: the Ar...
Finding a goldmine of natural history illustrations within BHL texts:  the Ar...Finding a goldmine of natural history illustrations within BHL texts:  the Ar...
Finding a goldmine of natural history illustrations within BHL texts: the Ar...
 
Breathing new life into old data - How opening your collection can spark imag...
Breathing new life into old data - How opening your collection can spark imag...Breathing new life into old data - How opening your collection can spark imag...
Breathing new life into old data - How opening your collection can spark imag...
 
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...
Revealing and Contextualizing the treasures of the Biodiversity Heritage Libr...
 
Reach Out! Opportunities for the Visual Resource Center
Reach Out!  Opportunities for the Visual Resource CenterReach Out!  Opportunities for the Visual Resource Center
Reach Out! Opportunities for the Visual Resource Center
 
The Art of Life project
The Art of Life projectThe Art of Life project
The Art of Life project
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and Practice
 
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...
Crowd-sourcing the creation of "articles" within the Biodiversity Heritage Li...
 

Último

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 

Último (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

More than just a pretty picture: improving the discoverability of illustrations in the Biodiversity Heritage Library

  • 1. Image access via FlickrThe Biodiversity Heritage Library (BHL) is.... Art of Life project More than just a pretty picture: improving the discoverability of illustrations in the Biodiversity Heritage Library by Gilbert Borrego, Grace Costantino, Bianca Crowley, Kyle Jaebker, Trish Rose-Sandler Hidden within BHL literature are millions of rich illustrations • An open access digital library for historic biodiversity literature • An open data repository of taxonomic names and bibliographic information BHL staff manually identify and push BHL images to a Flickr stream (www.flickr.com/photos/biodivlibrary) but the process does not scale to the millions of images available The Art of Life project , enabled by a grant from NEH, aims to automate the process of identifying and tagging images via algorithms Users can add tags to images in Flickr so that they are searchable. They are also encouraged to add species names via machine tags so BHL can automatically share these images with the Encyclopedia of Life (http://eol.org/collections/53002) The project defined a metadata schema for natural history illustrations that will help crowdsource more detailed descriptions via image portals such as Wikimedia Commons (http://tinyurl.com/9hm7nsb) www.biodiversitylibrary.org
  • 2. Uploading Images to Flickr The Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org) provides access to thousands of scientific illustrations through the social media site, Flickr. To expedite the process of uploading these images to Flickr, a workflow was developed within BHL’s backend database. When paginating, or enhancing a book’s page metada- ta, staff can click a single button to upload all illustrations within that book to Flickr. Bibliographic information and a link to the image in BHL are also embedded during the process. This workflow was internally documented in the form of a tutorial to ensure that all BHL partners can contribute to this effort and be part of the program’s expanding outreach efforts. The use of Flickr as an outreach platform exposes our rich image collection to search engines and new users. Additionally, it allows us to provide images of species to include on the Encyclopedia of Life’s taxon pages. While the original intention of BHL’ Flickr account was to provide easy access to scientific figures, plates and illustra- tions, the site has taken on a life of its own and is being repurposed by users all around the world in the most imaginative ways. From BHL’s backend dashboard, staff select the pages to upload to Flickr. Final view in Flickr. Once images are uploaded, staff can create sets, add additional bibliographic information, and assign sets to collections. Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary Learn how you can help add species names to BHL Images: http://www.flickr.com/groups/encyclopedia_of_life/discuss/72157629515768640/
  • 3. The Flickr Tagging Process Crowdsourcing Species Identification and Image Tagging The Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org), an open access digital library consortium for biodiversity literature, utilizes Flickr to provide access to thousands of images extracted from its digital collections. In order to improve discoverability and usability of these images, BHL crowdsources the task of adding species name machine tags to images in Flickr. Tags are searchable keywords that users can apply to images in Flickr. Machine tags are specially formatted to be read by computers: taxonomy:binomial=“Genus species” BHL encourages its users to identify the species depicted in an image using the book’s image descriptions and add that species name to the image as a machine tag. By adding these tags to BHL images, users can search within Flickr for images of specific species and BHL can automatically share these images with the Encyclopedia of Life (EOL, www.eol.org). EOL is an open access project dedicated to providing a webpage for every species. EOL harvests machine-tagged images from the BHL Flickr, uploads them to a BHL Image Collection in EOL, and automatically associates the images with the matching species page. To date, thousands of machine-tagged images have been added to EOL. Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary Learn how you can help add species names to BHL Images: http://www.flickr.com/groups/encyclopedia_of_life/discuss/72157629515768640/ Find an image in Flickr Add a species name machine tag The image is automatically ingested into the BHL Image Collection in EOL And automatically associated with the corresponding species page in EOL
  • 4. Users clamor for the Art of Life The Art of Life project evolved out of a need to improve access to the rich corpus of natural history illustrations hidden within the digitized pages of books and journals in the Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org). Currently, these illustrations have no descriptive metadata such as title, creator or subject matter that can be searched. The only way to uncover these gems is by opening up a BHL book or vol- ume and scrolling through page by page. One solution has been for BHL staff to manually identify pages that contain illustrations and to push those pages into a BHL Flickr stream which allows for discovery through themed collections and in some cases species names. While this approach has resulted in improved access to some of BHL’s illustrations, it requires significant staff time and the process does not scale well to the millions of images that are present within the BHL pages. Example of an illustration described using Art of Life schemaIllustration schema elements. Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary Read more about the Art of Life project: http://biodivlib.wikispaces.com/Art+of+Life Elements chosen were a mix of VRA Core 4.0 and Darwin Core Workflow diagram that outlines how each illustration will move through the Art of Life processes. Thus, the Art of Life project was designed as a solution for automating the process of image identification and crowdsourcing their descriptions. The project is a partnership between the Missouri Botanical Garden and the Indianapolis Museum of Art and supported by the National Endowment for the Humanities. It runs from May 2012-April 2014. The Art of Life has five primary objectives: 1) define a metadata schema appropriate for nat- ural history illustrations, 2) build algorithms to automatically identify BHL pages with illustrations, 3) sort and classify the illustrations, 4) crowdsource descriptions through tagging applications; and 5) integrate descriptive metadata back into BHL and share images and descriptions with audiences outside of BHL. These illustrations will be of interest to a diversity of audiences including: artists; biologists; humanities scholars; librarians; educa- tors; citizen scientists.
  • 5. Automating the Heavy Lifting Using Algorithms to Identify Images in BHL In the Art of Life project, the Indianapolis Museum of Art (IMA) and the Biodiversity Heritage Library (BHL, www.biodiversitylibrary.org) have been working to develop algorithms to identify images from the pages of books and journals digitized from the BHL. Multiple algorithms are being developed including ABBYY OCR, contrast, color, and compression. These algorithms are being tested to determine the most efficient and accurate means of identifying images. The IMA developed a set of software tools for running and analyzing the results of the algorithms. This software allows for the import of publications and journals determined to be good test samples for the algorithms. These samples termed the “Gold Standard” are being used to evaluate the algorithms for how useful they will be in determining if a scan contains a sketch or drawing. Using a custom built interface for reviewing the results, accurate processing results can be seen as well as false positives. In addition to the visual review of results, analysis across the entire “Gold Stan- dard” is ongoing to determine the best combination of algorithms. Once completed, the algorithms will be deployed on a cluster to process the entire BHL collection. After the processing has been completed the metadata will be used to add additional descriptive and finding aides. This will allow users to discover and process illustrations from the books and journals that used to be very hard to discover. Visit the BHL Flickr today! http://www.flickr.com/photos/biodivlibrary Read more about the Art of Life project: http://biodivlib.wikispaces.com/Art+of+Life Learn how you can help add species names to BHL Images: http://www.flickr.com/groups/encyclopedia_of_life/ discuss/72157629515768640/ Algorithm Results Viewer Compression Ratio Algorithm Analysis Close-up Algorithm Result