A talk presented January 20, 2013 in the Indo-US Joint Workshop on Biodiversity Informatics at the Ashoka Trust for Research in Ecology and the Environment in Bangalore, India.
$ Love Spells^ 💎 (310) 882-6330 in West Virginia, WV | Psychic Reading Best B...
Building EOL species pages
1. Building and aggregating species pages
Cynthia Parr, PhD @cydparr
Species Pages Group @eol
Chief Scientist
2. Goals
1. To provide some practical information about
tools for capturing and sharing species
information
2. To get feedback on EOL tools and processes
for future improvement
3. To discuss good ideas that can inspire each
other as we work both together and
separately
3. Show of hands
• Do you contribute to a website(s) already?
• Do you run your own website?
• Would you rather just publish scientific
papers?
4. Outline
• EOL is part of a sharing network
• Standards for sharing species descriptions
• Tools for capturing content
– Direct on EOL
– Multimedia repositories
– Wikis
– Literature
– Specialist web sites
• Sharing content with EOL
• Curating EOL content
• Using content from EOL
5. EOL is part of a rich data sharing ecosystem about
living organisms
From Parr et al. 2012. Evolutionary informatics: unifying knowledge about the diversity of life.
Trends in Ecology and Evolution 26: 94-103.
10. Use your own classification standard
You do NOT have to follow a designated classification
Share your taxonomic hierarchy with your EOL content
EOL does its best to match the content with pages we
already have
Contribute to COL and ITIS (and their partners)
Then all projects will have access
11. Outline
• EOL is part of a sharing network
• Standards for sharing species descriptions
• Tools for capturing content
– Direct on EOL
– Multimedia repositories
– Wikis
– Literature
– Specialist web sites
• Sharing content with EOL
• Curating EOL content
• Using content from EOL
15. Choosing a subject determines where
on the EOL page the information is
displayed
Also important for other applications
that want to use the content
17. If you have a lot of content
• Work with a hub like India Biodiversity Portal
• Spreadsheet upload to EOL
• Find a specialist website that is already a
partner with EOL
26. Wikipedia/Wikimedia
BUT
Only the English version, starts unreviewed
Highly formatted, not always consistent
Not always easy to review
About 150K taxa, 122K images
Easy to add new taxa
28. Pensoft.net
• Pensoft has a process to
generate EOL-compliant
XML for new species
• Also sends images to
Morphbank, specimens
to GBIF
• They registered the URL
at EOL
• Our script checks for
changes once a day
• EOL Open Access Fund
33. Outline
• EOL is part of a sharing network
• Standards for sharing species descriptions
• Tools for capturing content
– Direct on EOL
– Multimedia repositories
– Wikis
– Literature
– Specialist web sites
• Sharing content with EOL
• Curating EOL content
• Using content from EOL
34. Working with existing EOL partner?
• Using their tools
Add content
Follow • May need to add special tags
• May need to change licenses
instructions • May need to add to an EOL group
• Your content will be updated
That’s all! automatically whenever we harvest
from that partner
35. If you already have your own website
(or spreadsheet)
• Create EOL member account
Register on EOL • Add content partner with contact info
Prepare resource • Spreadsheet OR Darwin Core Archive
• EOL will work with you
file
• Provide file for one time upload
Add resource file • Or provide URL and set harvest frequency
• Notifications of traffic statistics
Receive feedback • Curation actions
• Comments (Please respond!)
44. Outline
• EOL is part of a sharing network for species
• Standards for sharing species descriptions
• Tools for capturing content
– Direct on EOL
– Multimedia repositories
– Wikis
– Literature
– Specialist web sites
• Sharing content with EOL
• Curating EOL content
• Using content from EOL
45. EOL curation http://eol.org/info/curators
• Trust, untrust, hide taxon associations
• Add new taxon association
• Leave a comment
• Add a rating
46. • Set preferred classifications
• Add and set preferred
common names
More info at:
http://eol.org/info/curators
47. http://eol.org/info/curators
• Register as a member at eol.org
• Edit your profile and provide your credentials
– For assistant curator, just your real name
– For full curator need academic affiliation OR
publication reference OR referral from another full
curator
• Curate only as much as you are able
48. Outline
• EOL is part of a sharing network
• Standards for sharing species descriptions
• Tools for capturing content
– Direct on EOL
– Multimedia repositories
– Wikis
– Literature
– Specialist web sites
• Sharing content with EOL
• Curating EOL content
• Using content from EOL
49. Using content from EOL
Manually API
No need to ask
Docs at http://eol.org/api
permission
Follow license
conditions Register for an API key
Provide attribution Follow terms of use
Set up either a dynamic
query or refresh a cache of
results
50. Questions?
• Contact our partners at the India Biodiversity Portal
• Contact me at parrc@si.edu
• Use the contact us form on EOL
flickr.com/groups/encyclopediaoflife
inaturalist.org, morphbank.org
biodiversitylibrary.org
pensoft.net, plazi.org
scratchpads.eu
eol.org/info/cp_getting_started
eol.org/info/curators
eol.org/api
Notas del editor
Includes genetic information, geospatial information, descriptive information, evolutionary information, and published literature.Some of these are using ontologies and we are integrating with them in interesting ways.Box 3. Linking and sharing biodiversity dataA vast ecosystem of projects is already managing and, to somedegree, sharing information about living organisms. Figure Ishows an incompletely known network of biodiversity-relateddatabases. Two kinds of hub are apparent: hubs connectingdifferent kinds of data (e.g. EOL) and hubs that connect manypartners and subnetworks sharing the same kind of data (e.g. GBIF,BHL and Catalogue of Life). Most hubs have received majorfunding from research and charitable foundations and activelyfacilitate use of data-sharing standards, such as Darwin Core.Some non-hub projects have been leaders in connecting to manyhubs; these projects enjoy effective individual and/or institutionalsupport for data sharing. Such an analysis provides: (i) a baselineunderstanding of redundancy and resiliency for long-term dataaccess; (ii) characterization of the environment for distributedannotation and quality control; and (iii) identification of isolatedprojects and appropriate mechanisms for connecting them to thenetwork. A more complete analysis, including the volume of dataflow, could enable real-time assessment of network health. Theinformatics community assumes that this ‘eBiosphere’ [75] haspositive impacts on progress in evolutionary and biodiversityscience as well as societal challenges, but demonstrating theseimpacts remains a challenge.Figure I. An incompletely known network of biodiversity-related databases. Dots represent 1631 projects, clustered and color-coded by the Clauset-Newman-Moore algorithm[76] using NodeXL [77], with 1704 instances of data sharing (re-use of the same data), deep hyperlinking (e.g. a page for a taxon in one online database links to appropriateresources in another online database), or indexing. The size of dot reflects the degree (i.e. number of links); links between hubs (degree >15) are enlarged for emphasis.OBIS = Ocean Biogeographic Information System, WoRMS = World Register of Marine Species, AKN = Avian Knowledge Network. See main text and Box 1 for otheracronyms. Projects included Encyclopedia of Life (EOL), a leading online aggregator of descriptive information about all organisms, its partners and their partners, and 588projects registered in the Taxonomic Databases Working Group database. The cluster of unconnected dots indicates that no linkages are known among hundreds of projects.